Model-assisted Estimation of Forest Resources with Generalized Additive Models

Size: px
Start display at page:

Download "Model-assisted Estimation of Forest Resources with Generalized Additive Models"

Transcription

1 Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean D. Opsomer, F. Jay Breidt, Gretchen G. Moisen, and Göran Kauermann March 26, 2003 Abstract Multi-phase surveys are often conducted in forest inventory, with the goal of estimating forested area and tree characteristics over large regions. This article describes how design-based estimation of such quantities, based on information gathered during ground visits of sampled plots, can be made more precise by incorporating auxiliary information available from remote sensing. The relationship between the ground visit measurements and the remote sensing variables is modelled using generalized additive models. Nonparametric estimators for these models are discussed and applied to forest data collected in the mountains of northern Utah in the United States. Model-assisted estimators that utilize the nonparametric regression fits are proposed for these data. The design context of this study is two-phase systematic sampling from a spatial continuum, under which properties of model-assisted estimators are derived. Difficulties with the standard variance estimation approach, which assumes simple random sampling in each phase, are described. An alternative assessment of estimator performance based on simulation is implemented. The simulation provides strong evidence that using the model predictions in a model-assisted survey estimation procedure results in substantial efficiency improvements over current estimation approaches. KEY WORDS: multi-phase survey estimation, nonparametric regression, local scoring, calibration, systematic sampling, variance estimation. Jean D. Opsomer is Associate Professor, Department of Statistics, Iowa State University, Ames, IA 50011; F. Jay Breidt is Professor, Department of Statistics, Colorado State University, Fort Collins, CO 80523; Gretchen G. Moisen is Research Forester, USDA Forest Service, Rocky Mountain Research Station, th Street, Ogden, UT 84401; Göran Kauermann is Professor, Department of Economics, University of Bielefeld, Bielefeld, Germany. This work was supported in part by USDA Forest Service Rocky Mountain Research Station RJVA 02-JV and 01-JV , and National Science Foundation grants DMS and DMS

2 1 Introduction Accurate estimation of forest resources over large geographic areas is of significant interest to forest managers and forestry scientists. Nationwide forest surveys of the U.S. are conducted by the U.S. Department of Agriculture Forest Service Forest Inventory and Analysis (FIA) program (U. S. Department of Agriculture Forest Service (1992), Frayer and Furnival (1999), Gillespie (1999)). In these surveys, design-based estimates of quantities like total tree volume, growth and mortality, or area by forest type are produced on a regular basis. In the current article, we consider the estimation of such quantities within a 2.5 million ha ecological province (Bailey et al. 1994) that includes the Wasatch and Uinta Mountain Ranges of northern Utah. Forests in the area consist of pinyon-juniper, oak, and maple generally in the lower elevations, and lodgepole pine, ponderosa pine, aspen, and spruce-fir generally in the higher elevations. Many forest types intermix and swap elevation zones according to other topographic variables like aspect and slope. In addition to its ecological diversity, the area hosts numerous large ownerships including National Forests, Indian Reservations, National Parks and Monuments, state land holdings, and private lands. Each owner group faces different land management issues requiring precise forest resource information. Figure 1 displays the region of interest and the sample points collected in the early 1990 s for the survey we will consider here. While this article will focus on this particular example, the approach proposed here can be applied in other natural resource estimation problems. Currently, forest survey data are being collected through a two-phase systematic sampling procedure. In phase 1, remote sensing data and geographical information system (GIS) coverage information are extracted on an intensive sample grid. Phase 2 consists of a fieldvisited subset of the phase 1 grid. During these field visits, several hundred variables are collected, ranging from individual tree characteristics and size measurements to complex ratings on scales of ecological health. Once the data are collected, estimates of population totals and related quantities need to be calculated and tabulated for the overall region, as well as for a variety of domains defined by political subdivisions, types of forest, ownership category, etc. There are literally 2

3 thousands of estimates in the core tables put out by the FIA, with an even larger number of potential custom estimates that can be requested by data users. It is desirable for these estimates to be internally consistent, in the sense that the estimate of a sum of subdomain totals equals the sum of the subdomain total estimates. We refer to this estimation context as the problem of generic inference: making sensible estimates for a large number of quantities in a straightforward and internally consistent way. This can be contrasted with specific inference, in which the statistician responsible for producing estimates is studying a small number of variables and is able to build custom models for the dataset at hand. In the generic inference context, the statistician has neither time nor resources to conduct detailed analyses of all response variables. Therefore, the only practical way to produce estimates is often through design-based estimation, in which survey weights are constructed and applied to all variables and domains of interest. These weights are derived from the sampling design, but are adjusted based on ancillary information available for the sampled universe and/or collected as part of the survey. The ancillary information is used to calibrate the survey weights (making them sum to known universe quantities), and to improve the efficiency of the survey estimators. Once the weights are computed, users of the data can easily produce estimates for any variable of interest. Subdomain analyses are also simplified because the linear form of the estimators guarantees internal consistency. A large number of techniques are available to adjust survey weights based on auxiliary information. The use of auxiliary information in surveys dates back at least to Laplace (see Cochran, 1978), who employed a ratio estimator. The earliest references to regression in surveys include Jessen (1942) and Cochran (1942). Typically, auxiliary information is incorporated into the survey inference through parametric, linear models, leading to the familiar ratio and regression estimators (e.g., Cochran, 1977), post-stratification estimators (Holt and Smith, 1979), best linear unbiased estimators (Brewer, 1963; Royall, 1970), generalized regression estimators (Cassel et al. 1977; Särndal, 1980; Robinson and Särndal, 1983), and related estimators (Wright, 1983; Isaki and Fuller, 1982). Fuller (2002) is an excellent review. Recent advances in the use of auxiliary information include nonlinear estimation (Wu and Sitter, 2001), nonparametric survey regression estimation (Kuo (1988), Dorfman (1992), 3

4 Dorfman and Hall (1993), Chambers et al. (1993), Breidt and Opsomer (2000)), and the calibration point of view (Deville and Särndal, 1992). The approach currently used at the FIA is based on two-phase post-stratification (Chojnacky, 1998). Photo-interpreted vegetation cover type and ownership are used to divide the region of interest into homogeneous subsets, and the survey weights are calibrated to the phase 1 counts in each of the subsets. While this relatively simple estimator is more efficient than the two-phase expansion estimator, the increasing availability of a variety of inexpensive auxiliary information derived from remote sensing sources creates a tremendous opportunity, both to reduce costs and to further improve precision on forest survey estimates. This opportunity is all the more pressing because scientists within the Forest Service and other institutions have been using remote sensing and other GIS data to develop predictive and analytical models describing forest characteristics. This has been done in the specific inference context, in which significant effort is directed toward finding appropriate models for a small number of important variables. Because of the multivariate nature of the data and the complicated relationships among variables, nonparametric and semiparametric models have often been found to be a good compromise between model specification and flexibility. While these modelling efforts have led to improved understanding of the relationships between key forestry variables and remotely sensed information, so far this has not been reflected in corresponding improvements in forest survey estimates. The ultimate objective of this article is to explain how the results from the specific inferential efforts by forestry specialists can be used to improve the quality of their generic inference outputs as well. Model-assisted survey estimation (Särndal et al. 1992) is a well-known approach for incorporating auxiliary information in design-based survey estimation. It assumes the existence of a superpopulation model between the auxiliary variables and the variable of interest for the population to be sampled. This model is used to assist in the estimation of population quantities of interest in the sense that the estimators are quite efficient if the model is correctly specified, but maintain desirable properties such as design consistency and approximate design unbiasedness even if the model is misspecified (Robinson and Särndal, 1983). This is in contrast to purely model-based estimation, for which model misspecification 4

5 can lead to biased or inconsistent estimators. This is a critical distinction for generic inference, since any assumed model is unlikely to be equally appropriate across all the variables for which estimates need to be constructed. While model-assisted estimation has the potential to improve the precision of survey estimators when appropriate auxiliary information is available, it typically requires that these models be linear or at least have a known parametric shape. Breidt and Opsomer (2000) introduced local polynomial regression estimation, a survey estimation approach combining the modelling flexibility of nonparametric regression with model-assisted estimation. In this article, we describe how this approach can be extended to estimation for survey data from a two-phase design and with generalized nonparametric regression models. In the Utah mountains application as in many other forestry surveys, sampling is systematic, so that no direct design-based estimator of the variance is available. As we will discuss, the frequently used simple random sampling approximation results in a variance estimator with very poor practical behavior, so that reliance on this approximation for evaluating the reliability of survey estimators should be done with some care. We will argue that this traditional approach should at least be supplemented by other means of assessing variability. In this paper, we describe an approach based on simulating the population to be sampled and calculating the variance across repeated systematic samples from that population. In Section 2.1, we explain the two-phase systematic sampling design for the continuous spatial domain of interest, and in Section 2.2, we describe model-assisted survey estimation in this context. In Section 2.3, we incorporate additive and generalized additive models into this estimation framework. We discuss the specific models used for prediction for the northern Utah mountains forest inventory in Section 3.1, and show the results of applying nonparametric model-assisted estimation methods to the Utah data in Section 3.2. Section 3.3 discusses the issue of variance estimation for systematic sampling, Section 3.4 provides the evaluation of the methodology via simulation, and Section 4 concludes. 5

6 2 Methodology 2.1 Two-phase systematic sampling from a spatial domain The northern Utah mountains data were collected in two phases on a regularly spaced grid (see Figure 1). We describe the design properties of model-assisted estimation in this context by considering a rectangular spatial domain D = [0, L 1 ] [0, L 2 ], where L k = n 1k δ k = n 2k h k δ k, with n jk and h k positive integers and δ k positive real numbers. Here, δ k represents the grid spacing in dimension k, and h k the sub-sampling rate on dimension k for the phase two sample. Then, n 1 = n 11 n 12 is the phase one sample size and n 2 = n 21 n 22 is the phase two sample size. An irregular spatial domain is handled by intersecting it with the rectangle D. The two-phase systematic sampling design is implemented as follows. Let u k represent independent Uniform(0, 1) random variables and d k independent Discrete Uniform{1, 2,..., h k } random variables, with the u k, d k independent of each other. Given u = (u 1, u 2 ), the phase one sample is the randomly-located lattice {L i (u)} = {((u 1 + i 1 1)δ 1, (u 2 + i 2 1)δ 2 )} for i = (i 1, i 2 ) {1,..., n 11 } {1,..., n 12 }. Given d = (d 1, d 2 ), the phase two sample is the random sub-lattice {l j (u, d)} = {((u 1 + d 1 + (j 1 1)h 1 1)δ 1, (u 2 + d 2 + (j 2 1)h 2 1)δ 2 )} for j = (j 1, j 2 ) {1,..., n 21 } {1,..., n 22 }. Note that d {l j (u, d)} = {L i (u)}. 2.2 Model-assisted estimation To motivate the model-assisted approach which we use, we begin with a discussion of the twophase difference estimator. Let z(v) denote the study variable of interest, defined for v D but observed only for v {l j (u, d)}, and let z 0 (v) represent a different variable that is known for all v {L i (u)}. Note that neither z( ) nor z 0 ( ) is assumed stochastic, and in particular 6

7 neither depends on the random vectors u, d. Define D i = [(i 1 1)δ 1, i 1 δ 1 ] [(i 2 1)δ 2, i 2 δ 2 ]. Then the population total θ := = D z(v) dv = i [0,1] [0,1] can be estimated with the two-phase difference estimator ˆθ := z 0 (L i (u)) i 1/(δ 1 δ 2 ) + j = { z 0 (l j (u, d )) 1/(δ d 1 δ 2 ) j i D i z(v) dv z(l i (u)) du (1) 1/(δ 1 δ 2 ) z(l j (u, d)) z 0 (l j (u, d)) 1/(h 1 δ 1 h 2 δ 2 ) + z(l j(u, d )) z 0 (l j (u, d )) 1/(δ 1 δ 2 ) 1 {d=d } 1/(h 1 h 2 ) }, (2) with 1 {d=d } = 1 if d = d and 0 otherwise, where the summation over d is over all the possible values for the random pair (d 1, d 2 ). Since the indicator 1 {d=d } has expectation 1/(h 1 h 2 ), we have that E(ˆθ u) = d j z(l j (u, d )) 1/(δ 1 δ 2 ) = i z(l i (u)) 1/(δ 1 δ 2 ), (3) from which it is immediate that E(ˆθ) = [0,1] [0,1] E(ˆθ u) du = θ. (4) Also by standard results on systematic sampling from a finite population, we have that where D = D dv, S 2 (u) = Var (ˆθ ) D 2 ( u = 1 1 ) S 2 (u), (5) (n 21 n 22 ) 2 h 1 h 2 d t 2 d(u) ( d t d (u)) 2 /(h 1 h 2 ), for d {1,..., h 1 } {1,..., h 2 } h 1 h 2 1 and ( z(lj (u, d)) z 0 (l j (u, d)) ). (6) t d (u) = j 7

8 Therefore, the estimator ˆθ is design-unbiased regardless of the relationship between z and z 0, with design variance given by Var(ˆθ) = Var ( E(ˆθ u) ) + E ( Var(ˆθ u) ) = [0,1] [0,1] ( E(ˆθ u) θ ) 2 du + D 2 (n 21 n 22 ) 2 ( 1 1 ) E(S 2 (u)). (7) h 1 h 2 The first component of the variance does not depend on the choice of z 0, but the second component of the variance will be small if z 0 is a good predictor of z. In the following result, (4) and (7) are combined to show that ˆθ is design consistent under an asymptotic formulation in which the sampling density in D increases ( infill asymptotics ), assuming integrability conditions on z and z 0. This result is similar to consistency results obtained in design-based stereology (Arnau and Cruz-Orive, 1996), but the two-phase structure is novel. Result 1 If z( ) and z 0 ( ) are bounded and continuous almost everywhere on D, then ˆθ converges in mean square to θ as n jk with D fixed. Proof: The estimator ˆθ is unbiased by (4), so it suffices to show that its variance goes to zero. By hypothesis, both z and z 0 are Riemann integrable on D, so that from (3) and from (6) Since z and z 0 are bounded, we have that and lim E[ˆθ u] = θ n 11,n 12 lim D t d(u) = (z(v) z 0 (v)) dv. n 21,n 22 n 21 n 22 D ( ) 2 lim n 11,n 12 E[ˆθ u] θ du = [0,1] [0,1] [ lim E[S2 (u)] n 21,n 22 D 2 (n 21 n 22 ) = 2 D 2 E so that mean square consistency follows. lim [0,1] [0,1] n 11,n 12 lim n 21,n 22 ( E[ˆθ u] θ ) 2 du = 0 S 2 ] (u) = 0, (n 21 n 22 ) 2 In the absence of useful information from the first-phase sample, the simple expansion estimator ˆθ exp = j z(l j (u, d)) 1/(h 1 δ 1 h 2 δ 2 ) 8 (8)

9 obtained from (2) with z 0 0 can be used. In most cases of two-phase sampling, however, relatively inexpensive auxiliary information X(L i (u)) is collected at each phase one site. This information can be used to construct predictors of z guided by a superpopulation model E[z(v) X(v)] = µ(x(v)). (9) Typically, µ( ) is estimated from regression of {z(l j (u, d))} on {X(l j (u, d))}, and the resulting ˆµ(X(l j (u, d))) ˆµ j (u, d) is then substituted into (2) to form the model-assisted estimator ˆθ ma = d { ˆµj (u, d ) j 1/(δ 1 δ 2 ) + z(l j(u, d )) ˆµ j (u, d } ) 1 {d=d }. (10) 1/(δ 1 δ 2 ) 1/(h 1 h 2 ) Unlike z 0 ( ), ˆµ( ) usually does depend on u and d so the unbiasedness argument in (4) and the variance expression in (7) no longer hold exactly. However, under mild conditions which we do not explore further here, the model-assisted estimator should follow the traditional model-assisted paradigm and remain asymptotically design-unbiased and consistent, with approximate variance given by Var(ˆθ ma ) = [0,1] [0,1] ( E(ˆθma u) θ ) 2 du + D 2 (n 21 n 22 ) 2 ( 1 1 ) E(Se 2 (u)) (11) h 1 h 2 where S 2 e (u) = d ˆt 2 d(u) ( ˆt d d (u) ) 2 /(h1 h 2 ), for d {1,..., h 1 } {1,..., h 2 } h 1 h 2 1 and ˆt d (u) = j (z(l j (u, d)) µ(l j (u, d))), and µ( ) is obtained from the (hypothetical) regression of {z(l i (u))} on {X(L i (u))}. It is now clear why a model can improve the efficiency of the estimator. If the model fits the data well, the variance of the residuals z(l i (u)) µ(l i (u)) can be expected to be smaller than the variance of the z(l i (u)). If the model fits poorly, the residual variance should be equally large or even potentially larger than the study variable s variance. Hence, the efficiency gains of the model-assisted estimator depend on the selection of a good model for µ( ) in (9). 9

10 Traditionally, µ( ) is assumed to be linear, in which case ˆθ ma is known as the generalized regression estimator (see Särndal et al. 1992). The post-stratified estimator for θ can be considered as a special case of the generalized regression estimator, in which the auxiliary variables are categorical. By classifying the phase two sample into a small number of poststrata based on the phase one information, this estimator is commonly used in forest resource monitoring as a relatively simple way to incorporate auxiliary information in the estimation. See Särndal et al. (1992, Section 7.6) for explicit expressions for the post-stratified estimator and its variance. Recently, Breidt and Opsomer (2000) considered nonparametric models, fitted by local polynomial regression, as a more flexible alternative to linear and parametric models. A wide range of other nonparametric regression methods are available, including other kernel-based methods, spline methods and orthogonal decomposition-based approaches (see e.g. Opsomer (2002) for an overview), and in principle any of these can be used to produce predictions ˆµ j. So far, applications of these methods to survey estimation have been limited to element sampling. The estimator ˆθ ma has some additional desirable properties if the regression method is linear, in the sense that ˆµ i (u, d) = j ω ij z(l j (u, d)) for a set of smoothing weights ω ij that do not depend on the {z(l j (u, d))}. This linearity holds for many generalized regression estimators, including the post-stratification estimator. In this case, ˆθ ma can be written as a linear combination of the sample observations ˆθ ma = j ω j z(l j (d, u)), with weights {ω j } independent of the z(l j (u, d)). These regression weights are ideal for generic inference, as they can be used for any variables collected in the same survey, and to the extent that such variables also follow model (9), they will also benefit from the efficiency gain. 2.3 Model-assisted estimation using generalized additive models Suppose now that µ(x(v)) is the generalized additive model µ(x(v)) = E(z(v) X(v)) = g(m 1 (X 1 (v)) m r (X r (r))) (12) for some known link function g( ) and unknown smooth functions m k ( ), k = 1,..., r, where the X k (v) are known subsets of the vector X(v). Given a set of estimated func- 10

11 tions ˆm k ( ), k = 1,..., r, model predictions ˆµ i = g( ˆm 1 (X 1 (L i (u))) ˆm r (X r (L i (u)))) are readily calculated, for instance using the gam() local scoring estimation routines (Hastie and Tibshirani, 1990) implemented in S-Plus. When the link function g( ) is the identity link, model (12) is referred to as an additive model and the resulting estimators are linear, in the sense that they can be written as a linear combination of the observations. If g( ) is not the identity link, however, local scoring estimators are not linear and the resulting estimator ˆθ gam is no longer a linear combination of the {z(l j (u, d))}, so that weights are not available. In Section 3.2, we discuss an approach for obtaining weights from a generalized additive model for the forestry application. 3 Application to Forest Inventory 3.1 Generalized additive models for the forest inventory data We now discuss generalized additive models for the Utah mountains forest inventory. Field data used in this study were collected on a 5 km sample grid (Figure 1). On the 968 phase 2 sample plots, numerous forest site variables and individual tree measurements were collected, including a binary classification (FOREST) of the plot into forest or non-forest. The FOREST variable is critical in the inventory because many other response variables are defined to be zero on non-forested sites. In this study, we consider five additional variables, all of which follow this definitional constraint: NVOLTOT, total wood volume in cubic ft per acre; BA, tree basal area per acre; BIOMASS, total wood biomass in tons per acre; CRCOV, percent crown cover; and QMDALL, quadratic mean diameter in inches. In addition to the field plot data, remotely sensed information was extracted on the 5 km field plot locations as well as on an intensified 1 km grid (24,980 points), which will represent the phase 1 data. The ancillary variables used in our models came from three sources: 1. Digital elevation models produced by the U.S. Defense Mapping Agency, which provided elevation (ELEV90CU), transformed aspect (TRASP90) and slope (SLP90CU) m resolution Thematic Mapper (TM) imagery, from which we extracted the vegeta- 11

12 tion cover type from the U.S. National Land Cover dataset (Vogelmann et al. 2001) collapsed to seven vegetation classes (NLCD7). Also, letting MRLC00Bk denote the kth TM spectral band, we used MRLC00B5 by itself and we computed a Normalized Difference Vegetation Index (NDVI) as (MRLC00B4 MRLC00B3)/(MRLC00B4+MRLC00B3). 3. Spatial coordinates (Xs and Ys). Moisen and Edwards (1999), Frescino et al. (2001) and Moisen and Frescino (2002) developed parametric and nonparametric models relating remotely sensed data to forest attributes observed during field visits. Taking a similar approach here, we model the response variable FOREST as a nonparametric function of the ancillary predictor variables mentioned above through a generalized additive model with a logit link function g( ). The model (12) was fitted using gam() in S-Plus. Component functions were obtained through loess smoothers with local polynomials of degree 1 and a relatively large smoothing parameter (see Opsomer (2002) for an explanation of the loess smoothing method and smoothing parameter selection). Predictor variables ELEV90CU, TRASP90, SLP90CU, MRLC00B5, and NDVI entered the model as univariate smooth terms, while Xs with Ys contributed as a bivariate smooth function, and NLCD7 entered as a categorical variable in the model. The plots of the smooth contributing terms in the FOREST model are shown in Figure Model-assisted estimation for the forest inventory We calculate the following estimators for FOREST: 1. EXP, the expansion estimator in (8), 2. PS, a two-phase post-stratified estimator with the seven categories of variable NLCD7 as post-strata, representing a common choice in FIA, 3. REG, a model-assisted estimator from (10), with parametric regression on the dummy variables for NLCD7 plus linear terms for ELEV90CU, TRASP90, SLP90CU, MRLC00B5, NDVI, and Xs and Ys spatial coordinates, 12

13 4. GAM, the gam-assisted estimator from (10) with the model described in the subsection above fitted via local scoring. For the remaining response variables, we take the traditional large-scale survey point of view in which estimation is performed through the use of survey weights, as explained in Section 1. To obtain the operational advantages of weights, along with the efficiency gains of the gam, we consider a regression model that treats the gam fits for FOREST as an auxiliary variable in the estimation of the remaining variables. One possible way to do this is to simply treat the FOREST fits as an auxiliary variable in a linear model specification, and compute the weights of a (linear) regression estimator. In that way, any variables correlated to the model-fitted probability of the presence of forest will be estimated more efficiently. In this case however, the special structure of the relationship between the presence/absence of forest and the other variables suggests a more appropriate model. For every phase 1 site, we use the available auxiliary information to construct the indicator that is one when the GAM-predicted probability of forest is greater than the empirical proportion of forest in phase 2. A regression model consisting entirely of interactions between this forest indicator and other covariates is then constructed. The covariates include dummy variables for NLCD7 (with non-forest categories collapsed to ensure full rank), plus linear terms for ELEV90CU, TRASP90, SLP90CU, MRLC00B5, NDVI, and Xs and Ys spatial coordinates. Note that this regression model predicts zero for the response variable at any site for which FOREST is predicted to be zero. A standard model-assisted linear regression estimator is then built from this regression model. We refer to this regression-interaction model-assisted estimator as REGI. Treating the gam-predicted probabilities as fixed with respect to the design, the estimator REGI can be written as a linear combination of the response variables, and weights are obtained. Because only the interaction variables were included in the model, these weights will be calibrated to the totals on the part of phase 1 classified as likely to be forest, and will only be approximately calibrated on all of phase 1. Table 1 shows the estimates for all six variables, as well as the estimated standard deviations. Following standard FIA practice, these estimated standard deviations assume simple random sampling with replacement in phase 1 and without replacement in phase 2. These 13

14 empirical results suggest that the GAM estimator and the related regression estimator with interactions (REGI) dominate the simple expansion estimator, the post-stratification estimator, and the regression estimator. 3.3 Variance estimation under systematic sampling The estimated efficiencies in Table 1 are somewhat suspect, however, because they rely on asymptotic variance approximations, and they act as if the actual systematic samples were in fact drawn via simple random sampling. This last point is potentially serious when the number of possible systematic samples is small, as in this 1-in-25 systematic subsample. To illustrate this problem, consider the ideal circumstance in which the trend in the variable of interest can be completely removed by covariates, and the residuals are iid normal (0, σ 2 ) random variables. Condition on phase 1 and let H = h 1 h 2 denote the total number of systematic phase 2 samples. Let n 2 denote the phase 2 sample size. Then, as shown in Theorem 8.5 of Cochran (1977), the average (over all possible realizations of the normal residuals) systematic sampling variance in (5) is equal to the average of the simple random sampling variance estimator, D 2 ( 1 1 ) j {z(l j (u, d)) ˆµ j } 2 ( j {z(l j (u, d)) ˆµ j } ) 2 /n2. (13) (n 21 n 22 ) 2 h 1 h 2 n 2 1 But such unbiasedness is not so interesting for a given realization of the population. Indeed, consider F = systematic sampling variance in (5) simple random sampling variance estimator in (13). Under the assumptions above, it is immediate that this ratio is F-distributed with H 1 numerator degrees of freedom and n 2 1 denominator degrees of freedom. As n 2, { } 1/2 F = 1 + O p 2(1 + H/n2 ), H 1 so that, at least in this simple case, the simple random sampling variance estimator is inconsistent unless H tends to infinity. In the northern Utah mountains data set, n 2 = 968 but H = 25. The quartiles of the corresponding F distribution are and Thus, 14

15 in about half of the possible realizations of the population, the simple random sampling variance estimator will be off by ±20% or more. The quantile is and the quantile is 1.655, so departures on the order of ±50% are easily possible. This problem of variance estimation is basically intractable given only the sample, since it amounts to a sample of size one. Indeed, all of the variance estimators for systematic sampling given in Wolter (1985, Section 7.2.1) will perform poorly, as they all are forced to rely on within-systematic-sample variation to approximate between-systematic-sample variation. We therefore consider a simulation-based alternative in the following subsection. 3.4 Simulations Because the variance estimators are so unreliable in this context, we undertake a numerical experiment to assess the efficiencies of the various estimators. Our approach is to construct a population that closely mimics the one we are sampling from, draw repeated systematic samples from that population, and calculate variances based on these repeated samples. While any conclusions drawn from this procedure will of course depend on how well the chosen population model corresponds to the true population, we believe that it has the potential to provide a more reliable measure of the true variability of the estimators than the simple random sampling approximation. We begin by fitting large, parametric models to each of the variables studied in Table 1. The first model is a logistic regression for the forest/non-forest indicator. It includes six dummy variables for the categories of NLCD7, fourth-order polynomials for ELEV90CU, TRASP90, SLP90CU, MRLC00B5, NDVI and the two spatial coordinates, as well as a first-order interaction term for the spatial coordinates. The models for the remaining study variables contain similar terms, and are fitted to the positive responses after suitable transformation (typically square root). Using these fitted models, we simulate populations of study variables on all of the phase 1 sites. In this experiment, we condition on phase 1 because its percentage contribution to the empirical variances of the estimators was found to be small (around 5-7%), as shown in Table 1, and its contribution is common to all the estimators. The binary variable is 15

16 simulated with unequal probability Bernoulli random variables, and the remaining response variables are simulated on the transformed scale with Gaussian noise, then mapped back to the original scale. These response variables are set to zero wherever the simulated FOREST variable is zero. We draw all 25 possible systematic phase 2 samples from phase 1, compute the estimates for each sample, and then compute averages and variances over these 25 samples. Note that these 25 represent the entire conditional randomization distribution of the estimators, so that empirical means and variances are exactly the conditional expectation and conditional variance, given phase 1. The results are given in Table 2. The expectations of the estimators for the simulated populations are comparable to the corresponding estimates for the actual populations in Table 1, and the expectations of the estimated standard errors for the simulated populations are comparable to the corresponding estimates for the actual populations. These comparisons suggest that the simulated populations reproduce at least the second-order moment structure of the real data fairly well. As expected, the expansion estimator is exactly unbiased, and the remaining estimators are all essentially unbiased (percent relative biases no more than 0.25% in all cases) due to their model-assisted structure. The PS estimator, which is the Forest Service standard, is better than the EXP estimator in all cases, but even the simple regression estimator REG usually offers gains over both the expansion estimator and the PS estimator. The GAM estimator is much more efficient than its competitors for the FOREST variable, and the regression estimator with GAM-dependent interaction terms (REGI) is more efficient than its competitors for all of the other variables. The efficiency gains estimated through this simulation procedure are quite different from those estimated using the simple random sampling approximation, with those for FOREST, BA and QMDALL larger but those for NVOLTOT, BIOMASS and CRCOV smaller. This is readily explained by the results in the last column of Table 2, which shows that the simple random sampling variance estimator performs very poorly in this context, behaving somewhat like the hypothetical F random variable described in Section 3.2. Overall, these simulation results suggest that though the efficiency gains reported in Table 1 may be unre- 16

17 liable due to the lack of good variance estimators, there are in fact real gains obtained with the GAM-assisted and related regression estimators. 4 Conclusion Auxiliary information from remote sensing or other sources is becoming increasingly available to organizations involved in natural resource surveys. Scientists in these organizations are already developing detailed prediction models for many variables of interest, but they have tended not to use these prediction models in their survey estimation procedures. In this article, we have explained how nonparametric model-assisted estimation techniques can be used to incorporate the results of such modelling efforts in the production of survey estimates, even in the case of fairly complex models and multi-phase designs. We have provided some theoretical justification for gam-assisted survey inference in the context of two phases of systematic sampling from a spatial domain. The gam-assisted methodology was applied in a survey of forest resources in the mountains of northern Utah, a region important for its ecological and land-use diversity. Theoretical properties of this approach in complex surveys deserve further investigation. Important open issues include model selection and the selection of the smoothing parameters for the nonparametric regression fitting algorithms, since this affects both the estimates of the quantities of interest and their estimated variances. In the course of this research, the unsatisfactory behavior of the traditional estimator of the design variance under systematic sampling became apparent, and we used a simulation-based alternative to evaluate our proposed estimation procedure. Future research into simulation-based variance estimation in this context, including choice of models and robustness to their selection, certainly appears warranted. References Arnau, X. G. and L. M. Cruz-Orive (1996). Consistency in systematic sampling. Advances in Applied Probability 28,

18 Bailey, R. G., P. E. Avers, T. King, and W. H. McNab (Eds.) (1994). Ecoregions and Subregions of the United States (map). Washington, DC: U.S. Geological Survey. Scale 1:7,500,000, colored, accompanied by a supplementary table of map unit descriptions, prepared for the U.S. Department of Agriculture, Forest Service. Breidt, F. J. and J. D. Opsomer (2000). Local polynomial regression estimators in survey sampling. Annals of Statistics 28, Brewer, K. R. W. (1963). Ratio estimation and finite populations: Some results deducible from the assumption of an underlying stochastic process (Corr: 66V8 p37). The Australian Journal of Statistics 5, Cassel, C. M., C. E. Särndal, and J. H. Wretman (1977). Foundations of Inference in Survey Sampling. New York: Wiley. Chambers, R. L., A. H. Dorfman, and T. E. Wehrly (1993). Bias robust estimation in finite populations using nonparametric calibration. Journal of the American Statistical Association 88, Chojnacky, D. C. (1998). Double sampling for stratification: A forest inventory application in the interior west. Research Paper RMRS-RP-7, U. S. Department of Agriculture, Forest Service, Rocky Mountain Research Station, Ogden, UT. Cochran, W. G. (1942). Sampling theory when the sampling units are of unequal size. Journal of the American Statistical Association 37, Cochran, W. G. (1977). Sampling Techniques (3rd ed.). New York: John Wiley & Sons. Cochran, W. G. (1978). Laplace s ratio estimator. In H. A. David (Ed.), Contributions to survey sampling and applied statistics, pp New York: Academic Press. Deville, J.-C. and C.-E. Särndal (1992). Calibration estimators in survey sampling. Journal of the American Statistical Association 87, Dorfman, A. H. (1992). Non-parametric regression for estimating totals in finite populations. In ASA Proceedings of the Section on Survey Research Methods, pp American Statistical Association (Alexandria, VA). 18

19 Dorfman, A. H. and P. Hall (1993). Estimators of the finite population distribution function using nonparametric regression. The Annals of Statistics 21, Frayer, W. E. and G. M. Furnival (1999). Forest survey sampling designs: A history. Journal of Forestry 97, 4 8. Frescino, T., T.C. Edwards, Jr, and G. Moisen (2001). Modelling spatially explicit structural attributes using generalized additive models. Journal of Vegetation Science 12, Fuller, W. A. (2002). Regression estimation for survey samples. Survey Methodology 28, Gillespie, A. J. R. (1999). Rationale for a national annual forest inventory program. Journal of Forestry 97, Hastie, T. J. and R. J. Tibshirani (1990). Generalized Additive Models. Washington, D. C.: Chapman and Hall. Holt, D. and T. M. F. Smith (1979). Post-stratification. Journal of the Royal Statistical Society, Series A 142, Isaki, C. and W. Fuller (1982). Survey design under the regression superpopulation model. Journal of the American Statistical Association 77, Jessen, R. J. (1942). Statistical investigation of a sample survey for obtaining farm facts. Research Bulletin 304, Iowa Agriculture Experiment Station. Kuo, L. (1988). Classical and prediction approaches to estimating distribution functions from survey data. In ASA Proceedings of the Section on Survey Research Methods, pp American Statistical Association (Alexandria, VA). Moisen, G. and T. Edwards (1999). Use of generalized linear models and digital data in a forest inventory of Utah. Journal of Agricultural, Biological and Environmental Statistics 4, Moisen, G. G. and T. S. Frescino (2002). Comparing five modelling techniques for predicting forest characteristics. Ecological Modelling 157,

20 Opsomer, J. D. (2002). Nonparametric regression model. In A. H. El-Shaarawi and W. W. Piegorsch (Eds.), Encyclopedia of Environmetrics, Volume 3, pp Chichester, UK: Wiley & Sons. Robinson, P. M. and C.-E. Särndal (1983). Asymptotic properties of the generalized regression estimator in probability sampling. Sankhya, Series B 45, Royall, R. M. (1970). On finite population sampling theory under certain linear regression models. Biometrika 57, Särndal, C.-E. (1980). On π-inverse weighting versus best linear unbiased weighting in probability sampling. Biometrika 67, Särndal, C.-E., B. Swensson, and J. Wretman (1992). Model Assisted Survey Sampling. New York: Springer-Verlag. U. S. Department of Agriculture Forest Service (1992). Forest Service resource inventories: An overview. Technical report, Washington, DC. Vogelmann, J. E., S. M. Howard, L. Yang, C. R. Larson, B. K. Wylie, and N. V. Driel (2001). Completion of the 1990s National Land Cover data set for the conterminous United States from Landsat Thematic Mapper data and ancillary data sources. Photogrammetric Engineering and Remote Sensing 67, Wolter, K. M. (1985). Introduction to Variance Estimation. New York: Springer-Verlag Inc. Wright, R. L. (1983). Finite population sampling with multivariate auxiliary information. Journal of the American Statistical Association 78, Wu, C. and R. R. Sitter (2001). A model-calibration approach to using complete auxiliary information from survey data. Journal of the American Statistical Association 96,

21 Table 1: Estimation results for the northern Utah mountains data. Estimators are expansion (EXP), post-stratification (PS), regression on linear terms of the continuous auxiliaries, plus dummies for the categorical variable (REG); generalized additive model on the same variables (GAM); and linear regression on the same terms, but with all terms interacted with the indicator that the GAM-predicted probability of forest is greater than the empirical proportion of forest (REGI). Estimated Percent Est. Relative Study Estimated Standard Variance Efficiency of Variable Estimator Mean Error from Phase 1 GAM/REGI FOREST EXP (forest/ PS non-forest REG binary) GAM NVOLTOT EXP (total wood PS volume in REG cuft/acre) REGI BA EXP (tree basal PS area per REG acre) REGI BIOMASS EXP (total wood PS biomass in REG tons/acre) REGI CRCOV EXP (percent PS crown REG cover) REGI QMDALL EXP (quadratic PS mean diameter REG in inches) REGI

22 Table 2: Results from all 25 possible systematic sub-samples from the phase 1 sample for the simulated populations. Estimators are expansion (EXP), post-stratification (PS), regression on linear terms of the continuous auxiliaries, plus dummies for the categorical variable (REG); generalized additive model on the same variables (GAM); and linear regression on the same terms, but with all terms interacted with the indicator that the GAM-predicted probability of forest is greater than the empirical proportion of forest (REGI). Expectation Expectation of Percent Relative Percent Bias Simulated of Estimated Relative Efficiency of of Variance Variable Estimator Estimator Std. Error Bias GAM/REGI Estimator FOREST EXP (forest/ PS non-forest REG binary) GAM NVOLTOT EXP (total wood PS volume in REG cuft/acre) REGI BA EXP (tree basal PS area per REG acre) REGI BIOMASS EXP (total wood PS biomass in REG tons/acre) REGI CRCOV EXP (percent PS crown REG cover) REGI QMDALL EXP (quadratic PS mean diameter REG in inches) REGI

23 Figure Captions Figure 1: Representation of the study region in northern Utah. Each triangle represents a field-visited phase 2 plot. Each dot in the magnified section represents a remotelysensed phase 1 plot. See Section 3.1 for an explanation of the phase 1 and phase 2 plots. Figure 2: GAM model fits for binary indicator of forest/non-forest (FOREST). 23

24 Figure 1: 24

25 Figure 2: lo(xs, Ys, span = 0.5) Ys Xs s(elev90cu, df = 4) ELEV90CU s(trasp90, df = 4) s(slp90cu, df = 4) TRASP SLP90CU s(mrlc00b5, df = 4) s(ndvi, df = 4) MRLC00B NDVI 25

Model-assisted Estimation of Forest Resources with Generalized Additive Models

Model-assisted Estimation of Forest Resources with Generalized Additive Models Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean Opsomer, Jay Breidt, Gretchen Moisen, Göran Kauermann August 9, 2006 1 Outline 1. Forest surveys 2. Sampling from spatial

More information

Model-Assisted Estimation of Forest Resources With Generalized Additive Models

Model-Assisted Estimation of Forest Resources With Generalized Additive Models Model-Assisted Estimation of Forest Resources With Generalized Additive Models Jean D. OPSOMER, F.JayBREIDT, Gretchen G. MOISEN, and Göran KAUERMANN Multiphase surveys are often conducted in forest inventories,

More information

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey

Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey Additional results for model-based nonparametric variance estimation for systematic sampling in a forestry survey J.D. Opsomer Colorado State University M. Francisco-Fernández Universidad de A Coruña July

More information

F. Jay Breidt Colorado State University

F. Jay Breidt Colorado State University Model-assisted survey regression estimation with the lasso 1 F. Jay Breidt Colorado State University Opening Workshop on Computational Methods in Social Sciences SAMSI August 2013 This research was supported

More information

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Ji-Yeon Kim Iowa State University F. Jay Breidt Colorado State University Jean D. Opsomer Colorado State University

More information

Two Applications of Nonparametric Regression in Survey Estimation

Two Applications of Nonparametric Regression in Survey Estimation Two Applications of Nonparametric Regression in Survey Estimation 1/56 Jean Opsomer Iowa State University Joint work with Jay Breidt, Colorado State University Gerda Claeskens, Université Catholique de

More information

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY J.D. Opsomer, W.A. Fuller and X. Li Iowa State University, Ames, IA 50011, USA 1. Introduction Replication methods are often used in

More information

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1

Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata. Data. Methods. James A. Westfall and Michael Hoppus 1 Sensitivity of FIA Volume Estimates to Changes in Stratum Weights and Number of Strata James A. Westfall and Michael Hoppus 1 Abstract. In the Northeast region, the USDA Forest Service Forest Inventory

More information

Finite Population Sampling and Inference

Finite Population Sampling and Inference Finite Population Sampling and Inference A Prediction Approach RICHARD VALLIANT ALAN H. DORFMAN RICHARD M. ROYALL A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson Department of Statistics Stockholm University Introduction

More information

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION

NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Statistica Sinica 2011): Preprint 1 NONPARAMETRIC ENDOGENOUS POST-STRATIFICATION ESTIMATION Mark Dahlke 1, F. Jay Breidt 1, Jean D. Opsomer 1 and Ingrid Van Keilegom 2 1 Colorado State University and 2

More information

GENERALIZED LINEAR MIXED MODELS FOR ANALYZING ERROR IN A SATELLITE-BASED VEGETATION MAP OF UTAH

GENERALIZED LINEAR MIXED MODELS FOR ANALYZING ERROR IN A SATELLITE-BASED VEGETATION MAP OF UTAH Published as: Moisen, G. G., D. R. Cutler, and T. C. Edwards, Jr. 1999. Generalized linear mixed models for analyzing error in a satellite-based vegetation map of Utah. Pages 37-44 in H. T. Mowrer and

More information

Comparison of Imputation Procedures for Replacing Denied-access Plots

Comparison of Imputation Procedures for Replacing Denied-access Plots Comparison of Imputation Procedures for Replacing Denied-access Plots Susan L. King 1 Abstract. In forest inventories, missing plots are caused by hazardous terrain, inaccessible locations, or denied access.

More information

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Statistica Sinica 8(1998), 1165-1173 A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Phillip S. Kott National Agricultural Statistics Service Abstract:

More information

Generalized Linear Mixed Models for Analyzing Error in a Satellite-based Vegetation Map of Utah

Generalized Linear Mixed Models for Analyzing Error in a Satellite-based Vegetation Map of Utah Published as: Moisen, G. G., R. D. Cutler, and T. C. Edwards, Jr. 1996. Generalized linear mixed models for analyzing error in a satellite-based vegetation map of Utah. Pages 459-466 in H. T. Mowrer, R.

More information

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/2/03) Ed Stanek Here are comments on the Draft Manuscript. They are all suggestions that

More information

Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery

Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery Estimating basal area, trees, and above ground biomass per acre for common tree species across the Uncompahgre Plateau using NAIP CIR imagery By John Hogland Nathaniel Anderson Greg Jones Obective Develop

More information

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Statistica Sinica 24 (2014), 1001-1015 doi:http://dx.doi.org/10.5705/ss.2013.038 INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Seunghwan Park and Jae Kwang Kim Seoul National Univeristy

More information

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Statistica Sinica 8(1998), 1153-1164 REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Wayne A. Fuller Iowa State University Abstract: The estimation of the variance of the regression estimator for

More information

arxiv: v2 [math.st] 20 Jun 2014

arxiv: v2 [math.st] 20 Jun 2014 A solution in small area estimation problems Andrius Čiginas and Tomas Rudys Vilnius University Institute of Mathematics and Informatics, LT-08663 Vilnius, Lithuania arxiv:1306.2814v2 [math.st] 20 Jun

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson 1 Introduction When planning the sampling strategy (i.e.

More information

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US

Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov

More information

A Forest Inventory ESTimation and Analysis tool. Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen

A Forest Inventory ESTimation and Analysis tool. Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen A Forest Inventory ESTimation and Analysis tool Tracey Frescino Paul Patterson Elizabeth Freeman Gretchen Moisen FIA s Data and Estimation Tools FIA DataMart - Web-based tool to access FIA database through

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

Non-uniform coverage estimators for distance sampling

Non-uniform coverage estimators for distance sampling Abstract Non-uniform coverage estimators for distance sampling CREEM Technical report 2007-01 Eric Rexstad Centre for Research into Ecological and Environmental Modelling Research Unit for Wildlife Population

More information

Model Assisted Survey Sampling

Model Assisted Survey Sampling Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling

More information

Transformation and Smoothing in Sample Survey Data

Transformation and Smoothing in Sample Survey Data Scandinavian Journal of Statistics, Vol. 37: 496 513, 2010 doi: 10.1111/j.1467-9469.2010.00691.x Published by Blackwell Publishing Ltd. Transformation and Smoothing in Sample Survey Data YANYUAN MA Department

More information

Combining data from two independent surveys: model-assisted approach

Combining data from two independent surveys: model-assisted approach Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,

More information

Penalized Balanced Sampling. Jay Breidt

Penalized Balanced Sampling. Jay Breidt Penalized Balanced Sampling Jay Breidt Colorado State University Joint work with Guillaume Chauvet (ENSAI) February 4, 2010 1 / 44 Linear Mixed Models Let U = {1, 2,...,N}. Consider linear mixed models

More information

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Peter M. Aronow and Cyrus Samii Forthcoming at Survey Methodology Abstract We consider conservative variance

More information

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Advanced Methods for Agricultural and Agroenvironmental Monitoring Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Outline 1. Introduction to the National Resources Inventory 2. Hierarchical

More information

Instructions for Running the FVS-WRENSS Water Yield Post-processor

Instructions for Running the FVS-WRENSS Water Yield Post-processor Instructions for Running the FVS-WRENSS Water Yield Post-processor Overview The FVS-WRENSS post processor uses the stand attributes and vegetative data from the Forest Vegetation Simulator (Dixon, 2002)

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 1 Bootstrapped Bias and CIs Given a multiple regression model with mean and

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

Parametric Techniques Lecture 3

Parametric Techniques Lecture 3 Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to

More information

Interaction effects for continuous predictors in regression modeling

Interaction effects for continuous predictors in regression modeling Interaction effects for continuous predictors in regression modeling Testing for interactions The linear regression model is undoubtedly the most commonly-used statistical model, and has the advantage

More information

One-phase estimation techniques

One-phase estimation techniques One-phase estimation techniques Based on Horwitz-Thompson theorem for continuous populations Radim Adolt ÚHÚL Brandýs nad Labem, Czech Republic USEWOOD WG2, Training school in Dublin, 16.-19. September

More information

Modelling Survival Data using Generalized Additive Models with Flexible Link

Modelling Survival Data using Generalized Additive Models with Flexible Link Modelling Survival Data using Generalized Additive Models with Flexible Link Ana L. Papoila 1 and Cristina S. Rocha 2 1 Faculdade de Ciências Médicas, Dep. de Bioestatística e Informática, Universidade

More information

Parametric Techniques

Parametric Techniques Parametric Techniques Jason J. Corso SUNY at Buffalo J. Corso (SUNY at Buffalo) Parametric Techniques 1 / 39 Introduction When covering Bayesian Decision Theory, we assumed the full probabilistic structure

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

Land Cover and Soil Properties of the San Marcos Subbasin

Land Cover and Soil Properties of the San Marcos Subbasin Land Cover and Soil Properties of the San Marcos Subbasin Cody McCann EWRE Graduate Studies December 6, 2012 Table of Contents Project Background............................................................

More information

Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics

Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics Amang S. Sukasih, Mathematica Policy Research, Inc. Donsig Jang, Mathematica Policy Research, Inc. Amang S. Sukasih,

More information

Changing Ecoregional Map Boundaries

Changing Ecoregional Map Boundaries February 12, 2004 By Robert G. Bailey, USDA Forest Service, Inventory & Monitoring Institute Changing Ecoregional Map Boundaries The Forest Service has developed a mapping framework to help managers better

More information

Spatio-temporal prediction of site index based on forest inventories and climate change scenarios

Spatio-temporal prediction of site index based on forest inventories and climate change scenarios Forest Research Institute Spatio-temporal prediction of site index based on forest inventories and climate change scenarios Arne Nothdurft 1, Thilo Wolf 1, Andre Ringeler 2, Jürgen Böhner 2, Joachim Saborowski

More information

No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture

No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture Phillip S. Kott National Agricultural Statistics Service Key words: Weighting class, Calibration,

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Nonparametric Principal Components Regression

Nonparametric Principal Components Regression Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS031) p.4574 Nonparametric Principal Components Regression Barrios, Erniel University of the Philippines Diliman,

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Chapter V A Stand Basal Area Growth Disaggregation Model Based on Dominance/Suppression Competitive Relationships

Chapter V A Stand Basal Area Growth Disaggregation Model Based on Dominance/Suppression Competitive Relationships Chapter V A Stand Basal Area Growth Disaggregation Model Based on Dominance/Suppression Competitive Relationships Introduction Forest growth and yield models were traditionally classified into one of three

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

Denis White NSI Technical Services Corporation 200 SW 35th St. Corvallis, Oregon 97333

Denis White NSI Technical Services Corporation 200 SW 35th St. Corvallis, Oregon 97333 POLYGON OVERLAY TO SUPPORT POINT SAMPLE MAPPING: THE NATIONAL RESOURCES INVENTORY Denis White NSI Technical Services Corporation 200 SW 35th St. Corvallis, Oregon 97333 Margaret Maizel ' American Farmland

More information

Carl N. Morris. University of Texas

Carl N. Morris. University of Texas EMPIRICAL BAYES: A FREQUENCY-BAYES COMPROMISE Carl N. Morris University of Texas Empirical Bayes research has expanded significantly since the ground-breaking paper (1956) of Herbert Robbins, and its province

More information

Quality and Coverage of Data Sources

Quality and Coverage of Data Sources Quality and Coverage of Data Sources Objectives Selecting an appropriate source for each item of information to be stored in the GIS database is very important for GIS Data Capture. Selection of quality

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Single-index model-assisted estimation in survey sampling

Single-index model-assisted estimation in survey sampling Journal of onparametric Statistics Vol. 21, o. 4, May 2009, 487 504 Single-index model-assisted estimation in survey sampling Li Wang* Department of Statistics, University of Georgia, Athens, GA, 30602,

More information

Protocol Calibration in the National Resources Inventory

Protocol Calibration in the National Resources Inventory Protocol Calibration in the National Resources Inventory Cindy Yu Jason C. Legg August 23, 2007 ABSTRACT The National Resources Inventory (NRI) is a large-scale longitudinal survey conducted by the National

More information

VCS MODULE VMD0018 METHODS TO DETERMINE STRATIFICATION

VCS MODULE VMD0018 METHODS TO DETERMINE STRATIFICATION VMD0018: Version 1.0 VCS MODULE VMD0018 METHODS TO DETERMINE STRATIFICATION Version 1.0 16 November 2012 Document Prepared by: The Earth Partners LLC. Table of Contents 1 SOURCES... 2 2 SUMMARY DESCRIPTION

More information

Empirical Likelihood Methods for Sample Survey Data: An Overview

Empirical Likelihood Methods for Sample Survey Data: An Overview AUSTRIAN JOURNAL OF STATISTICS Volume 35 (2006), Number 2&3, 191 196 Empirical Likelihood Methods for Sample Survey Data: An Overview J. N. K. Rao Carleton University, Ottawa, Canada Abstract: The use

More information

Lawrence D. Brown* and Daniel McCarthy*

Lawrence D. Brown* and Daniel McCarthy* Comments on the paper, An adaptive resampling test for detecting the presence of significant predictors by I. W. McKeague and M. Qian Lawrence D. Brown* and Daniel McCarthy* ABSTRACT: This commentary deals

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

SUPPLEMENT TO PARAMETRIC OR NONPARAMETRIC? A PARAMETRICNESS INDEX FOR MODEL SELECTION. University of Minnesota

SUPPLEMENT TO PARAMETRIC OR NONPARAMETRIC? A PARAMETRICNESS INDEX FOR MODEL SELECTION. University of Minnesota Submitted to the Annals of Statistics arxiv: math.pr/0000000 SUPPLEMENT TO PARAMETRIC OR NONPARAMETRIC? A PARAMETRICNESS INDEX FOR MODEL SELECTION By Wei Liu and Yuhong Yang University of Minnesota In

More information

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Summary and discussion of The central role of the propensity score in observational studies for causal effects Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background

More information

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines

Nonparametric Small Area Estimation via M-quantile Regression using Penalized Splines Nonparametric Small Estimation via M-quantile Regression using Penalized Splines Monica Pratesi 10 August 2008 Abstract The demand of reliable statistics for small areas, when only reduced sizes of the

More information

Chapter 8: Estimation 1

Chapter 8: Estimation 1 Chapter 8: Estimation 1 Jae-Kwang Kim Iowa State University Fall, 2014 Kim (ISU) Ch. 8: Estimation 1 Fall, 2014 1 / 33 Introduction 1 Introduction 2 Ratio estimation 3 Regression estimator Kim (ISU) Ch.

More information

Small Sample Corrections for LTS and MCD

Small Sample Corrections for LTS and MCD myjournal manuscript No. (will be inserted by the editor) Small Sample Corrections for LTS and MCD G. Pison, S. Van Aelst, and G. Willems Department of Mathematics and Computer Science, Universitaire Instelling

More information

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Linear Models 1. Isfahan University of Technology Fall Semester, 2014 Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and

More information

Monitoring Random Start Forward Searches for Multivariate Data

Monitoring Random Start Forward Searches for Multivariate Data Monitoring Random Start Forward Searches for Multivariate Data Anthony C. Atkinson 1, Marco Riani 2, and Andrea Cerioli 2 1 Department of Statistics, London School of Economics London WC2A 2AE, UK, a.c.atkinson@lse.ac.uk

More information

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals

Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Inferences for the Ratio: Fieller s Interval, Log Ratio, and Large Sample Based Confidence Intervals Michael Sherman Department of Statistics, 3143 TAMU, Texas A&M University, College Station, Texas 77843,

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

HISTORICAL PERSPECTIVE OF SURVEY SAMPLING

HISTORICAL PERSPECTIVE OF SURVEY SAMPLING HISTORICAL PERSPECTIVE OF SURVEY SAMPLING A.K. Srivastava Former Joint Director, I.A.S.R.I., New Delhi -110012 1. Introduction The purpose of this article is to provide an overview of developments in sampling

More information

Calibration estimation using exponential tilting in sample surveys

Calibration estimation using exponential tilting in sample surveys Calibration estimation using exponential tilting in sample surveys Jae Kwang Kim February 23, 2010 Abstract We consider the problem of parameter estimation with auxiliary information, where the auxiliary

More information

Professors Lin and Ying are to be congratulated for an interesting paper on a challenging topic and for introducing survival analysis techniques to th

Professors Lin and Ying are to be congratulated for an interesting paper on a challenging topic and for introducing survival analysis techniques to th DISCUSSION OF THE PAPER BY LIN AND YING Xihong Lin and Raymond J. Carroll Λ July 21, 2000 Λ Xihong Lin (xlin@sph.umich.edu) is Associate Professor, Department ofbiostatistics, University of Michigan, Ann

More information

arxiv: v1 [stat.co] 26 May 2009

arxiv: v1 [stat.co] 26 May 2009 MAXIMUM LIKELIHOOD ESTIMATION FOR MARKOV CHAINS arxiv:0905.4131v1 [stat.co] 6 May 009 IULIANA TEODORESCU Abstract. A new approach for optimal estimation of Markov chains with sparse transition matrices

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Prediction Intervals in the Presence of Outliers

Prediction Intervals in the Presence of Outliers Prediction Intervals in the Presence of Outliers David J. Olive Southern Illinois University July 21, 2003 Abstract This paper presents a simple procedure for computing prediction intervals when the data

More information

Prediction of Snow Water Equivalent in the Snake River Basin

Prediction of Snow Water Equivalent in the Snake River Basin Hobbs et al. Seasonal Forecasting 1 Jon Hobbs Steve Guimond Nate Snook Meteorology 455 Seasonal Forecasting Prediction of Snow Water Equivalent in the Snake River Basin Abstract Mountainous regions of

More information

Classification via kernel regression based on univariate product density estimators

Classification via kernel regression based on univariate product density estimators Classification via kernel regression based on univariate product density estimators Bezza Hafidi 1, Abdelkarim Merbouha 2, and Abdallah Mkhadri 1 1 Department of Mathematics, Cadi Ayyad University, BP

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc.

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc. DSGE Methods Estimation of DSGE models: GMM and Indirect Inference Willi Mutschler, M.Sc. Institute of Econometrics and Economic Statistics University of Münster willi.mutschler@wiwi.uni-muenster.de Summer

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Hierarchical Modeling for Multivariate Spatial Data

Hierarchical Modeling for Multivariate Spatial Data Hierarchical Modeling for Multivariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

DSGE-Models. Limited Information Estimation General Method of Moments and Indirect Inference

DSGE-Models. Limited Information Estimation General Method of Moments and Indirect Inference DSGE-Models General Method of Moments and Indirect Inference Dr. Andrea Beccarini Willi Mutschler, M.Sc. Institute of Econometrics and Economic Statistics University of Münster willi.mutschler@uni-muenster.de

More information

Semi-parametric estimation of non-stationary Pickands functions

Semi-parametric estimation of non-stationary Pickands functions Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of

More information

The Use of Survey Weights in Regression Modelling

The Use of Survey Weights in Regression Modelling The Use of Survey Weights in Regression Modelling Chris Skinner London School of Economics and Political Science (with Jae-Kwang Kim, Iowa State University) Colorado State University, June 2013 1 Weighting

More information

Improvement in Estimating the Finite Population Mean Under Maximum and Minimum Values in Double Sampling Scheme

Improvement in Estimating the Finite Population Mean Under Maximum and Minimum Values in Double Sampling Scheme J. Stat. Appl. Pro. Lett. 2, No. 2, 115-121 (2015) 115 Journal of Statistics Applications & Probability Letters An International Journal http://dx.doi.org/10.12785/jsapl/020203 Improvement in Estimating

More information

Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators. Calibration Estimators

Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators. Calibration Estimators Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators Jill A. Dever, RTI Richard Valliant, JPSM & ISR is a trade name of Research Triangle Institute. www.rti.org

More information

Testing Restrictions and Comparing Models

Testing Restrictions and Comparing Models Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by

More information

MORE ON SIMPLE REGRESSION: OVERVIEW

MORE ON SIMPLE REGRESSION: OVERVIEW FI=NOT0106 NOTICE. Unless otherwise indicated, all materials on this page and linked pages at the blue.temple.edu address and at the astro.temple.edu address are the sole property of Ralph B. Taylor and

More information

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA This article was downloaded by: [University of New Mexico] On: 27 September 2012, At: 22:13 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered

More information

6.435, System Identification

6.435, System Identification System Identification 6.435 SET 3 Nonparametric Identification Munther A. Dahleh 1 Nonparametric Methods for System ID Time domain methods Impulse response Step response Correlation analysis / time Frequency

More information

Digital Change Detection Using Remotely Sensed Data for Monitoring Green Space Destruction in Tabriz

Digital Change Detection Using Remotely Sensed Data for Monitoring Green Space Destruction in Tabriz Int. J. Environ. Res. 1 (1): 35-41, Winter 2007 ISSN:1735-6865 Graduate Faculty of Environment University of Tehran Digital Change Detection Using Remotely Sensed Data for Monitoring Green Space Destruction

More information

Lecture 3: Statistical Decision Theory (Part II)

Lecture 3: Statistical Decision Theory (Part II) Lecture 3: Statistical Decision Theory (Part II) Hao Helen Zhang Hao Helen Zhang Lecture 3: Statistical Decision Theory (Part II) 1 / 27 Outline of This Note Part I: Statistics Decision Theory (Classical

More information

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting)

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI

More information

Nonparametric regression estimation under complex sampling designs

Nonparametric regression estimation under complex sampling designs Retrospective Theses and Dissertations 2004 Nonparametric regression estimation under complex sampling designs Ji-Yeon Kim Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/rtd

More information

AN INTEGRATED METHOD FOR FOREST CANOPY COVER MAPPING USING LANDSAT ETM+ IMAGERY INTRODUCTION

AN INTEGRATED METHOD FOR FOREST CANOPY COVER MAPPING USING LANDSAT ETM+ IMAGERY INTRODUCTION AN INTEGRATED METHOD FOR FOREST CANOPY COVER MAPPING USING LANDSAT ETM+ IMAGERY Zhongwu Wang, Remote Sensing Analyst Andrew Brenner, General Manager Sanborn Map Company 455 E. Eisenhower Parkway, Suite

More information

A Note on Bayesian Inference After Multiple Imputation

A Note on Bayesian Inference After Multiple Imputation A Note on Bayesian Inference After Multiple Imputation Xiang Zhou and Jerome P. Reiter Abstract This article is aimed at practitioners who plan to use Bayesian inference on multiplyimputed datasets in

More information

BY GRETCHEN G. MOISEN AND TRACEY S. FRESCINO

BY GRETCHEN G. MOISEN AND TRACEY S. FRESCINO COMPARING FIVE MODELLING TECHNIQUES FOR PREDICTING FOREST CHARACTERISTICS (To appear in Ecological Modelling 2002, special issue on advances in GLM/ modelling) BY GRETCHEN G. MOISEN AND TRACEY S. FRESCINO

More information