Nonparametric Small Area Estimation Using Penalized Spline Regression
|
|
- Morgan Oliver
- 6 years ago
- Views:
Transcription
1 Nonparametric Small Area Estimation Using Penalized Spline Regression
2 0verview Spline-based nonparametric regression Nonparametric small area estimation Prediction mean squared error Bootstrapping small area and local effects
3 Penalized spline Regression, often referred to as P-splines popularized by Eilers and Marx (1996) P-splines; attractive smoothing method because of their flexibility also a natural candidate for constructing nonparametric small area estimation Approach; shown consistency of the estimator provided tests for small area effects with bootstrap methods
4 Spline-based nonparametric regression Spline based nonparametric regression model and estimator, consider the simple model; where y i = m(x i ) + e i e i are independent random variables with mean zero and variance σ 2 e m(x i ) is unkown but to be estimated using P-spline
5 It is assumed that the unkown function can be approximated suffeciently well by; where m(x; β, γ) = β 0 + β 1 x β p x p + p is the degree of spline K γ k (x κ k ) p + (1) k=1 (x) n + denotes the function x n I {x>0} β = (β 0,..., β p ) and γ = (γ 1,..., γ K ) are the coefficients for the parametric and the spline portions of the model
6 In P-spline regression; K is typically taken to be large e.g. with 1 knot every 4 or 5 observations Model 1 is over-parameterized which is avoided by a penalty. Define the resgression estimators as the minimizers over β and γ of n (y i m(x i ; β, γ)) 2 + λ γ γ γ i=1 where λ γ is a fixed penalty parameter.
7 It is to interest to treat λ γ as an unkown parameter, Since, different values of λ γ result in different estimates of β and γ treat γ as a random effect possible to estimate jointly λ γ, β and γ by ML methods both the P-spline and the SAE models can be viewed as random effects models combine both into a nonparametric SAE framework based on linear mixed model regression
8 Nonparametric small area estimation Specifically, suppose there are T small areas, U 1,..., U T define d it = I {i Ut} let d i = (d i1,..., d it ) and D = (d 1,..., d n) let Y = (y 1,..., y n ), 1 x 1 x p 1 (x 1 κ 1 ) p + (x 1 κ K ) p + X =..., Z =.. 1 x n xn p (x n κ 1 ) p + (x n κ K ) p +
9 Assume that the data follow the model Y = Xβ + Zγ + Du + e (2) where γ (0, Σ γ ) with Σ γ = σγi 2 K u (0, Σ u ) with Σ u = σui 2 T e (0, Σ e ) with Σ e = σei 2 n V (Y) V = ZΣ γ Z + DΣ u D + Σ e
10 Given the model (2) and the spesifications, the GLS estimator ˆβ = (X VX) 1 X V 1 Y and the predictors ˆγ = Σ γ Z V 1 (Y X ˆβ) û = Σ u D V 1 (Y X ˆβ) are optimal among all linear estimators/predictors
11 For a given small area U t, interested, for example, in predicting where ȳ t = x t β + z t γ + u t x t and z t are the true means of the powers of x i and are assumed known u t is the small area effect and d t u = u t = b t u b t is a vector with 1 in the t-th position and 0 else. As a predictor of ȳ t, ŷ t = x t ˆβ + z t ˆγ + b t û is used which is a linear combination of the GLS estimator and the BLUPs
12 Prediction mean squared error For simplicity, consider the prediction error ŷ ȳ in the case of estimated variance components W = [Z D], w = (γ, u ), w t = ( z t, b t ) Then it is obtained with c t = x t w t Σ w W V 1 X that; ŷ ȳ = c t (ˆβ β) + w t (Σ w W V 1 (Y Xβ) w) (3) The EBLUP ỹ ȳ = ĉ t (ˆβ β) + w t (ˆΣ w W ˆV 1 (Y Xβ) w) The prediction mean squared error of EBLUP predictor can be estimated.
13 Bootstrapping small area and local effects Need of using bootstrap-based procedure; a likelihood ratio test for testing the presence of small area random effects can be constructed. likelihood ratio statistic is often poorly approximated by the asymptotics useful to supplement bootstrap-based procedure Bootstrap replicate observations are generated as Y = Xˆβ + Zγ + Du + e where γ, u and e are bootstrap replicates of the random components in the model.
14 Robust way to draw replicates; to resample from the emprical distributions of the fitted residuals and predictions assume the variances σ 2 are known start with the predictors ˆγ and û obtained earlier denote H = VP where P = V 1 V 1 X(X V 1 X) 1 X V 1 Var(ˆγ) = σ 4 γz V 1 (I H)Z
15 adjust ˆγ to obtain predictors with the correct second moment by letting γ = ˆγ(Z V 1 (I H)Z) 1/2 /σ γ (4) ũ = û(d V 1 (I H)D) 1/2 /σ u (5) to generate estimated errors start from; ê = Y Xˆβ Zˆγ Dû ẽ = ê(v 1 (I H)) 1/2 /σ e (6) bootstrap sampling can be done form γ, ũ and ẽ
16 Some conclusions on current part of the approach; it would create bootstrap random effects and errors that of estimating β the error of the parameter estimation is often very small relative to the prediction error bootstrap approach that ignores estimation is likely perform almost as well as a more complicated procedure used the expressions (4), (5) and (6) after replacing the variance components by their REML estimators similarly found that the matrix H which accounts for the estimation of β had no effect on the adjustments, set H = 0
17 Given B bootstrap samples, the prediction mean squared error for the small areas can be obtained by 1 B B (ỹt b b=1 ȳ b t ) where ȳt b ỹt b is the mean for small area t in the b-th bootstrap sample represents its REML predictor
18 Survey of lakes in the Northeastern states of U.S. Between 1991 and 1996, the Environmental Monitoring and Assessment Program (EMAP) of the U.S. Environmental Protection Agency conducted a survey of lakes in the Northeastern states of U.S. It is considered the estimation of the mean acid neutralizing capacity (ANC) for each of 113 small areas defined by 8-digit Hydrologic Unit Codes(HUC) within the region of interest. ANC measures the buffering capacity of water against negative changes in ph HUCs represent a nested subdivision of all U.S. land based on hydrological features
19 Figure: Locations of sampled lakes in Northeastern U.S.,the region of interest and the locations of the sampled lakes on a population of 21,026 lakes from which 334 lakes were surveyed.
20 Summary for the analysis; the aim to identify HUCs of concern within the region, based on the results from the survey lakes in close geographical proximity but located in different HUCs are expected to be less similar than two lakes in the same HUC factors affecting ANC such as acid deposition and soil characteristics cut across HUCs a HUC prediction model has the potential to capture most of the interesting patterns in the data
21 Figure: Hydrologic Unit Code (HUC) small areas within Northeastern U.S. region, with average ANC computed in all small areas containing sample observations.
22 The variables that can be used in the construction of a SAE model in the application are the geographical coordinates of the centroid of each lake (in the UTM coordinate system) and its elevation. Construction of the small area estimator using a transformed radial basis defined as; [ Z = [C(x i κ k )] 1 k K C(κk κ k ) ] 1/2 1 k,k, i = (1,..., n) K where C(r) = r 2 log r and x i = (x 1i, x 2i ) denotes the geographical coordinates for observation i and κ k, k = 1,..., K are spline knots. The locations of the K = 80 knots are selected by the space-filling algorithm.
23 Figure: Shows the locations of the knots selected by the space-filling algorithm. Lake locations (open circles) and knot locations of the bivariate radial spline function on the UTM coordinates(solid circles).
24 The ANC small area model can now be written as in (2); That model includes Y for the ANC observations Y = Xβ + Zγ + Du + e X a matrix containing an intercept and the linear elevation term Z for the spatial locations D a matrix of indicators for the HUCs Then the model can be fitted using REML.
25 Figure: Parameter estimates for penalized spline small area estimation model for Northeastern Lakes data.the P-values are computed using the bootstrap procedure described earlier.
26 Figure: Map of model predicted mean ANC for all HUCs. The small area estimation map is smoother and also contains values in all HUCs, offsetting some of the limitations of the original data.
27 Figure: Plot of estimated root EMSE for the small area predictions obtained by the bootstrap method against the estimated root EMSE.
28 Figure: Comparison of AIC values and correlations between HUC model predictions and averages of the sample observations in the HUCs, for inclusion and exclusion of random effect terms in model; Left number in each cell is AIC, right number is correlation.
29 Figure: (a)comparison of HUC predictions for model with both random effects and model with HUC random effect only. (b)comparison of HUC predictions for model with both random effects and model with spline random effect only.
30 Some conclusions; inclusion of a spatial spline can improve the fit relative to a model which only uses a random effect for the small areas this would be done in traditional small area estimation
31 Thank You..
32 References Eilers, P. H. C. and B. D. Marx (1996). Flexible smoothing with B-splines and penalties. Statistical Science 11 (2), Jiang, J. and P. Lahiri (2006). Mixed model prediction and small area estimation. Test 15, 196. Opsomer, J.D., Claeskens, G., Ranalli, M.G., Kauermann, G., Breidt, F. J. (2006). Nonparametric Small Area Estimation Using Penalized Spline Regression Journal of the Royal Stat. Society, Series B, 70, Ruppert, R., M. Wand, and R. Carroll (2003). Semiparametric Regression. Cambridge University Press.
Nonparametric Small Area Estimation Using Penalized Spline Regression
Nonparametric Small Area Estimation Using Penalized Spline Regression J. D. Opsomer Iowa State University G. Claeskens Katholieke Universiteit Leuven M. G. Ranalli Universita degli Studi di Perugia G.
More informationTwo Applications of Nonparametric Regression in Survey Estimation
Two Applications of Nonparametric Regression in Survey Estimation 1/56 Jean Opsomer Iowa State University Joint work with Jay Breidt, Colorado State University Gerda Claeskens, Université Catholique de
More informationNonparametric Small Area Estimation via M-quantile Regression using Penalized Splines
Nonparametric Small Estimation via M-quantile Regression using Penalized Splines Monica Pratesi 10 August 2008 Abstract The demand of reliable statistics for small areas, when only reduced sizes of the
More informationPenalized Balanced Sampling. Jay Breidt
Penalized Balanced Sampling Jay Breidt Colorado State University Joint work with Guillaume Chauvet (ENSAI) February 4, 2010 1 / 44 Linear Mixed Models Let U = {1, 2,...,N}. Consider linear mixed models
More informationSmall Area Estimation Using a Nonparametric Model Based Direct Estimator
University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Information Sciences 2009 Small Area Estimation Using a Nonparametric
More informationSmall Area Estimation for Skewed Georeferenced Data
Small Area Estimation for Skewed Georeferenced Data E. Dreassi - A. Petrucci - E. Rocco Department of Statistics, Informatics, Applications "G. Parenti" University of Florence THE FIRST ASIAN ISI SATELLITE
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of
More informationLikelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science
1 Likelihood Ratio Tests that Certain Variance Components Are Zero Ciprian M. Crainiceanu Department of Statistical Science www.people.cornell.edu/pages/cmc59 Work done jointly with David Ruppert, School
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics
More informationExact Likelihood Ratio Tests for Penalized Splines
Exact Likelihood Ratio Tests for Penalized Splines By CIPRIAN CRAINICEANU, DAVID RUPPERT, GERDA CLAESKENS, M.P. WAND Department of Biostatistics, Johns Hopkins University, 615 N. Wolfe Street, Baltimore,
More informationIntegrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University
Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y
More informationThe Hodrick-Prescott Filter
The Hodrick-Prescott Filter A Special Case of Penalized Spline Smoothing Alex Trindade Dept. of Mathematics & Statistics, Texas Tech University Joint work with Rob Paige, Missouri University of Science
More informationOn the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models
On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of
More informationPenalized Splines, Mixed Models, and Recent Large-Sample Results
Penalized Splines, Mixed Models, and Recent Large-Sample Results David Ruppert Operations Research & Information Engineering, Cornell University Feb 4, 2011 Collaborators Matt Wand, University of Wollongong
More informationSome properties of Likelihood Ratio Tests in Linear Mixed Models
Some properties of Likelihood Ratio Tests in Linear Mixed Models Ciprian M. Crainiceanu David Ruppert Timothy J. Vogelsang September 19, 2003 Abstract We calculate the finite sample probability mass-at-zero
More informationSmall Area Modeling of County Estimates for Corn and Soybean Yields in the US
Small Area Modeling of County Estimates for Corn and Soybean Yields in the US Matt Williams National Agricultural Statistics Service United States Department of Agriculture Matt.Williams@nass.usda.gov
More informationIntroduction to Statistical modeling: handout for Math 489/583
Introduction to Statistical modeling: handout for Math 489/583 Statistical modeling occurs when we are trying to model some data using statistical tools. From the start, we recognize that no model is perfect
More informationMultidimensional Density Smoothing with P-splines
Multidimensional Density Smoothing with P-splines Paul H.C. Eilers, Brian D. Marx 1 Department of Medical Statistics, Leiden University Medical Center, 300 RC, Leiden, The Netherlands (p.eilers@lumc.nl)
More informationComments on: Model-free model-fitting and predictive distributions : Applications to Small Area Statistics and Treatment Effect Estimation
Article Comments on: Model-free model-fitting and predictive distributions : Applications to Small Area Statistics and Treatment Effect Estimation SPERLICH, Stefan Andréas Reference SPERLICH, Stefan Andréas.
More informationCross-validation in model-assisted estimation
Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 009 Cross-validation in model-assisted estimation Lifeng You Iowa State University Follow this and additional
More informationRestricted Likelihood Ratio Tests in Nonparametric Longitudinal Models
Restricted Likelihood Ratio Tests in Nonparametric Longitudinal Models Short title: Restricted LR Tests in Longitudinal Models Ciprian M. Crainiceanu David Ruppert May 5, 2004 Abstract We assume that repeated
More informationASYMPTOTICS FOR PENALIZED SPLINES IN ADDITIVE MODELS
Mem. Gra. Sci. Eng. Shimane Univ. Series B: Mathematics 47 (2014), pp. 63 71 ASYMPTOTICS FOR PENALIZED SPLINES IN ADDITIVE MODELS TAKUMA YOSHIDA Communicated by Kanta Naito (Received: December 19, 2013)
More informationVariable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting
Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Andreas Groll 1 and Gerhard Tutz 2 1 Department of Statistics, University of Munich, Akademiestrasse 1, D-80799, Munich,
More informationAndreas Blöchl: Trend Estimation with Penalized Splines as Mixed Models for Series with Structural Breaks
Andreas Blöchl: Trend Estimation with Penalized Splines as Mixed Models for Series with Structural Breaks Munich Discussion Paper No. 2014-2 Department of Economics University of Munich Volkswirtschaftliche
More informationSome Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model
Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model 1. Introduction Varying-coefficient partially linear model (Zhang, Lee, and Song, 2002; Xia, Zhang, and Tong, 2004;
More informationBootstrap. Director of Center for Astrostatistics. G. Jogesh Babu. Penn State University babu.
Bootstrap G. Jogesh Babu Penn State University http://www.stat.psu.edu/ babu Director of Center for Astrostatistics http://astrostatistics.psu.edu Outline 1 Motivation 2 Simple statistical problem 3 Resampling
More informationML and REML Variance Component Estimation. Copyright c 2012 Dan Nettleton (Iowa State University) Statistics / 58
ML and REML Variance Component Estimation Copyright c 2012 Dan Nettleton (Iowa State University) Statistics 611 1 / 58 Suppose y = Xβ + ε, where ε N(0, Σ) for some positive definite, symmetric matrix Σ.
More informationSpace-time modelling of air pollution with array methods
Space-time modelling of air pollution with array methods Dae-Jin Lee Royal Statistical Society Conference Edinburgh 2009 D.-J. Lee (Uc3m) GLAM: Array methods in Statistics RSS 09 - Edinburgh # 1 Motivation
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationOutlier robust small area estimation
University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2014 Outlier robust small area estimation Ray Chambers
More informationNonparametric Regression
Nonparametric Regression Econ 674 Purdue University April 8, 2009 Justin L. Tobias (Purdue) Nonparametric Regression April 8, 2009 1 / 31 Consider the univariate nonparametric regression model: where y
More informationFlexible Spatio-temporal smoothing with array methods
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and
More informationmboost - Componentwise Boosting for Generalised Regression Models
mboost - Componentwise Boosting for Generalised Regression Models Thomas Kneib & Torsten Hothorn Department of Statistics Ludwig-Maximilians-University Munich 13.8.2008 Boosting in a Nutshell Boosting
More informationA Penalized spline approach to functional mixed effects model analysis
Biometrics 63, 1?? December 2007 DOI: 10.1111/j.1541-0420.2005.00454.x A Penalized spline approach to functional mixed effects model analysis Huaihou Chen Department of Biostatistics, Mailman School of
More informationAdditive Isotonic Regression
Additive Isotonic Regression Enno Mammen and Kyusang Yu 11. July 2006 INTRODUCTION: We have i.i.d. random vectors (Y 1, X 1 ),..., (Y n, X n ) with X i = (X1 i,..., X d i ) and we consider the additive
More informationSpatio-temporal prediction of site index based on forest inventories and climate change scenarios
Forest Research Institute Spatio-temporal prediction of site index based on forest inventories and climate change scenarios Arne Nothdurft 1, Thilo Wolf 1, Andre Ringeler 2, Jürgen Böhner 2, Joachim Saborowski
More informationSelection of Variables and Functional Forms in Multivariable Analysis: Current Issues and Future Directions
in Multivariable Analysis: Current Issues and Future Directions Frank E Harrell Jr Department of Biostatistics Vanderbilt University School of Medicine STRATOS Banff Alberta 2016-07-04 Fractional polynomials,
More informationSimultaneous Confidence Bands for the Coefficient Function in Functional Regression
University of Haifa From the SelectedWorks of Philip T. Reiss August 7, 2008 Simultaneous Confidence Bands for the Coefficient Function in Functional Regression Philip T. Reiss, New York University Available
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More information9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures
FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models
More informationA measurement error model approach to small area estimation
A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion
More informationGeneralized Additive Models
Generalized Additive Models The Model The GLM is: g( µ) = ß 0 + ß 1 x 1 + ß 2 x 2 +... + ß k x k The generalization to the GAM is: g(µ) = ß 0 + f 1 (x 1 ) + f 2 (x 2 ) +... + f k (x k ) where the functions
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject
More informationA Tolerance Interval Approach for Assessment of Agreement in Method Comparison Studies with Repeated Measurements
A Tolerance Interval Approach for Assessment of Agreement in Method Comparison Studies with Repeated Measurements Pankaj K. Choudhary 1 Department of Mathematical Sciences, University of Texas at Dallas
More informationMEI Exam Review. June 7, 2002
MEI Exam Review June 7, 2002 1 Final Exam Revision Notes 1.1 Random Rules and Formulas Linear transformations of random variables. f y (Y ) = f x (X) dx. dg Inverse Proof. (AB)(AB) 1 = I. (B 1 A 1 )(AB)(AB)
More informationWeb Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.
Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we
More informationChapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70
Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:
More informationRegression When the Predictors Are Images
University of Haifa From the SelectedWorks of Philip T. Reiss May, 2009 Regression When the Predictors Are Images Philip T. Reiss, New York University Available at: https://works.bepress.com/phil_reiss/15/
More informationSmall area semiparametric additive monotone models
Small area semiparametric additive monotone models C. Rueda 1, M.J. Lombardía 2 1 Dpto. Estadistica. University of Valladolid. Spain 2 Dpto. Matemáticas. University of Coruña. Spain Abstract In this paper,
More informationOn Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness
Statistics and Applications {ISSN 2452-7395 (online)} Volume 16 No. 1, 2018 (New Series), pp 289-303 On Modifications to Linking Variance Estimators in the Fay-Herriot Model that Induce Robustness Snigdhansu
More informationEstimation of cumulative distribution function with spline functions
INTERNATIONAL JOURNAL OF ECONOMICS AND STATISTICS Volume 5, 017 Estimation of cumulative distribution function with functions Akhlitdin Nizamitdinov, Aladdin Shamilov Abstract The estimation of the cumulative
More informationLikelihood ratio testing for zero variance components in linear mixed models
Likelihood ratio testing for zero variance components in linear mixed models Sonja Greven 1,3, Ciprian Crainiceanu 2, Annette Peters 3 and Helmut Küchenhoff 1 1 Department of Statistics, LMU Munich University,
More informationRegression, Ridge Regression, Lasso
Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.
More informationAIR FORCE INSTITUTE OF TECHNOLOGY
TOPICS IN STATISTICAL CALIBRATION DISSERTATION Brandon M. Greenwell, Civilian AFIT-ENC-DS-14-M-01 DEPARTMENT OF THE AIR FORCE AIR UNIVERSITY AIR FORCE INSTITUTE OF TECHNOLOGY Wright-Patterson Air Force
More informationIf you believe in things that you don t understand then you suffer. Stevie Wonder, Superstition
If you believe in things that you don t understand then you suffer Stevie Wonder, Superstition For Slovenian municipality i, i = 1,, 192, we have SIR i = (observed stomach cancers 1995-2001 / expected
More information1. Introduction. Keywords: Auxiliary information; Nonparametric regression; Pseudo empirical likelihood; Model-assisted approach; MARS.
Nonparametric Methods for Sample Surveys of Environmental Populations Metodi nonparametrici nell inferenza per popolazioni finite di carattere ambientale Giorgio E. Montanari Dipartimento di Economia,
More informationA NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES
Statistica Sinica 24 (2014), 1143-1160 doi:http://dx.doi.org/10.5705/ss.2012.230 A NOTE ON A NONPARAMETRIC REGRESSION TEST THROUGH PENALIZED SPLINES Huaihou Chen 1, Yuanjia Wang 2, Runze Li 3 and Katherine
More informationHeteroskedasticity-Robust Inference in Finite Samples
Heteroskedasticity-Robust Inference in Finite Samples Jerry Hausman and Christopher Palmer Massachusetts Institute of Technology December 011 Abstract Since the advent of heteroskedasticity-robust standard
More informationGeneralized Linear Models
Generalized Linear Models Lecture 3. Hypothesis testing. Goodness of Fit. Model diagnostics GLM (Spring, 2018) Lecture 3 1 / 34 Models Let M(X r ) be a model with design matrix X r (with r columns) r n
More informationRegression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin
Regression Review Statistics 149 Spring 2006 Copyright c 2006 by Mark E. Irwin Matrix Approach to Regression Linear Model: Y i = β 0 + β 1 X i1 +... + β p X ip + ɛ i ; ɛ i iid N(0, σ 2 ), i = 1,..., n
More informationEstimation of Panel Data Stochastic Frontier Models with. Nonparametric Time Varying Inefficiencies 1
Estimation of Panel Data Stochastic Frontier Models with Nonparametric Time Varying Inefficiencies Gholamreza Hajargasht Shiraz University Shiraz, Iran D.S. Prasada Rao Centre for Efficiency and Productivity
More informationMultivariate Calibration with Robust Signal Regression
Multivariate Calibration with Robust Signal Regression Bin Li and Brian Marx from Louisiana State University Somsubhra Chakraborty from Indian Institute of Technology Kharagpur David C Weindorf from Texas
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More information20. REML Estimation of Variance Components. Copyright c 2018 (Iowa State University) 20. Statistics / 36
20. REML Estimation of Variance Components Copyright c 2018 (Iowa State University) 20. Statistics 510 1 / 36 Consider the General Linear Model y = Xβ + ɛ, where ɛ N(0, Σ) and Σ is an n n positive definite
More informationLOCAL POLYNOMIAL AND PENALIZED TRIGONOMETRIC SERIES REGRESSION
Statistica Sinica 24 (2014), 1215-1238 doi:http://dx.doi.org/10.5705/ss.2012.040 LOCAL POLYNOMIAL AND PENALIZED TRIGONOMETRIC SERIES REGRESSION Li-Shan Huang and Kung-Sik Chan National Tsing Hua University
More information10. Time series regression and forecasting
10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the
More informationHigh-dimensional regression
High-dimensional regression Advanced Methods for Data Analysis 36-402/36-608) Spring 2014 1 Back to linear regression 1.1 Shortcomings Suppose that we are given outcome measurements y 1,... y n R, and
More informationECON The Simple Regression Model
ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In
More informationIssues on quantile autoregression
Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides
More informationF. Jay Breidt Colorado State University
Model-assisted survey regression estimation with the lasso 1 F. Jay Breidt Colorado State University Opening Workshop on Computational Methods in Social Sciences SAMSI August 2013 This research was supported
More informationSome general observations.
Modeling and analyzing data from computer experiments. Some general observations. 1. For simplicity, I assume that all factors (inputs) x1, x2,, xd are quantitative. 2. Because the code always produces
More informationVariable Selection and Model Choice in Survival Models with Time-Varying Effects
Variable Selection and Model Choice in Survival Models with Time-Varying Effects Boosting Survival Models Benjamin Hofner 1 Department of Medical Informatics, Biometry and Epidemiology (IMBE) Friedrich-Alexander-Universität
More informationLecture 6 Multiple Linear Regression, cont.
Lecture 6 Multiple Linear Regression, cont. BIOST 515 January 22, 2004 BIOST 515, Lecture 6 Testing general linear hypotheses Suppose we are interested in testing linear combinations of the regression
More informationBootstrap Testing in Econometrics
Presented May 29, 1999 at the CEA Annual Meeting Bootstrap Testing in Econometrics James G MacKinnon Queen s University at Kingston Introduction: Economists routinely compute test statistics of which the
More informationAccounting for Complex Sample Designs via Mixture Models
Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3
More informationWeighted Least Squares
Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w
More informationThe linear model is the most fundamental of all serious statistical models encompassing:
Linear Regression Models: A Bayesian perspective Ingredients of a linear model include an n 1 response vector y = (y 1,..., y n ) T and an n p design matrix (e.g. including regressors) X = [x 1,..., x
More informationModel Selection for Geostatistical Models
Model Selection for Geostatistical Models Richard A. Davis Colorado State University http://www.stat.colostate.edu/~rdavis/lectures Joint work with: Jennifer A. Hoeting, Colorado State University Andrew
More informationQuick Review on Linear Multiple Regression
Quick Review on Linear Multiple Regression Mei-Yuan Chen Department of Finance National Chung Hsing University March 6, 2007 Introduction for Conditional Mean Modeling Suppose random variables Y, X 1,
More informationCorrelated Spatiotemporal Data Modeling Using Generalized Additive Mixed Model and Bivariate Smoothing Techniques
Science Journal of Applied Mathematics and Statistics 2018; 6(2): 49-57 http://www.sciencepublishinggroup.com/j/sjams doi: 10.11648/j.sjams.20180602.11 ISSN: 2376-9491 (Print); ISSN: 2376-9513 (Online)
More informationPrediction Intervals for Regression Models
Southern Illinois University Carbondale OpenSIUC Articles and Preprints Department of Mathematics 3-2007 Prediction Intervals for Regression Models David J. Olive Southern Illinois University Carbondale,
More informationLinear Regression Model. Badr Missaoui
Linear Regression Model Badr Missaoui Introduction What is this course about? It is a course on applied statistics. It comprises 2 hours lectures each week and 1 hour lab sessions/tutorials. We will focus
More informationSpatial Lasso with Applications to GIS Model Selection
Spatial Lasso with Applications to GIS Model Selection Hsin-Cheng Huang Institute of Statistical Science, Academia Sinica Nan-Jung Hsu National Tsing-Hua University David Theobald Colorado State University
More informationRLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models
RLRsim: Testing for Random Effects or Nonparametric Regression Functions in Additive Mixed Models Fabian Scheipl 1 joint work with Sonja Greven 1,2 and Helmut Küchenhoff 1 1 Department of Statistics, LMU
More informationStatistics for analyzing and modeling precipitation isotope ratios in IsoMAP
Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationP -spline ANOVA-type interaction models for spatio-temporal smoothing
P -spline ANOVA-type interaction models for spatio-temporal smoothing Dae-Jin Lee 1 and María Durbán 1 1 Department of Statistics, Universidad Carlos III de Madrid, SPAIN. e-mail: dae-jin.lee@uc3m.es and
More informationREGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University
REGRESSION ITH SPATIALL MISALIGNED DATA Lisa Madsen Oregon State University David Ruppert Cornell University SPATIALL MISALIGNED DATA 10 X X X X X X X X 5 X X X X X 0 X 0 5 10 OUTLINE 1. Introduction 2.
More informationWeighted Least Squares
Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w
More informationRobust Estimation for Ordinary Differential Equation Models
Biometrics, 1 21 DOI: 000 Robust Estimation for Ordinary Differential Equation Models J. Cao 1,, L. Wang 2, and J. Xu 1 1 Department of Statistics & Actuarial Science, Simon Fraser University, Burnaby,
More informationLecture 6: Methods for high-dimensional problems
Lecture 6: Methods for high-dimensional problems Hector Corrada Bravo and Rafael A. Irizarry March, 2010 In this Section we will discuss methods where data lies on high-dimensional spaces. In particular,
More informationESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS
ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,
More informationModeling Real Estate Data using Quantile Regression
Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices
More informationNonparametric Regression. Badr Missaoui
Badr Missaoui Outline Kernel and local polynomial regression. Penalized regression. We are given n pairs of observations (X 1, Y 1 ),...,(X n, Y n ) where Y i = r(x i ) + ε i, i = 1,..., n and r(x) = E(Y
More informationChapter 1. Linear Regression with One Predictor Variable
Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical
More informationFast Adaptive Penalized Splines
Fast Adaptive Penalized Splines Tatyana Krivobokova Katholieke Universiteit Leuven Göran Kauermann Universität Bielefeld 17th April 2007 Ciprian M. Crainiceanu Johns Hopkins University Abstract This paper
More informationRegularization in Cox Frailty Models
Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University
More informationLecture 19 Multiple (Linear) Regression
Lecture 19 Multiple (Linear) Regression Thais Paiva STA 111 - Summer 2013 Term II August 1, 2013 1 / 30 Thais Paiva STA 111 - Summer 2013 Term II Lecture 19, 08/01/2013 Lecture Plan 1 Multiple regression
More informationUniversity of Minnesota Duluth
Trend analyses for species of concern: Analysis of CPUE data for walleye, cisco, and smallmouth bass 1970-2008 Dave Staples 1, Lucinda Johnson 2, Jennifer Olker 2, Dan Brenneman 2 1 Minnesota Department
More informationUNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75
More information