An anisotropic Matérn spatial covariance model: REML estimation and properties

Similar documents
Effect of 3D Stress States at Crack Front on Deformation, Fracture and Fatigue Phenomena

The Dynamics of Potassium in some. Australian soils

Application of a Helmholtz resonator excited by grazing flow for manipulation of a turbulent boundary layer

Butje Alfonsius Louk Fanggi

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data

RESIDENTIAL SATISFACTION IN THE CHANGING URBAN FORM IN ADELAIDE: A COMPARATIVE ANALYSIS OF MAWSON LAKES AND CRAIGBURN FARM, SOUTH AUSTRALIA

Acacia victoriae and its Association with Sugary- Exudate Producing Herbivorous Insects and Trailing Dominant Ants in Semi-arid Australia

Phase-averaged analysis of an oscillating water column wave energy converter

Curvature measures for generalized linear models

On dealing with spatially correlated residuals in remote sensing and GIS

Description and Mechanisms of Bacterial Growth Responses to Water Activity and Compatible Solutes

Analysis of Idle Power and Iron Loss Reduction in an Interior PM Automotive Alternator

Basics of Point-Referenced Data Models

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Geostatistical Applications for Precision Agriculture

MODAL EXPANSION THEORIES FOR SINGLY - PERIODIC DIFFRACTION GRATINGS

STRUCTURAL PERMEABILITY IN AUSTRALIAN SEDIMENTARY BASINS

The performance of estimation methods for generalized linear mixed models

Statistícal Methods for Spatial Data Analysis

Doctor of Philosophy

Spatial analysis of a designed experiment. Uniformity trials. Blocking

Chapter 4 - Fundamentals of spatial processes Lecture notes

The influence of abiotic factors on the impact of a native stem hemiparasite on introduced versus native hosts.

Introduction to Spatial Data and Models

Multi-order Vector Finite Element Modeling of 3D Magnetotelluric Data Including Complex Geometry and Anisotropy. Aixa M.

A linear mixed model, with non-stationary mean and covariance, for soil potassium based on gamma radiometry

Acoustic Signal Processing. Algorithms for Reverberant. Environments

Cannibalism in Barramundi Lates calcarifer: Understanding Functional Mechanisms and Implication to Aquaculture

Introduction to Spatial Data and Models

Responses of temperate mobile. macroinvertebrates to reef habitat. structure and protection from fishing. Timothy John Alexander, B.Sc.

Estimation for state space models: quasi-likelihood and asymptotic quasi-likelihood approaches

Bayesian Transgaussian Kriging

Mir Md. Maruf Morshed

LESOTHO HIGH SCHOOL STUDENTS CONCEPTIONS OF EARTHQUAKES

Origin of Overpressure and Pore Pressure Prediction in Carbonate Reservoirs of the Abadan Plain Basin

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

Experimental designs for multiple responses with different models

Hydraulic habitat in the River Murray: the influence of geomorphology and large wood

What s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19

Effect of cations on structural. stability of salt-affected soils. Alla Marchuk

Transiogram: A spatial relationship measure for categorical data

Estimating the radiation environment in the Great Barrier Reef

Springer Berlin Heidelberg New York Barcelona Budapest Hong Kong London Milan Paris Santa Clara Singapore Tokyo

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

Introduction to Geostatistics

The boundedness of the Riesz transform on a metric cone

On Sturmian and Episturmian Words, and Related Topics

Estimation of direction of increase of gold mineralisation using pair-copulas

Geostatistics for Seismic Data Integration in Earth Models

Bronwyn Anne Camac, B. App. Sc. Geology. Australian School of Petroleum. The University of Adelaide. This thesis is submitted in fulfilment of the

Red Mud Minimisation and Management for the Alumina Industry by the Carbonation Method

Non-gaussian spatiotemporal modeling

Statistical Methods. for Forecasting

A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling

STATISTICS-STAT (STAT)

Point-Referenced Data Models

Newey, Philip Simon (2009) Colony mate recognition in the weaver ant Oecophylla smaragdina. PhD thesis, James Cook University.

Types of Spatial Data

Xtra, Xtra - read all about it, DiGGerXtra! Mario D Antuono, Neil Coombes and Dongwen Luo

Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27

Uncertainty in merged radar - rain gauge rainfall products

Adaptive Sampling of Clouds with a Fleet of UAVs: Improving Gaussian Process Regression by Including Prior Knowledge

Conditional Distribution Fitting of High Dimensional Stationary Data

A FIRST COURSE IN INTEGRAL EQUATIONS

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Advances in Locally Varying Anisotropy With MDS

Stat 5101 Lecture Notes

The presentation is entitled ANISOTROPY OF THE ROCK QUALITY DESIGNATION & ITS GEOSTATISTICAL EVALUATION».

AN ELECTRO-THERMAL APPROACH TO ACTIVE THERMOGRAPHY RAJESH GUPTA

Weed management in sugar cane: critical periods of weed competition and mechanisms of interference from Paspalum paniculatum and P.

APPLIED STRUCTURAL EQUATION MODELLING FOR RESEARCHERS AND PRACTITIONERS. Using R and Stata for Behavioural Research

MACHINE LEARNING FOR GEOLOGICAL MAPPING: ALGORITHMS AND APPLICATIONS

NON-NUMERICAL RANKING BASED ON PAIRWISE COMPARISONS

The Active Young Solar-Type Star HR 1817 (= HD 35850)

What s for today. Random Fields Autocovariance Stationarity, Isotropy. c Mikyoung Jun (Texas A&M) stat647 Lecture 2 August 30, / 13

Chapter 4 - Fundamentals of spatial processes Lecture notes

Discriminant Analysis and Statistical Pattern Recognition

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

The ProbForecastGOP Package

STRUCfURE AND REACfIVITY OF THE. 1,3-DlOXOLAN-2-YLIUM ION SYSTEM. JOHN PAUL BELLAVIA, B. Sc. A Thesis. Submitted to the School of Graduate Studies

Spatial Linear Geostatistical Models

4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations

Covariance function estimation in Gaussian process regression

Summary STK 4150/9150

ASSESSING AND EVALUATING RECREATION RESOURCE IMPACTS: SPATIAL ANALYTICAL APPROACHES. Yu-Fai Leung

FATIGUE BEHAVIOUR OF OFFSHORE STEEL JACKET PLATFORMS

for Complex Environmental Models

Practical application of drill hole spacing analysis in coal resource estimation

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

A robust statistically based approach to estimating the probability of contamination occurring between sampling locations

Free surface flows over submerged obstructions

Interpolation of Spatial Data

Integrated reliable and robust design

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Contrasting Models for Lactation Curve Analysis

INTEGRATING DIVERSE CALIBRATION PRODUCTS TO IMPROVE SEISMIC LOCATION

Talk Outline. An Uncertainty Analysis for Predicting CEC and salinity Using gamma-ray and EM data. Problem Definition? Digital soil mapping;

Transcription:

An anisotropic Matérn spatial covariance model: REML estimation and properties Kathryn Anne Haskard Doctor of Philosophy November 2007 Supervisors: Arūnas Verbyla and Brian Cullis THE UNIVERSITY OF ADELAIDE School of Agriculture and Wine BiometricsSA

Contents 1 Introduction 1 1.1 Background.................................... 1 1.2 Advances..................................... 1 1.3 Structure of the thesis.............................. 3 1.4 Principal contributions.............................. 4 2 Classical geostatistics 5 2.1 What is geostatistics?............................... 5 2.2 Aims of spatial data analysis........................... 6 2.3 Notation for geostatistical spatial data..................... 6 2.4 Approaches to spatial data analysis....................... 6 2.5 Stationarity.................................... 8 2.6 Isotropy...................................... 9 2.7 The spatial covariance function......................... 10 2.8 Separability.................................... 12 2.9 The variogram................................... 13 2.10 The empirical variogram............................. 18 2.11 Estimating the variogram............................ 19 2.12 Kriging, assuming a known variogram...................... 19 2.13 The nugget effect................................. 24 2.14 Kriging using covariance............................. 27 2.15 Classical and Gaussian geostatistics....................... 28 3 Model-based geostatistics 29 3.1 Gaussian spatial processes............................ 29 3.2 Linear models for spatial data.......................... 29 3.3 Gaussian linear mixed models.......................... 31 3.4 Residual maximum likelihood (REML)..................... 32 3.5 Prediction in the linear mixed models context................. 34 3.6 Kriging using linear mixed models........................ 36 3.7 Plug-in predictions................................ 38 3.8 Modelling a nugget effect............................. 39 i

ii CONTENTS 3.9 A general prediction result............................ 40 3.10 Application to model-based spatial prediction................. 41 3.11 Kriging using linear mixed models Summary................ 43 4 The Matérn class of spatial covariance models 45 4.1 Matérn correlation................................ 45 4.2 Alternative parameterisations.......................... 48 4.3 Derivatives..................................... 49 4.4 Numerical difficulties............................... 55 4.5 REML Implementation.............................. 65 4.6 Initial values.................................... 66 5 Geometric anisotropy and the Matérn Class 69 5.1 Geometric anisotropy............................... 69 5.2 Derivatives for geometric anisotropy....................... 71 5.3 Matérn correlation with geometric anisotropy................. 71 5.4 Minkowski metric................................. 75 5.5 Initial values.................................... 76 5.6 Separable AR(1) AR(1)............................. 77 5.7 A test for anisotropy............................... 78 5.8 ASReml implementation............................. 85 5.9 Extended Matérn class.............................. 86 6 Three examples 89 6.1 Soil salinity ECa data for rice growing..................... 89 6.1.1 Obtaining initial values.......................... 93 6.1.2 Results................................... 94 6.1.3 Implications for predictions....................... 98 6.1.4 Predictions for the study region..................... 99 6.2 Fine-scale soil ph................................. 102 6.3 Cashmore soil water data............................. 106 6.4 Concluding remarks................................ 118 7 Simulations 121 7.1 Motivation..................................... 121 7.2 Parameter combinations............................. 123 7.3 Sampling schemes................................. 124 7.4 Convergence criteria and measures of performance............... 125 7.5 Results....................................... 130 7.5.1 Isotropic simulations without nugget.................. 130 7.5.2 Isotropic simulations with nugget.................... 137

7.5.3 Anisotropic simulations without nugget................. 141 7.5.4 Anisotropic simulations with nugget................... 144 7.6 Approximate standard errors........................... 144 7.7 Comparing simulation results.......................... 149 7.8 Discussion and Conclusions........................... 155 8 Model misspecification and PEV 161 8.1 Motivation..................................... 161 8.2 Method...................................... 161 8.3 Results....................................... 164 9 Overview and Concluding Remarks 169 9.1 Background.................................... 169 9.2 Anisotropic Matérn covariance.......................... 169 9.3 Application to examples............................. 170 9.4 Simulation results................................. 171 9.5 Model misspecification and mean squared error of prediction......... 171 9.6 Further work................................... 172 Appendices 174 A Matrix Results 175 A.1 Partitioned matrix inverses............................ 175 A.2 Matrix differentiation............................... 178 B Results for modified Bessel functions 181 C Conditional Gaussian distribution 183 D Angular function results 185 E Exit codes from ASReml 187 F Standard errors based on expected information 191 G Reprint of paper 193 iii

iv

Abstract v This thesis concerns the development, estimation and investigation of a general anisotropic spatial correlation function, within model-based geostatistics, expressed as a Gaussian linear mixed model, and estimated using residual maximum likelihood (REML). The Matérn correlation function is attractive because of its parameter which controls smoothness of the spatial process, and which can be estimated from the data. This function is combined with geometric anisotropy, with an extension permitting different distance metrics, forming a flexible spatial covariance model which incorporates as special cases many infiniterange spatial covariance functions in common use. Derivatives of the residual log-likelihood with respect to the four correlation-model parameters are derived, and the REML algorithm coded in Splus for testing and refinement as a precursor to its implementation into the software ASReml, with additional generality of linear mixed models. Suggestions are given regarding initial values for the estimation. A residual likelihood ratio test for anisotropy is also developed and investigated. Application to three soil-based examples reveals that anisotropy does occur in practice, and that this technique is able to fit covariance models previously unavailable or inaccessible. Simulations of isotropic and anisotropic data with and without a nugget effect reveal the following principal points. Inclusion of some closely-spaced locations greatly improves estimation, particularly of the Matérn smoothness parameter, and of the nugget variance when present. The presence of geometric anisotropy does not adversely affect parameter estimation. Presence of a nugget effect introduces greater uncertainty into the parameter estimates, most dramatically for the smoothness parameter, and also increases the chance of non-convergence and decreases the power of the test for anisotropy. Estimation is more difficult with very unsmooth processes (Matérn smoothness parameter 0.1 or 0.25) nonconvergence is more likely and estimates are less precise and/or more biased. However it is still often possible to fit the full model including both anisotropy and nugget effect using REML with as few as 100 observations. Additional simulations involving model misspecification reveal that ignoring anisotropy when it is present can substantially increase the mean squared error of prediction, but overfitting by attempting to model anisotropy when it is absent is less damaging. Further, plug-in estimates of prediction error variance are reasonable estimates of the actual mean squared error of prediction, regardless of the model fitted, weakening the argument requiring Bayesian approaches to properly allow for uncertainty in the parameter estimates when estimating prediction error variance. The most valuable outcome of this research is the implementation of an anisotropic Matérn correlation function in ASReml, including the full generality of Gaussian linear

vi mixed models which permits additional fixed and random effects, making publicly available the facility to fit, via REML estimation, a much wider range of variance models than has previously been readily accessible. This greatly increases the probability and ease with which a well-fitting covariance model can be found for a spatial data set, thus contributing to improved geostatistical spatial analysis.

Declaration vii This work contains no material which has been accepted for the award of any other degree or diploma in any university or other tertiary institution and, to the best of my knowledge and belief, contains no material previously published or written by another person, except where due reference has been made. I give consent to this copy of my thesis, when deposited in the University Library, being made available for loan and photocopying, subject to the provisions of the Copyright Act 1968. The author acknowledges that copyright of the published work Haskard, K. A., Cullis, B. R. & Verbyla, A. P. (2007). Anisotropic Matérn correlation and spatial prediction using REML. Journal of Agricultural, Biological and Environmental Statistics 12, 147 160 contained within this thesis resides with the copyright holder of that work. SIGNED:.............................. DATE:..............................

viii

Acknowledgements ix My heartiest thanks go to my two supervisors, Ari Verbyla, of The University of Adelaide, Australia, and Brian Cullis, of the New South Wales Department of Primary Industries, Australia, for their considerable contribution to the topics and direction of this research, and their invaluable guidance, conversations, stretched patience, encouragement and support of various kinds through some difficult times. I especially appreciate Brian s extraordinary energy and enthusiasm, and his insight and vision. Without him this journey would never have begun. Ari has helped my statistical development informally and more formally over several years, and this also has been very valuable and is much appreciated. I have come a long way and it is credit to you both. Arthur Gilmour, of the New South Wales Department of Primary Industries, Australia, contributed substantially to the progression of this work by implementing the extended Matérn model developed in this thesis in the powerful and flexible computer program ASReml. Dave Butler s (Queensland Department of Primary Industries, Australia) implementation of ASReml, together with directional variograms, into the Splus and R function samm (now called asreml), was also helpful. I am indebted to the Grains Research and Development Corporation (GRDC), Australia, for providing a scholarship, and to my employer, the South Australian Research and Development Institute (SARDI) for granting me extended leave without pay and supplementary financial support while doing this PhD. I gratefully acknowledge Brian Dunn and Mark Conyers, both of the New South Wales Department of Primary Industries, Australia, and Murray Lark of Rothamsted Research, Harpenden, United Kingdom, for making their data available for investigation of the methods developed. I sincerely thank my partner Warwick for his enduring love, ongoing support and encouragement, for listening, and for tolerating my crazy hours, odd work practices, and periodic absences. Among my BiometricsSA colleagues, Scott Foster deserves special mention for his general good humour, high values, wit and wisdom, exemplary example, and many enjoyable and helpful conversations.

x