Multivariate modelling and efficient estimation of Gaussian random fields with application to roller data

Size: px
Start display at page:

Download "Multivariate modelling and efficient estimation of Gaussian random fields with application to roller data"

Transcription

1 Multivariate modelling and efficient estimation of Gaussian random fields with application to roller data Reinhard Furrer, UZH PASI, Búzios, NZZ.ch

2 Motivation Microarray data: construct alternative background correction 2

3 Motivation observations = fixed effects + spatial term + noise Chip 1 Chip 2 3

4 Motivation Microarray data Climate data synthesize projections 4

5 Motivation Microarray data Climate data Satellite data attribute spatial changes 5

6 Motivation (a) Observed change in vegetation activity ( ) y 10 S 20 S observations 0 (b) Change explained by climatic effects x T β (RT) 30 S 10 S 20 E 30 E 20 S trend 0 30 S 20 E 30 E (c) Structured change not explained by climatic effects h (GRF) 10 S 20 S spatial term 0 30 S 20 E 30 E (d) Residuals ε 10 S noise 0 20 S 30 S E 30 E 6

7 Motivation for roller data analysis Compaction for road construction: one vibrating drum (smooth drum or padfoot) rolling at 1m/s 20cm material per layer typical bed is 12 15m wide and m long sufficient compaction is manually tested after several layers of material (USA) 7

8 Motivation for roller data analysis Compaction for road construction: one vibrating drum (smooth drum or padfoot) rolling at 1m/s 20cm material per layer typical bed is 12 15m wide and m long sufficient compaction is manually tested after several layers of material (USA) Is an automatic quality assurance and intelligent compaction possible? 7

9 Motivation Current intelligent compaction: precise GPS positioning on-board visualization off-board processing Source: y (m) k s (MN m) x (m)

10 Motivation Relation between measurement and actual soil modulus is unknown. (Linear) relationship is determined with a second measurement device (lightweight deflection, nuclear density... 8

11 Spatial model Spatial, additive mixed effects model for roller measurement values: RMV = amplitude + roller type + driving direction trend(s) + spatial term(s) + error Y (s) = Xβ + α(s) + γ(s) + ε(s) s D R d, d 1 with Xβ: fixed effects and trend α(s): spline component (trend) γ(s): zero mean spatial Gaussian process ε(s): iid noise, orthogonal to γ(s) 9

12 Spatial model Spatial, additive mixed effects model for roller measurement values: RMV = amplitude + roller type + driving direction trend(s) + spatial term(s) + error Y (s) = Xβ + α(s) + γ(s) + ε(s) s D R d, d 1 with Xβ: α(s): γ(s): ε(s): fixed effects and trend coefficients β spline component (trend) basis function coefficients θ α ; smoothing parameter λ α zero mean spatial Gaussian process parameters θ γ describing the covariance function iid noise, orthogonal to γ(s) variance σ 2 10

13 Multivariate modeling: setting Spatial, additive mixed effects model: Y 1 (s) = X 1 β 1 + α 1 (s) + γ 1 (s) + ε 1 (s). Y p (s) = X p β p + α p (s) + γ p (s) + ε p (s) s D R d, d 1 11

14 Multivariate modeling: setting Spatial, additive mixed effects model: Y 1 (s) = X 1 β 1 + α 1 (s) + γ 1 (s) + ε 1 (s). Y p (s) = X p β p + α p (s) + γ p (s) + ε p (s) s D R d, d 1 Modeling the spatial processes themselves: Random field: GRF GMRF Dependency: cross-correlation model ➀ ➂ common process(es) ➁ ➃ 11

15 Random field modeling Dependency through a cross-correlation model ➀ : X N n (µ X, Σ X ) Y N n (µ Y, Σ Y ) ( ) X Y (( ) µx N 2n, µ Y Cov(X, Y) = Σ XY ( )) ΣX Σ XY Σ XY T (e.g., Gneiting, Kleiber, Schlather 2010, Apanasovich, Genton, Sun 2012,... ) Σ Y 12

16 Random field modeling Dependency through a cross-correlation model ➀ : X N n (µ X, Σ X ) Y N n (µ Y, Σ Y ) ( ) X Y (( ) µx N 2n, µ Y Cov(X, Y) = Σ XY ( )) ΣX Σ XY Σ XY T (e.g., Gneiting, Kleiber, Schlather 2010, Apanasovich, Genton, Sun 2012,... ) Σ Y Dependency through common process(es) ➁ : X N n (µ X, Σ X ) Y N n (µ Y, Σ Y ) ( X + Z Y + Z ) Z N n (0, Σ Z ) (( ) µx N 2n, µ Y ( ΣX + Σ Z Σ Z Σ Z Σ Y + Σ Z )) 12

17 Backfitting Recall: Y 1 (s) = X 1 β 1 + α 1 (s) + γ 1 (s) + ε 1 (s). Y p (s) = X p β p + α p (s) + γ p (s) + ε p (s) 13

18 Backfitting Recall: Y 1 (s) = X 1 β 1 + α 1 (s) + γ 1 (s) + ε 1 (s). Y p (s) = X p β p + α p (s) + γ p (s) + ε p (s) Extending the classical backfitting approach to dependent data: repeat until convergence repeat until convergence estimate fixed effects for all stochastic effects estimate parameters predict smooth field See Furrer, Sain (2009) Heersink, Furrer (2012 3) 13

19 Sparse covariance matrices Calculate Σ:

20 Sparse covariance matrices Calculate Σ:

21 Sparse covariance matrices Calculate Σ: Distances : Correlation( Distances ) : 14

22 Sparse covariance matrices Sparseness is guaranteed when the covariance function has a compact support a compact support is (artificially) imposed tapering Covariance Matern ν = 0.5 spherical Distance, lag h 15

23 Sparse covariance matrices Sparseness is guaranteed when the covariance function has a compact support a compact support is (artificially) imposed tapering Covariance Matern ν = 0.5 spherical Matern * spherical Distance, lag h 16

24 Sparse covariance matrices Sparseness is guaranteed when the covariance function has a compact support a compact support is (artificially) imposed tapering Covariance Matern ν = 0.5 spherical Matern * spherical Distance, lag h spam for 16

25 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n 17

26 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Covariance Matern ν = 0.5 spherical Matern * spherical Distance, lag h 17

27 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Spherical Distance, lag h Covariance Matern ν = 0.5 spherical Matern * spherical Distance, lag h 17

28 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Wendland Distance, lag h Covariance Matern ν = 1.5 Wendland Matern * Wendland Distance, lag h 18

29 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Wendland Distance, lag h Covariance Matern ν = 1.5 Wendland Matern * Wendland Distance, lag h 19

30 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Wendland Distance, lag h Covariance Matern ν = 1.5 Wendland Matern * Wendland Distance, lag h 19

31 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Wendland Distance, lag h Covariance Matern ν = 1.5 Wendland Matern * Wendland Distance, lag h 19

32 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Difference Wendland Distance, lag h Covariance Matern ν = 1.5 Wendland Matern * Wendland Distance, lag h 19

33 Multivariate tapering Concept of Gaussian equivalent measures does not exist Domain increasing framework Σ Σ T 0 as n Tapering is purely pragmatic T = Q T i Q = ɛi + (1 ɛ)j 19

34 All models are wrong, but... Iterative approaches + Flexible, numerically feasible Uncertainties Maximum likelihood + Uncertainties, asymptotics Numerical issues Bayesian hierarchical models + Flexible, uncertainties MCMC SPDE models + flexible, scalable 20

35 Example: Backfitting Minnesota testbed: Cell 28 Cell 27 Subgrade, subbase, base (top to bottom). 21

36 Example: Backfitting Minnesota testbed: Model: Y 1 (s) = X 1 β 1 + γ 1 (s) + ε 1 (s) Y 2 (s) = X 2 β 2 + c 1 γ 1 (s) + γ 2 (s) + ε 2 (s) Y 3 (s) = X 3 β 3 + c 2 (c 1 γ 1 (s) + γ 2 (s)) + γ 3 (s) + ε 3 (s) 22

37 be uniformly blue. Inset: nonstationarity Figure 2: Data from the test bed in Florida, USA. RMVs collected from driving in the x-direction (left) and from driving in the y-direction (right) are depicted

38 be uniformly blue. Inset: nonstationarity Figure 2: Data from the test bed in Florida, USA. RMVs collected from driving in the x-direction (left) and from driving in the y-direction (right) are depicted lag dist lag dist 23 1

39 be uniformly blue. Inset: nonstationarity Figure 2: Data from the test bed in Florida, USA. RMVs collected from driving in the x-direction (left) and from driving in the y-direction (right) are depicted lag dist lag dist 23 1

40 be uniformly blue. Inset: nonstationarity Figure 2: Data from the test bed in Florida, USA. RMVs collected from driving in the Geometric anisotropy x-direction (left) and from driving in the y-direction (right) are depicted lag dist directionally colored noise lag dist 23 1

41 Example: Backfitting Minnesota testbed: Model: Y 1 (s) = X 1 β 1 + γ 1 (s) + ε 1 (s) Y 2 (s) = X 2 β 2 + c 1 γ 1 (s) + γ 2 (s) + ε 2 (s) Y 3 (s) = X 3 β 3 + c 2 (c 1 γ 1 (s) + γ 2 (s)) + γ 3 (s) + ε 3 (s) (add measurement operator... ) 24

42 Example: Backfitting Minnesota testbed: Model: Y 1 (s) = X 1 β 1 + γ 1 (s) + ε 1 (s) Y 2 (s) = X 2 β 2 + c 1 γ 1 (s) + γ 2 (s) + ε 2 (s) Y 3 (s) = X 3 β 3 + c 2 (c 1 γ 1 (s) + γ 2 (s)) + γ 3 (s) + ε 3 (s) (add measurement operator... ) Cell 27: Fitted smooths: 24

43 Multiresolution analysis Simultaneous inference for IC and QA which features are really there? Using Holmström et al. (2010), extensions from original SiZER Ready to use software Decomposition into different scales: 0 = λ 1 < λ 2 < < λ L = smoothing parameters γ = L 1 i=1 (S λi S λi+1 )γ + S λl γ z 1 + z z L 25

44 Multiresolution analysis Simultaneous inference for IC and QA which features are really there? Using Holmström et al. (2010), extensions from original SiZER Ready to use software Decomposition into different scales: 0 = λ 1 < λ 2 < < λ L = smoothing parameters γ = L 1 i=1 (S λi S λi+1 )γ + S λl γ z 1 + z z L 25

45 Example: multiresolution analysis Minnesota testbed: Cell 28 Cell 27 Subgrade, subbase, base (top to bottom). 26

46 Example: Multiresolution analysis Cell 27: Fitted smooths: Multiresolution analysis: subgrade subbase base 27

47 Afterthoughts/outlook Flexible setting... toolbox(es) Multivariate spatial (spatio-temporal) non-stationary data Bayesian framework Non-Gaussian data 28

48 Collaboration with: Daniel Heersink, now Research Scientist at CSIRO, Canberra Mike Mooney, CSM Roland Anderegg, FHNW... URPP Systems Biology/Functional Genomics & URPP Global Change and Biodiversity , ,

49 References (some) de Jong, Schaepman, Furrer, de Bruin, Verburget (2013) Spatial relationship between climatologies and changes in global vegetation activity, Global Change Biology, 19(6), Facas, Furrer, Mooney (2010) Anisotropy in the Spatial Distribution of Roller- Measured Soil Stiffness. International Journal of Geomechanics, 10(4), Furrer, Sain (2009) Spatial Model Fitting for Large Datasets with Applications to Climate and Microarray Problems, Statistics and Computing, 19(2), Heersink, Furrer (2013) Sequential Spatial Analysis of Large Datasets with Applications to Modern Soil Compaction Roller Measurement Values, Spatial Statistics, 6, Heersink, Furrer, Mooney (2013) Intelligent Compaction and Quality Assurance of Roller Measurement Values utilizing Backfitting and Multiresolution Scale Space Analysis, arxiv: Heersink, Furrer, Mooney (2013) Spatial Backfitting of Roller Measurement Values from a Florida Test Bed, arxiv: Heersink, Furrer (2012) Moore Penrose Inverses of Quasi-Kronecker Structured Matrices, Linear Algebra and its Applications, 436(3), Holmström, Pasanen, Furrer, Sain (2011) Scale Space Multiresolution Analysis of Random Signals. Computational Statistics and Data Analysis, 55,

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Multivariate Gaussian Random Fields with SPDEs

Multivariate Gaussian Random Fields with SPDEs Multivariate Gaussian Random Fields with SPDEs Xiangping Hu Daniel Simpson, Finn Lindgren and Håvard Rue Department of Mathematics, University of Oslo PASI, 214 Outline The Matérn covariance function and

More information

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced

More information

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets Thomas Romary, Nicolas Desassis & Francky Fouedjio Mines ParisTech Centre de Géosciences, Equipe Géostatistique

More information

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Matthias Katzfuß Advisor: Dr. Noel Cressie Department of Statistics The Ohio State University

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing

More information

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016 Spatial Statistics Spatial Examples More Spatial Statistics with Image Analysis Johan Lindström 1 1 Mathematical Statistics Centre for Mathematical Sciences Lund University Lund October 6, 2016 Johan Lindström

More information

On the smallest eigenvalues of covariance matrices of multivariate spatial processes

On the smallest eigenvalues of covariance matrices of multivariate spatial processes On the smallest eigenvalues of covariance matrices of multivariate spatial processes François Bachoc, Reinhard Furrer Toulouse Mathematics Institute, University Paul Sabatier, France Institute of Mathematics

More information

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation A Framework for Daily Spatio-Temporal Stochastic Weather Simulation, Rick Katz, Balaji Rajagopalan Geophysical Statistics Project Institute for Mathematics Applied to Geosciences National Center for Atmospheric

More information

Nonstationary cross-covariance models for multivariate processes on a globe

Nonstationary cross-covariance models for multivariate processes on a globe Nonstationary cross-covariance models for multivariate processes on a globe Mikyoung Jun 1 April 15, 2011 Abstract: In geophysical and environmental problems, it is common to have multiple variables of

More information

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

On Gaussian Process Models for High-Dimensional Geostatistical Datasets On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May

More information

Climate Change: the Uncertainty of Certainty

Climate Change: the Uncertainty of Certainty Climate Change: the Uncertainty of Certainty Reinhard Furrer, UZH JSS, Geneva Oct. 30, 2009 Collaboration with: Stephan Sain - NCAR Reto Knutti - ETHZ Claudia Tebaldi - Climate Central Ryan Ford, Doug

More information

Geostatistical Modeling for Large Data Sets: Low-rank methods

Geostatistical Modeling for Large Data Sets: Low-rank methods Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016 Outline Motivation Low-rank

More information

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

Practical Bayesian Optimization of Machine Learning. Learning Algorithms Practical Bayesian Optimization of Machine Learning Algorithms CS 294 University of California, Berkeley Tuesday, April 20, 2016 Motivation Machine Learning Algorithms (MLA s) have hyperparameters that

More information

Disease mapping with Gaussian processes

Disease mapping with Gaussian processes EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments

More information

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe Scandinavian Journal of Statistics, Vol. 38: 726 747, 2011 doi: 10.1111/j.1467-9469.2011.00751.x Published by Blackwell Publishing Ltd. Non-stationary Cross-Covariance Models for Multivariate Processes

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation NORDSTAT, Umeå, June, 2012 Credits Steve Sain, NCAR Tia LeRud, UC Davis

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 254 Part V

More information

OPTIMAL ESTIMATION of DYNAMIC SYSTEMS

OPTIMAL ESTIMATION of DYNAMIC SYSTEMS CHAPMAN & HALL/CRC APPLIED MATHEMATICS -. AND NONLINEAR SCIENCE SERIES OPTIMAL ESTIMATION of DYNAMIC SYSTEMS John L Crassidis and John L. Junkins CHAPMAN & HALL/CRC A CRC Press Company Boca Raton London

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Wrapped Gaussian processes: a short review and some new results

Wrapped Gaussian processes: a short review and some new results Wrapped Gaussian processes: a short review and some new results Giovanna Jona Lasinio 1, Gianluca Mastrantonio 2 and Alan Gelfand 3 1-Università Sapienza di Roma 2- Università RomaTRE 3- Duke University

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

A short introduction to INLA and R-INLA

A short introduction to INLA and R-INLA A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk

More information

Cross-covariance Functions for Tangent Vector Fields on the Sphere

Cross-covariance Functions for Tangent Vector Fields on the Sphere Cross-covariance Functions for Tangent Vector Fields on the Sphere Minjie Fan 1 Tomoko Matsuo 2 1 Department of Statistics University of California, Davis 2 Cooperative Institute for Research in Environmental

More information

Fitting Large-Scale Spatial Models with Applications to Microarray Data Analysis

Fitting Large-Scale Spatial Models with Applications to Microarray Data Analysis Fitting Large-Scale Spatial Models with Applications to Microarray Data Analysis Stephan R Sain Department of Mathematics University of Colorado at Denver Denver, Colorado ssain@mathcudenveredu Reinhard

More information

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs Centre for Mathematical Sciences Lund University Engineering geology Lund University Results A non-stationary extension The model Estimation Gaussian Markov random fields Basics Approximating Mate rn covariances

More information

Gaussian Process Regression

Gaussian Process Regression Gaussian Process Regression 4F1 Pattern Recognition, 21 Carl Edward Rasmussen Department of Engineering, University of Cambridge November 11th - 16th, 21 Rasmussen (Engineering, Cambridge) Gaussian Process

More information

arxiv: v4 [stat.me] 14 Sep 2015

arxiv: v4 [stat.me] 14 Sep 2015 Does non-stationary spatial data always require non-stationary random fields? Geir-Arne Fuglstad 1, Daniel Simpson 1, Finn Lindgren 2, and Håvard Rue 1 1 Department of Mathematical Sciences, NTNU, Norway

More information

CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes

CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes Roger Grosse Roger Grosse CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes 1 / 55 Adminis-Trivia Did everyone get my e-mail

More information

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Chris Paciorek March 11, 2005 Department of Biostatistics Harvard School of

More information

The Bayesian approach to inverse problems

The Bayesian approach to inverse problems The Bayesian approach to inverse problems Youssef Marzouk Department of Aeronautics and Astronautics Center for Computational Engineering Massachusetts Institute of Technology ymarz@mit.edu, http://uqgroup.mit.edu

More information

How to build an automatic statistician

How to build an automatic statistician How to build an automatic statistician James Robert Lloyd 1, David Duvenaud 1, Roger Grosse 2, Joshua Tenenbaum 2, Zoubin Ghahramani 1 1: Department of Engineering, University of Cambridge, UK 2: Massachusetts

More information

Predictive spatio-temporal models for spatially sparse environmental data. Umeå University

Predictive spatio-temporal models for spatially sparse environmental data. Umeå University Seminar p.1/28 Predictive spatio-temporal models for spatially sparse environmental data Xavier de Luna and Marc G. Genton xavier.deluna@stat.umu.se and genton@stat.ncsu.edu http://www.stat.umu.se/egna/xdl/index.html

More information

Flexible Spatio-temporal smoothing with array methods

Flexible Spatio-temporal smoothing with array methods Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012 A multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain October 10, 2012 Abstract

More information

Journal of Multivariate Analysis. Nonstationary modeling for multivariate spatial processes

Journal of Multivariate Analysis. Nonstationary modeling for multivariate spatial processes Journal of Multivariate Analysis () 76 9 Contents lists available at SciVerse ScienceDirect Journal of Multivariate Analysis journal homepage: wwwelseviercom/locate/jmva Nonstationary modeling for multivariate

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN: Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying

More information

Spatial point processes in the modern world an

Spatial point processes in the modern world an Spatial point processes in the modern world an interdisciplinary dialogue Janine Illian University of St Andrews, UK and NTNU Trondheim, Norway Bristol, October 2015 context statistical software past to

More information

arxiv: v2 [stat.me] 9 Feb 2016

arxiv: v2 [stat.me] 9 Feb 2016 A Flexible Class of Non-separable Cross- Functions for Multivariate Space-Time Data Marc Bourotte a, Denis Allard a & Emilio Porcu b a Biostatistique et Processus Spatiaux (BioSP), INRA, Avignon, France.

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents

Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents Navtech Part #s Volume 1 #1277 Volume 2 #1278 Volume 3 #1279 3 Volume Set #1280 Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents Volume 1 Preface Contents

More information

Spatial smoothing using Gaussian processes

Spatial smoothing using Gaussian processes Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1 OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling

More information

A Multi-resolution Gaussian process model for the analysis of large spatial data sets.

A Multi-resolution Gaussian process model for the analysis of large spatial data sets. 1 2 3 4 A Multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain August 13, 2013 5

More information

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64

More information

Nonparametric Bayesian Methods

Nonparametric Bayesian Methods Nonparametric Bayesian Methods Debdeep Pati Florida State University October 2, 2014 Large spatial datasets (Problem of big n) Large observational and computer-generated datasets: Often have spatial and

More information

Positive Definite Functions on Spheres

Positive Definite Functions on Spheres Positive Definite Functions on Spheres Tilmann Gneiting Institute for Applied Mathematics Heidelberg University, Germany t.gneiting@uni-heidelberg.de www.math.uni-heidelberg.de/spatial/tilmann/ International

More information

Bayesian Nonparametrics

Bayesian Nonparametrics Bayesian Nonparametrics Peter Orbanz Columbia University PARAMETERS AND PATTERNS Parameters P(X θ) = Probability[data pattern] 3 2 1 0 1 2 3 5 0 5 Inference idea data = underlying pattern + independent

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Statistica Sinica (2013): Preprint 1 A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Ganggang Xu 1, Faming Liang 1 and Marc G. Genton 2 1 Texas A&M University

More information

Sub-kilometer-scale space-time stochastic rainfall simulation

Sub-kilometer-scale space-time stochastic rainfall simulation Picture: Huw Alexander Ogilvie Sub-kilometer-scale space-time stochastic rainfall simulation Lionel Benoit (University of Lausanne) Gregoire Mariethoz (University of Lausanne) Denis Allard (INRA Avignon)

More information

Composite Hypotheses and Generalized Likelihood Ratio Tests

Composite Hypotheses and Generalized Likelihood Ratio Tests Composite Hypotheses and Generalized Likelihood Ratio Tests Rebecca Willett, 06 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve

More information

Computer model calibration with large non-stationary spatial outputs: application to the calibration of a climate model

Computer model calibration with large non-stationary spatial outputs: application to the calibration of a climate model Computer model calibration with large non-stationary spatial outputs: application to the calibration of a climate model Kai-Lan Chang and Serge Guillas University College London, Gower Street, London WC1E

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation Iowa State March, 2013 Credits Steve Sain, Tamra Greasby, NCAR Tia LeRud,

More information

Fast Direct Methods for Gaussian Processes

Fast Direct Methods for Gaussian Processes Fast Direct Methods for Gaussian Processes Mike O Neil Departments of Mathematics New York University oneil@cims.nyu.edu December 12, 2015 1 Collaborators This is joint work with: Siva Ambikasaran Dan

More information

ECE521 week 3: 23/26 January 2017

ECE521 week 3: 23/26 January 2017 ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear

More information

Gaussian Process Optimization with Mutual Information

Gaussian Process Optimization with Mutual Information Gaussian Process Optimization with Mutual Information Emile Contal 1 Vianney Perchet 2 Nicolas Vayatis 1 1 CMLA Ecole Normale Suprieure de Cachan & CNRS, France 2 LPMA Université Paris Diderot & CNRS,

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Regression with correlation for the Sales Data

Regression with correlation for the Sales Data Regression with correlation for the Sales Data Scatter with Loess Curve Time Series Plot Sales 30 35 40 45 Sales 30 35 40 45 0 10 20 30 40 50 Week 0 10 20 30 40 50 Week Sales Data What is our goal with

More information

Bayesian Hierarchical Modelling: Incorporating spatial information in water resources assessment and accounting

Bayesian Hierarchical Modelling: Incorporating spatial information in water resources assessment and accounting Bayesian Hierarchical Modelling: Incorporating spatial information in water resources assessment and accounting Grace Chiu & Eric Lehmann (CSIRO Mathematics, Informatics and Statistics) A water information

More information

The Matrix Reloaded: Computations for large spatial data sets

The Matrix Reloaded: Computations for large spatial data sets The Matrix Reloaded: Computations for large spatial data sets Doug Nychka National Center for Atmospheric Research The spatial model Solving linear systems Matrix multiplication Creating sparsity Sparsity,

More information

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω ECO 513 Spring 2015 TAKEHOME FINAL EXAM (1) Suppose the univariate stochastic process y is ARMA(2,2) of the following form: y t = 1.6974y t 1.9604y t 2 + ε t 1.6628ε t 1 +.9216ε t 2, (1) where ε is i.i.d.

More information

Modelling Multivariate Spatial Data

Modelling Multivariate Spatial Data Modelling Multivariate Spatial Data Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. June 20th, 2014 1 Point-referenced spatial data often

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Lecture 16: State Space Model and Kalman Filter Bus 41910, Time Series Analysis, Mr. R. Tsay

Lecture 16: State Space Model and Kalman Filter Bus 41910, Time Series Analysis, Mr. R. Tsay Lecture 6: State Space Model and Kalman Filter Bus 490, Time Series Analysis, Mr R Tsay A state space model consists of two equations: S t+ F S t + Ge t+, () Z t HS t + ɛ t (2) where S t is a state vector

More information

Gaussian Processes 1. Schedule

Gaussian Processes 1. Schedule 1 Schedule 17 Jan: Gaussian processes (Jo Eidsvik) 24 Jan: Hands-on project on Gaussian processes (Team effort, work in groups) 31 Jan: Latent Gaussian models and INLA (Jo Eidsvik) 7 Feb: Hands-on project

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Fast approximations for the Expected Value of Partial Perfect Information using R-INLA

Fast approximations for the Expected Value of Partial Perfect Information using R-INLA Fast approximations for the Expected Value of Partial Perfect Information using R-INLA Anna Heath 1 1 Department of Statistical Science, University College London 22 May 2015 Outline 1 Health Economic

More information

Part 6: Multivariate Normal and Linear Models

Part 6: Multivariate Normal and Linear Models Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of

More information

A multi-resolution Gaussian process model for the analysis of large spatial data sets.

A multi-resolution Gaussian process model for the analysis of large spatial data sets. National Science Foundation A multi-resolution Gaussian process model for the analysis of large spatial data sets. Doug Nychka Soutir Bandyopadhyay Dorit Hammerling Finn Lindgren Stephen Sain NCAR/TN-504+STR

More information

Model-Free Knockoffs: High-Dimensional Variable Selection that Controls the False Discovery Rate

Model-Free Knockoffs: High-Dimensional Variable Selection that Controls the False Discovery Rate Model-Free Knockoffs: High-Dimensional Variable Selection that Controls the False Discovery Rate Lucas Janson, Stanford Department of Statistics WADAPT Workshop, NIPS, December 2016 Collaborators: Emmanuel

More information

Bayes: All uncertainty is described using probability.

Bayes: All uncertainty is described using probability. Bayes: All uncertainty is described using probability. Let w be the data and θ be any unknown quantities. Likelihood. The probability model π(w θ) has θ fixed and w varying. The likelihood L(θ; w) is π(w

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.

More information

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Regressione. Ruggero Donida Labati

SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA. Sistemi di Elaborazione dell Informazione. Regressione. Ruggero Donida Labati SCUOLA DI SPECIALIZZAZIONE IN FISICA MEDICA Sistemi di Elaborazione dell Informazione Regressione Ruggero Donida Labati Dipartimento di Informatica via Bramante 65, 26013 Crema (CR), Italy http://homes.di.unimi.it/donida

More information

Model Selection for Geostatistical Models

Model Selection for Geostatistical Models Model Selection for Geostatistical Models Richard A. Davis Colorado State University http://www.stat.colostate.edu/~rdavis/lectures Joint work with: Jennifer A. Hoeting, Colorado State University Andrew

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

The Matrix Reloaded: Computations for large spatial data sets

The Matrix Reloaded: Computations for large spatial data sets The Matrix Reloaded: Computations for large spatial data sets The spatial model Solving linear systems Matrix multiplication Creating sparsity Doug Nychka National Center for Atmospheric Research Sparsity,

More information

Bayesian PalaeoClimate Reconstruction from proxies:

Bayesian PalaeoClimate Reconstruction from proxies: Bayesian PalaeoClimate Reconstruction from proxies: Framework Bayesian modelling of space-time processes General Circulation Models Space time stochastic process C = {C(x,t) = Multivariate climate at all

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS Eric Gilleland Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Supported

More information

X t = a t + r t, (7.1)

X t = a t + r t, (7.1) Chapter 7 State Space Models 71 Introduction State Space models, developed over the past 10 20 years, are alternative models for time series They include both the ARIMA models of Chapters 3 6 and the Classical

More information

Nonparametric Regression With Gaussian Processes

Nonparametric Regression With Gaussian Processes Nonparametric Regression With Gaussian Processes From Chap. 45, Information Theory, Inference and Learning Algorithms, D. J. C. McKay Presented by Micha Elsner Nonparametric Regression With Gaussian Processes

More information

Bayesian Estimation of Input Output Tables for Russia

Bayesian Estimation of Input Output Tables for Russia Bayesian Estimation of Input Output Tables for Russia Oleg Lugovoy (EDF, RANE) Andrey Polbin (RANE) Vladimir Potashnikov (RANE) WIOD Conference April 24, 2012 Groningen Outline Motivation Objectives Bayesian

More information

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop Music and Machine Learning (IFT68 Winter 8) Prof. Douglas Eck, Université de Montréal These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop

More information

STA414/2104 Statistical Methods for Machine Learning II

STA414/2104 Statistical Methods for Machine Learning II STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements

More information

Fast algorithms that utilise the sparsity of Q exist (c-package ( Q 1. GMRFlib) Seattle February 3, 2009

Fast algorithms that utilise the sparsity of Q exist (c-package ( Q 1. GMRFlib) Seattle February 3, 2009 GMRF:s Results References Some Theory for and Applications of Gaussian Markov Random Fields Johan Lindström 1 David Bolin 1 Finn Lindgren 1 Håvard Rue 2 1 Centre for Mathematical Sciences Lund University

More information

Models for models. Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research

Models for models. Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Models for models Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Outline Statistical models and tools Spatial fields (Wavelets) Climate regimes (Regression and clustering)

More information

Multivariate spatial models and the multikrig class

Multivariate spatial models and the multikrig class Multivariate spatial models and the multikrig class Stephan R Sain, IMAGe, NCAR ENAR Spring Meetings March 15, 2009 Outline Overview of multivariate spatial regression models Case study: pedotransfer functions

More information