Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency
|
|
- Martin Lawson
- 5 years ago
- Views:
Transcription
1 Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Chris Paciorek March 11, 2005 Department of Biostatistics Harvard School of Public Health
2 Increased attention to spatial analysis in public health data availability: geocoding and GPS for assigning point locations to individuals and monitors GIS software: easy data management and manipulation graphical presentation spatially-varying covariate generation interest amongst researchers: strong applied interest in kriging and related smoothing methods opportunities for more sophisticated spatio-temporal modelling, particularly Bayesian hierarchical modelling 1
3 Petrochemical exposure in Kaohsiung, Taiwan leukemia brain cancer northing (km) northing (km) easting (km) easting (km) n = 495 n = 433 n 1 = 141 n 1 = 121 2
4 Possible approaches for health analysis Explicitly estimate pollutant exposure - difficult retrospectively Use distance to exposure source as covariate Use a moving window/multiple testing to detect clusters of cases default approach - software available Include space as a covariate to provide a map of risk H i Ber(p(x i, s i )) logit(p(x i, s i )) = x T i β + g θ (s i ) 3
5 Particulate matter exposure in the Nurses Health Study estimate individual exposure, EPA monitoring for large-scale spatio-temporal heterogeneity spatially-varying covariates for local heterogenity distance to roads, climate variables, local land use,... generated using GIS basic additive exposure model: log E i N(f(x i, s i ), η 2 ) f(x i, s i ) = p h p (x i ) + g θ (s i ) geocoding of individual residences every two years relate estimated exposure to health outcomes (chronic heart disease) 4
6 geocoding and GIS make this possible; spatial statistics provides a rigorous framework 5
7 Health outcomes by postcode in NSW, Australia methodological challenges areal (postcode) units vary drastically in size data misalignment relate areal data to a latent smooth process, g θ ( ) (Kelsall & Wakefield, Rathouz) computational challenges: 650 units, 5 years daily data, 2 sexes, 9 age groups 6
8 Outline Motivating examples Introduction to Gaussian processes (GPs) Fast Gaussian process modelling Flexible Gaussian process modelling Bayes and overfitting The future: flexibility + efficiency + hierarchical modelling 7
9 Kriging as a GP model Y i N(g(s i ), η 2 ) g( ) GP(µ, C( ; θ)) Bayesian model specifies prior distributions for θ (Bayesian kriging) Empirical Bayes/marginal likelihood (i.e., kriging) integrate g train = (g(s 1 ),..., g(s n )) out of model estimate θ maximizing marginal posterior maximizing marginal likelihood fitting variogram model for C( ; θ) point estimate for spatial process: E(g test Y, θ) based conditional normal calculations 8
10 GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for g(x), x X : Example: g( ) a spatial process, X = R 2 g( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(x i, x j ): stationary, isotropic stationary, anisotropic nonstationary X x j x j x i x i 9
11 GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for g(x), x X : Example: g( ) a spatial process, X = R 2 g( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(x i, x j ): stationary, isotropic stationary, anisotropic nonstationary X x j x i x j x i 10
12 GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for g(x), x X : Example: g( ) a spatial process, X = R 2 g( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(x i, x j ): stationary, isotropic stationary, anisotropic nonstationary X x j x j x i x i 11
13 Stationary Correlation Functions Correlation Correlation fxns ρ = 0.2 ρ = Distance Z(x) Sample fxns x Matérn form ( ) ν ( ) 1 R(τ) = 2 ντ 2 ντ Γ(ν)2 ν 1 ρ Kν ρ ; ν > 0, ρ > 0 Differentiability controlled by ν, asymptotic advantages (Stein) Familiar exponential (ν = 0.5) and squared exponential (Gaussian) (ν ) correlations as special and limiting cases 12
14 Computational challenges of GPs even marginal likelihood in normal error model is intensive: g( ) GP(µ( ), C θ (, )) Y N(µ, C θ + η 2 I) O(n 3 ) fitting: C θ + η 2 I and (C θ + η 2 I) 1 (Y µ1) non-gaussian spatial models particularly difficult spatial process can t be integrated out MCMC mixing is very slow because of high-level structure correlation amongst process values and between process values and process hyperparameters value ACF iteration lag/10 13
15 Petrochemical exposure in Kaohsiung, Taiwan leukemia brain cancer northing (km) northing (km) easting (km) easting (km) n = 495 n = 433 n 1 = 141 n 1 =
16 Modelling Framework H i Ber(p(x i, s i )) logit(p(x i, s i )) = x i T β + g θ (s i ) basic spatial model for g s θ = (g θ (s 1 ),..., g θ (s n )) GAM: g θ ( ) is a two-dimensional smooth term basis representation g s θ = Zu Gaussian process representation: g( ) GP(µ( ), C θ (, )) g s θ N(µ, C θ ) GLMM: g s θ = Zu correlated random effects, u N(0, Σ) 15
17 Approaches Bayesian spectral basis model fit by MCMC (Wikle, 2002) [B-SB] penalized likelihood based on mixed model (radial basis functions) with REML smoothing (Kammann and Wand, 2003; Ngo and Wand, 2004) [PL-PQL] penalized likelihood with GCV smoothing (Wood, 2001, 2003, 2004) [PL-GCV] Bayesian mixed model/radial basis functions fit by MCMC (Zhao and Wand 2004) [B-Geo] 16
18 Bayesian spectral basis function model computationally efficient basis function construction (Wikle 2002) g # = Zu and g s = σp g # piecewise constant gridded surface on k by k grid P maps observation locations to nearest grid point Z is the Fourier (spectral) basis and Zu is the inverse FFT Zu is approximately a Gaussian process (GP) when... u N(0, diag(π θ (ω))) for Fourier frequencies, ω spectral density, π θ ( ), of GP covariance function defines V(u) 17
19 Bayesian spectral basis functions ω 2 = 0 ω 2 = 1 ω 2 = 2 ω 2 = 3 ω 1 = 0 ω 1 = 1 ω 1 = 2 ω 1 = 3 18
20 Comparison with usual GP specification spectral basis uses FFT O ( (k 2 ) log(k 2 ) ) additional observations are essentially free for fixed grid fast computation and prediction of surface given coefficients a priori independent coefficients give fast computation of prior and help with mixing 19
21 Penalized likelihood using GLMM framework with REML [PL-PQL] g s = Zu, Z = Ψ nk Ω 1 2 kk, u N(0, σ2 u) - variance component provides complexity penalty Ω contains pairwise spatial covariances between k knot locations and Ψ between n data locations and k knot locations potential covariance functions: thin plate spline generalized covariance function, ( C(τ) = τ 2 log τ ) ν ( ) 1 Matérn correlation function, R(τ) = 2 ντ 2 ντ Γ(ν)2 ν 1 ρ Kν ρ, with ρ and ν fixed computationally efficient approximation of a Gaussian process representation for g s PQL approach - IWLS fitting of (β, u) with REML estimation of σ 2 u within the iterations using MM software 20
22 GLMM basis functions radial basis functions centered at the knots 4 of 64 functions displayed: TPS Matérn 21
23 Penalized likelihood using GCV [PL-GCV] thin plate spline basis for g( ) truncated eigendecomposition of basis matrix increases computational efficiency IWLS fitting of (β, u) with GCV estimation of penalty easy implementation using the R mgcv library gam() 22
24 Bayesian geoadditive model [B-Geo] Bayesian version of GLMM framework already described g s = Zu, Z = Ψ nk Ω 1 2 kk, u N(0, σ2 u) natural Bayesian complexity penalty through prior on u thin plate spline covariance or Matérn correlation basis construction of Ψ and Ω MCMC implementation - ensuring mixing is not simple Metropolis-Hastings for u using conditional posterior mean and variance based on linearized observations joint proposals for σ 2 u and u to ensure that u remains compatible with its variance component 23
25 Simulated datasets 3 case-control scenarios: n 0 = 1, 000; n 1 = 200; n test = 2500 on 50 by 50 grid 1 cohort scenario: n = 10, 000; n test = 2500 on 50 by 50 grid
26 Assessment on 50 simulated datasets flat isotropic MSE PL PQL PL GCV B Geo B SB MSE PL PQL PL GCV B Geo B SB anisotropic cohort MSE MSE PL PQL PL GCV B Geo B SB PL PQL PL GCV B SB null 25
27 Mixing and speed of Bayesian methods speed (1000 its) speed (1000 eff. its) log posterior trace σ trace ρ trace B-Geo 15 min. 104 hr B-SB 1.3 min. 6 hr
28 Taiwan revisited - assessment leukemia brain cancer Summed test deviance over 10-fold C-V sets leukemia brain cancer PL-GCV PL-PQL B-Geo B-SB null northing (km) northing (km) easting (km) easting (km) 27
29 Assessment on count simulations n = 225, n test = 2500 on 50 by 50 grid MSE MSE MSE PL PQL PL GCV B Geo B SB null PL PQL PL GCV B Geo B SB null PL PQL PL GCV B Geo B SB null 28
30 Penalization in the spectral approach GP representation zeroes out high-frequency coefficients as appropriate Spatial hyperparameter controls coefficient variances g( ) GP(µ( ), σ 2 R(, ; ρ, ν)) prior penalty sample functions log(variance) ρ = 0.1 ρ = 0.5 ρ = frequency variance ρ = 0.1 ρ = 0.5 ρ = frequency f(x) x 29
31 Heterogeneous penalties y true gam() spectral x spatially-varying penalties are one option (e.g., Lang & Brezger 2004; Crainiceanu et al. 2004) spatially-varying ρ in a GP context is another 30
32 Outline Motivating examples Introduction to Gaussian processes (GPs) Fast Gaussian process modelling Flexible Gaussian process modelling Why Bayes works for smoothing The future: flexibility + efficiency + hierarchical modelling 31
33 A nonstationary covariance Higdon, Swall, and Kern (1999) model: C NS (x i, x j ) = k xi (u)k xj (u)du R p guaranteed positive definite Gaussian kernels give closed form: k xi (u) exp ( (u x i ) T Σ 1 i (u x i ) ) ( R NS (x i, x j ) = c ij exp (x i x j ) T ( Σi + Σ j 2 ) 1 (x i x j )) g( ) GP(µ, σ 2 R NS (, ; Σ( ))) 32
34 Nonstationary GPs in 1-D Kernel standard deviation Some kernels f(x) Some sample functions f(x) x x x f(x) x f(x) x 33
35 Nonstationary GPs in 2-D Sample function Kernel structure Sample function image x 2 1 g(x[1], x[2]) x[2] x[1] x 2 x 1 x 1 34
36 Generalizing the kernel convolution approach Squared exponential form: exp ( ( τij ρ ) 2 ) c ij exp ( (x i x j ) T ( Σi + Σ j 2 ) 1 (x i x j )) infinitely-differentiable sample paths Distance measures : isotropy τij 2 = (x i x j ) T (x i x j ) anisotropy τij 2 = (x i x j ) T Σ 1 (x i x j ) ( ) nonstationarity Q ij = (x i x j ) T Σi +Σ 1 j 2 (xi x j ) Can we replace τ 2 ij with Q ij in other stationary correlation functions? 35
37 A class of nonstationary covariance functions Theorem 1: if R(τ) is positive definite for R P, P = 1, 2,..., then R NS (x i, x j ) = c ij R( Q ij ) is positive definite for R P, P = 1, 2,... Theorem 2: smoothness (differentiability) properties of the original stationary correlation retained Specific case of Matérn nonstationary covariance: 1 Γ(ν)2 ν 1 ( ) ν ( ) 2 ντ 2 ντ K ν ρ ρ 1 ( Γ(ν)2 ν 1 2 ) ν ( νq ij Kν 2 ) νq ij advantages: more flexible form, differentiability not constrained, possible asymptotic advantages 36
38 Exponential and Matérn sample functions (stationary) ν = 0.5 ν = 4 x x x 1 x 1 37
39 A basic Bayesian nonstationary spatial model Bayesian nonstationary kriging model Y i N(g(x i ), η 2 ), x i R 2 g( ) GP(µ, σ 2 R NS (, ; ν, Σ( )) Let R NS be the nonstationary Matérn correlation Kernels (Σ x ) constructed based on stationary GP priors define multiple kernel matrices, Σ x, x X smoothly-varying (element-wise) in domain positive definite Fit via MCMC, including parameters determining Σ( ) 38
40 Smoothly-varying kernel matrices Spectral decomposition for each Σ x = Γ T x Λ x Γ x in R 2, parameterize each kernel using unnormalized eigenvector coordinates (a x, b x ) and the second eigenvalue (log λ 2,x ) define stationary GP priors for Φ( ) {(a( ), b( ), log(λ 2 ( ))} efficiently parameterize each GP using basis function approximation (Zhao & Wand, 2004) a x, b x λ 2x x λ 1x 39
41 Colorado precipitation characterization
42 Estimating Colorado precipitation Posterior mean Posterior std dev a c b d
43 Why doesn t Bayes overfit? Fourier basis involves k 2 (=4096, e.g.) coefficients Nonstationary covariance involves very highly-parameterized covariance structure No direct penalty on complicated spatial functions marg. lik. (prior predictive) P(Y M2) data space P(Y M1) P (y M i ) y Y 2 x y Y 1 x y x Model 1 ρ = 2.5 Model 2 ρ =
44 What does a Bayesian approach give us? ability to create rich hierarchical models that reflect our understanding of the system in environmental health applications the ability to incorporate time, latent variables, misaligned data a natural penalty on overfitting a recipe (perhaps slow) for estimation proper characterization of the uncertainty challenge lies less on the modelling side than with computations, model comparison and evaluation, and reproducibility 43
45 Future methodological work collaborative work on spatio-temporal modelling computational approaches for applying existing methodological ideas to real health data GP computations and parameterization: flexibility + efficiency + hierarchical modelling computational tricks for the nonstationary covariance; e.g., knotbased approaches for faster computation use of a wavelet basis with irregular spatial data in a similar framework as the spectral basis combining deterministic and stochastic models, e.g. for air pollution useful, practical methods for designing spatial monitoring networks and determining power 44
Spatial smoothing using Gaussian processes
Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1 OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling
More informationComputational techniques for spatial logistic regression with large datasets
Computational techniques for spatial logistic regression with large datasets Christopher J. Paciorek April 19, 2007 key words: Bayesian statistics, disease mapping, Fourier basis, generalized linear mixed
More informationComputational techniques for spatial logistic regression with large datasets
Computational techniques for spatial logistic regression with large datasets Christopher J. Paciorek and Louise Ryan 6th October 2005 key words: Bayesian statistics, Fourier basis, FFT, generalized linear
More informationSpatial Modelling Using a New Class of Nonstationary Covariance Functions
Spatial Modelling Using a New Class of Nonstationary Covariance Functions Christopher J. Paciorek and Mark J. Schervish 13th June 2005 Abstract We introduce a new class of nonstationary covariance functions
More informationMeasurement Error in Spatial Modeling of Environmental Exposures
Measurement Error in Spatial Modeling of Environmental Exposures Chris Paciorek, Alexandros Gryparis, and Brent Coull August 9, 2005 Department of Biostatistics Harvard School of Public Health www.biostat.harvard.edu/~paciorek
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North
More informationNonstationary Covariance Functions for Gaussian Process Regression
Nonstationar Covariance Functions for Gaussian Process Regression Christopher J. Paciorek Department of Biostatistics Harvard School of Public Health Mark J. Schervish Department of Statistics Carnegie
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing
More informationDisease mapping with Gaussian processes
EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationTechnical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models
Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University
More informationNonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University
Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced
More informationHierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets
Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,
More informationSpatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter
Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Chris Paciorek Department of Biostatistics Harvard School of Public Health application joint
More informationStatistical and epidemiological considerations in using remote sensing data for exposure estimation
Statistical and epidemiological considerations in using remote sensing data for exposure estimation Chris Paciorek Department of Biostatistics Harvard School of Public Health Collaborators: Yang Liu, Doug
More informationEcon 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines
Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the
More informationrandom fields on a fine grid
Spatial models for point and areal data using Markov random fields on a fine grid arxiv:1204.6087v1 [stat.me] 26 Apr 2012 Christopher J. Paciorek Department of Biostatistics, Harvard School of Public Health
More informationA STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES
A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES JOHN STEPHENSON 1, CHRIS HOLMES, KERRY GALLAGHER 1 and ALEXANDRE PINTORE 1 Dept. Earth Science and Engineering, Imperial College,
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationSTATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS
STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS Eric Gilleland Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Supported
More informationHierarchical Modelling for Univariate Spatial Data
Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department
More informationModels for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data
Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise
More informationHierarchical Modeling for Univariate Spatial Data
Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationBayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson
Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n
More informationA Framework for Daily Spatio-Temporal Stochastic Weather Simulation
A Framework for Daily Spatio-Temporal Stochastic Weather Simulation, Rick Katz, Balaji Rajagopalan Geophysical Statistics Project Institute for Mathematics Applied to Geosciences National Center for Atmospheric
More informationBayesian smoothing of irregularly-spaced data using Fourier basis functions
Bayesian smoothing of irregularly-spaced data using Fourier basis functions Christopher J. Paciorek Department of Biostatistics, Harvard School of Public Health Abstract The spectral representation of
More informationNONSTATIONARY GAUSSIAN PROCESSES FOR REGRESSION AND SPATIAL MODELLING
CARNEGIE MELLON UNIVERSITY NONSTATIONARY GAUSSIAN PROCESSES FOR REGRESSION AND SPATIAL MODELLING A DISSERTATION SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS for the degree
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,
More informationWrapped Gaussian processes: a short review and some new results
Wrapped Gaussian processes: a short review and some new results Giovanna Jona Lasinio 1, Gianluca Mastrantonio 2 and Alan Gelfand 3 1-Università Sapienza di Roma 2- Università RomaTRE 3- Duke University
More informationNearest Neighbor Gaussian Processes for Large Spatial Data
Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns
More informationspbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models
spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments
More informationA short introduction to INLA and R-INLA
A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationSTA 4273H: Sta-s-cal Machine Learning
STA 4273H: Sta-s-cal Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 2 In our
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry
More informationPredicting Long-term Exposures for Health Effect Studies
Predicting Long-term Exposures for Health Effect Studies Lianne Sheppard Adam A. Szpiro, Johan Lindström, Paul D. Sampson and the MESA Air team University of Washington CMAS Special Session, October 13,
More informationBayesian Hierarchical Models
Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology
More informationSTA414/2104. Lecture 11: Gaussian Processes. Department of Statistics
STA414/2104 Lecture 11: Gaussian Processes Department of Statistics www.utstat.utoronto.ca Delivered by Mark Ebden with thanks to Russ Salakhutdinov Outline Gaussian Processes Exam review Course evaluations
More informationGaussian Process Regression
Gaussian Process Regression 4F1 Pattern Recognition, 21 Carl Edward Rasmussen Department of Engineering, University of Cambridge November 11th - 16th, 21 Rasmussen (Engineering, Cambridge) Gaussian Process
More informationParameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets
Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Matthias Katzfuß Advisor: Dr. Noel Cressie Department of Statistics The Ohio State University
More informationSummary STK 4150/9150
STK4150 - Intro 1 Summary STK 4150/9150 Odd Kolbjørnsen May 22 2017 Scope You are expected to know and be able to use basic concepts introduced in the book. You knowledge is expected to be larger than
More informationBayesian spatial quantile regression
Brian J. Reich and Montserrat Fuentes North Carolina State University and David B. Dunson Duke University E-mail:reich@stat.ncsu.edu Tropospheric ozone Tropospheric ozone has been linked with several adverse
More informationBayesian covariate models in extreme value analysis
Bayesian covariate models in extreme value analysis David Randell, Philip Jonathan, Kathryn Turnbull, Mathew Jones EVA 2015 Ann Arbor Copyright 2015 Shell Global Solutions (UK) EVA 2015 Ann Arbor June
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationWhat s for today. Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model
What s for today Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model c Mikyoung Jun (Texas A&M) Stat647 Lecture 11 October 2, 2012 1 / 23 Nonstationary
More informationOn Gaussian Process Models for High-Dimensional Geostatistical Datasets
On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May
More informationGAUSSIAN PROCESS REGRESSION
GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The
More informationThe Bayesian approach to inverse problems
The Bayesian approach to inverse problems Youssef Marzouk Department of Aeronautics and Astronautics Center for Computational Engineering Massachusetts Institute of Technology ymarz@mit.edu, http://uqgroup.mit.edu
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationBayesian non-parametric model to longitudinally predict churn
Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics
More informationSpatial Statistics with Image Analysis. Lecture L08. Computer exercise 3. Lecture 8. Johan Lindström. November 25, 2016
C3 Repetition Creating Q Spectral Non-grid Spatial Statistics with Image Analysis Lecture 8 Johan Lindström November 25, 216 Johan Lindström - johanl@maths.lth.se FMSN2/MASM25L8 1/39 Lecture L8 C3 Repetition
More informationDeblurring Jupiter (sampling in GLIP faster than regularized inversion) Colin Fox Richard A. Norton, J.
Deblurring Jupiter (sampling in GLIP faster than regularized inversion) Colin Fox fox@physics.otago.ac.nz Richard A. Norton, J. Andrés Christen Topics... Backstory (?) Sampling in linear-gaussian hierarchical
More informationThe Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology
The Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology Chris Paciorek and Brent Coull Department of Biostatistics Harvard School of Public Health wwwbiostatharvardedu/
More informationModel Selection for Gaussian Processes
Institute for Adaptive and Neural Computation School of Informatics,, UK December 26 Outline GP basics Model selection: covariance functions and parameterizations Criteria for model selection Marginal
More informationarxiv: v4 [stat.me] 14 Sep 2015
Does non-stationary spatial data always require non-stationary random fields? Geir-Arne Fuglstad 1, Daniel Simpson 1, Finn Lindgren 2, and Håvard Rue 1 1 Department of Mathematical Sciences, NTNU, Norway
More informationComparing Non-informative Priors for Estimation and. Prediction in Spatial Models
Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Vigre Semester Report by: Regina Wu Advisor: Cari Kaufman January 31, 2010 1 Introduction Gaussian random fields with specified
More informationInversion Base Height. Daggot Pressure Gradient Visibility (miles)
Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:
More informationModels for models. Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research
Models for models Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Outline Statistical models and tools Spatial fields (Wavelets) Climate regimes (Regression and clustering)
More informationThe Matrix Reloaded: Computations for large spatial data sets
The Matrix Reloaded: Computations for large spatial data sets The spatial model Solving linear systems Matrix multiplication Creating sparsity Doug Nychka National Center for Atmospheric Research Sparsity,
More informationState Space Representation of Gaussian Processes
State Space Representation of Gaussian Processes Simo Särkkä Department of Biomedical Engineering and Computational Science (BECS) Aalto University, Espoo, Finland June 12th, 2013 Simo Särkkä (Aalto University)
More informationAggregated cancer incidence data: spatial models
Aggregated cancer incidence data: spatial models 5 ième Forum du Cancéropôle Grand-est - November 2, 2011 Erik A. Sauleau Department of Biostatistics - Faculty of Medicine University of Strasbourg ea.sauleau@unistra.fr
More informationWeb Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.
Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we
More informationIntroduction to Geostatistics
Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,
More informationDirect Learning: Linear Classification. Donglin Zeng, Department of Biostatistics, University of North Carolina
Direct Learning: Linear Classification Logistic regression models for classification problem We consider two class problem: Y {0, 1}. The Bayes rule for the classification is I(P(Y = 1 X = x) > 1/2) so
More informationKneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"
Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/
More informationThe Matrix Reloaded: Computations for large spatial data sets
The Matrix Reloaded: Computations for large spatial data sets Doug Nychka National Center for Atmospheric Research The spatial model Solving linear systems Matrix multiplication Creating sparsity Sparsity,
More informationOn Bayesian Computation
On Bayesian Computation Michael I. Jordan with Elaine Angelino, Maxim Rabinovich, Martin Wainwright and Yun Yang Previous Work: Information Constraints on Inference Minimize the minimax risk under constraints
More informationGaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008
Gaussian processes Chuong B Do (updated by Honglak Lee) November 22, 2008 Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern:
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear
More informationStochastic Spectral Approaches to Bayesian Inference
Stochastic Spectral Approaches to Bayesian Inference Prof. Nathan L. Gibson Department of Mathematics Applied Mathematics and Computation Seminar March 4, 2011 Prof. Gibson (OSU) Spectral Approaches to
More informationHierarchical Modeling for Multivariate Spatial Data
Hierarchical Modeling for Multivariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department
More informationHandbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp
Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter
More informationNeutron inverse kinetics via Gaussian Processes
Neutron inverse kinetics via Gaussian Processes P. Picca Politecnico di Torino, Torino, Italy R. Furfaro University of Arizona, Tucson, Arizona Outline Introduction Review of inverse kinetics techniques
More informationHierarchical Modelling for Univariate Spatial Data
Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.
More informationSome general observations.
Modeling and analyzing data from computer experiments. Some general observations. 1. For simplicity, I assume that all factors (inputs) x1, x2,, xd are quantitative. 2. Because the code always produces
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is
More informationA Complete Spatial Downscaler
A Complete Spatial Downscaler Yen-Ning Huang, Brian J Reich, Montserrat Fuentes 1 Sankar Arumugam 2 1 Department of Statistics, NC State University 2 Department of Civil, Construction, and Environmental
More informationSub-kilometer-scale space-time stochastic rainfall simulation
Picture: Huw Alexander Ogilvie Sub-kilometer-scale space-time stochastic rainfall simulation Lionel Benoit (University of Lausanne) Gregoire Mariethoz (University of Lausanne) Denis Allard (INRA Avignon)
More informationSpatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016
Spatial Statistics Spatial Examples More Spatial Statistics with Image Analysis Johan Lindström 1 1 Mathematical Statistics Centre for Mathematical Sciences Lund University Lund October 6, 2016 Johan Lindström
More informationStudent-t Process as Alternative to Gaussian Processes Discussion
Student-t Process as Alternative to Gaussian Processes Discussion A. Shah, A. G. Wilson and Z. Gharamani Discussion by: R. Henao Duke University June 20, 2014 Contributions The paper is concerned about
More informationMulti-resolution models for large data sets
Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation NORDSTAT, Umeå, June, 2012 Credits Steve Sain, NCAR Tia LeRud, UC Davis
More informationFlexible Spatio-temporal smoothing with array methods
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and
More informationESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS
ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate
More informationIntroduction to Gaussian Processes
Introduction to Gaussian Processes Iain Murray murray@cs.toronto.edu CSC255, Introduction to Machine Learning, Fall 28 Dept. Computer Science, University of Toronto The problem Learn scalar function of
More informationReliability Monitoring Using Log Gaussian Process Regression
COPYRIGHT 013, M. Modarres Reliability Monitoring Using Log Gaussian Process Regression Martin Wayne Mohammad Modarres PSA 013 Center for Risk and Reliability University of Maryland Department of Mechanical
More informationPumps, Maps and Pea Soup: Spatio-temporal methods in environmental epidemiology
Pumps, Maps and Pea Soup: Spatio-temporal methods in environmental epidemiology Gavin Shaddick Department of Mathematical Sciences University of Bath 2012-13 van Eeden lecture Thanks Constance van Eeden
More informationBasics of Point-Referenced Data Models
Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45 Basics of Point-Referenced Data Models Basic
More informationA spatio-temporal model for extreme precipitation simulated by a climate model
A spatio-temporal model for extreme precipitation simulated by a climate model Jonathan Jalbert Postdoctoral fellow at McGill University, Montréal Anne-Catherine Favre, Claude Bélisle and Jean-François
More informationApproximate Bayesian computation for spatial extremes via open-faced sandwich adjustment
Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment Ben Shaby SAMSI August 3, 2010 Ben Shaby (SAMSI) OFS adjustment August 3, 2010 1 / 29 Outline 1 Introduction 2 Spatial
More informationMultivariate spatial modeling
Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced
More informationStatistícal Methods for Spatial Data Analysis
Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis
More informationComputer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)
Prof. Daniel Cremers 2. Regression (cont.) Regression with MLE (Rep.) Assume that y is affected by Gaussian noise : t = f(x, w)+ where Thus, we have p(t x, w, )=N (t; f(x, w), 2 ) 2 Maximum A-Posteriori
More informationInformation geometry for bivariate distribution control
Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic
More informationClimate Change: the Uncertainty of Certainty
Climate Change: the Uncertainty of Certainty Reinhard Furrer, UZH JSS, Geneva Oct. 30, 2009 Collaboration with: Stephan Sain - NCAR Reto Knutti - ETHZ Claudia Tebaldi - Climate Central Ryan Ford, Doug
More informationBayesian linear regression
Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding
More informationIntegrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University
Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y
More information