Low-rank methods and predictive processes for spatial models

Size: px
Start display at page:

Download "Low-rank methods and predictive processes for spatial models"

Transcription

1 Low-rank methods and predictive processes for spatial models Sam Bussman, Linchao Chen, John Lewis, Mark Risser with Sebastian Kurtek, Vince Vu, Ying Sun February 27, 2014

2 Outline Introduction and general notation Bayesian estimation via MCMC Classical estimation via fixed-rank kriging (FRK) Choice of expansion matrix H Predictive processes Evaluation criteria Improvements

3 Introduction: Full-Rank Geostatistical Setup Following Wikle (2010): {η(s)} - mean zero Gaussian Spatial process on domain D c η (s, r) = Cov(η(s), η(r)) Observations at n locations Data model Y = η + ɛ, ɛ Gau(0, Σ ɛ ) Y = (Y (s 1 ),..., Y (s n )) and η = (η(s 1 ),..., η(s n )) η Gau(0, Σ η ) Goal: predict the true process at locations {r 1,..., r m } denoted by η 0 = (η(r 1 ),..., η(r m ))

4 Introduction: Full-Rank Geostatistical Setup We want the predictive distribution [η 0 Y] = [η 0 η][η Y]dη Given the Gaussian assumptions, [η 0 Y] is conveniently Gaussian Mean and var-cov are functions of (Σ η + Σ ɛ ) 1 As we have established, inverting an n n matrix is hard for large n

5 Introduction: Reduced-Rank Hierarchical Parameterization Consider the alternative representation of the spatial model Y = η + ɛ, ɛ Gau(0, Σ ɛ ) η = Hα + ξ, ξ Gau(0, Σ ξ ) α Gau(0, Σ α ) α (α 1,..., α p ) with p << n H is the n p expansion matrix: maps the low dimensional α to the true spatial process η ξ accounts for the difference between η and Hα Model for latent process at unobserved locations is η 0 = H 0 α + ξ 0

6 Introduction: Reduced-Rank Hierarchical Parameterization Again: Gaussian in, Gaussian out [η 0 Y] has mean and var-cov depending on (HΣ α H + V ) 1 Making the usual assumption Σ ɛ = σ 2 I and setting V = Σ ξ + σ 2 I But, we still must invert an n n matrix Two work arounds: MCMC and fixed-ranked Kriging

7 MCMC Approach Consider the posterior [η, α y] induced by the hierarchical model η is Gaussian with mean and var-cov depending on I + Σ 1 ξ ) 1 ( 1 σ 2 ɛ To make this feasible we add the assumption that Σ ξ is diagonal α is also Gaussian. Only need to invert p p matrices Now [η 0 Y] = [η 0 α][α Y]dα where we have [η 0 α] = Gau(H 0 α, Σ ξ0 )

8 Fixed-rank kriging approach Fixed-rank kriging (FRK), from Cressie and Johannesson (2008), outlines how kriging (i.e., spatial best linear unbiased predictor or BLUP) can be applied in this reduced-rank parameterization of a spatial process.

9 Fixed-rank kriging approach Main idea: Primary computational cost in (general) kriging is computing (Σ η + σ 2 ɛ I n ) 1, a dense, n n matrix. For FRK, all we need do is calculate the inverse (HΣ α H + V) 1 = V 1 V 1 H(H V 1 H + Σ 1 α ) 1 H V 1, (using the Sherman-Morrison-Woodbury Formula) which only requires the simple inverse calculation of a diagonal matrix V = σ 2 ɛ I n + Σ ξ (assuming Σ ξ is diagonal) and the inverse of (dense) p p matrices (O(p 3 ) computations). See Cressie and Johannesson (2008) for the closed-form kriging equations.

10 Fixed-rank kriging approach The FRK spatial predictor and corresponding standard errors require the following calculations: 1 H (HΣ α H + V) 1 H, 2 H (HΣ α H + V) 1 a, and 3 (HΣ α H + V) 1 a, where a is a vector of length n. The computational cost for each of these calculations is 1 O(p 3 ), 2 O(np 2 ), and 3 O(np 2 ), so that the overall computational cost is O(np 2 ).

11 Fixed-rank kriging approach Two important components in this model: 1 Σ α, to be estimated from the data Cressie and Johannesson (2008) use a plug-in estimate ˆΣ α, calculated by minimizing a weighted Frobenius norm Classical geostatistical methods: variogram estimation (Zimmerman and Stein (2010)) Choose a parametric form for Σ α Σ α (θ); for low-dimensional θ use maximum likelihood or REML (Zimmerman (2010)) 2 H, determined by the analyst (fixed!) Orthogonal basis functions (so that H H = I), Non orthogonal basis functions (smoothing splines, wavelet, radial, multiresolutional)

12 Choice of expansion matrix H Reminder: η α Gau(Hα, Σ ξ ) α Gau(0, Σ α ) Want dimension of α to be small compared to n. Would love for Σ ξ to have simple form. If H is orthogonal then essentially we have no redundancy or duplicate info. Also, if Σ ξ = σ 2 ξ I then V = (σ2 ξ + σ2 ɛ )I = σ 2 v I so H V 1 H = σ 2 vh H = σ 2 vi which simplifies FRK and MCMC. Typically as p increases, the residual structure will have smaller and smaller scale spatial dependence so that Σ ξ = σ 2 ξ I is reasonable.

13 Choice of expansion matrix H Examples of Orthogonal Basis Functions: Fourier, orthogonal Polynomials, certain wavelets, eigenvectors of a specified covariance matrix How to choose? Not a lot of guidance on this. Driven perhaps by problem specific knowledge.

14 Choice of expansion matrix H Karhunen-Loeve Expansion: The continuous K-L expansion of a covariance function of a mean zero spatial process η(s) over a spatial domain D is c η (s, r) = E[η(s), η(r)] = λ k φ k (s)φ k (r) k=1 where φ k are the eigenfunctions and λ k are the eigenvalues of the Fredholm integral equation: c η (s, r)φ k (s)ds = λ k φ k (r) D Can expand the process using these basis functions: η(s) = k=1 α kφ k (s) (where α k is the projection of η). Truncating this to p dimensions minimizes the variance of the truncation error and thus is optimal. The continuous expansion is typically not used in practice due to discrete nature of data and issues with solving the above integral.

15 Choice of expansion matrix H When you have repeat observations over time (or you are willing to grid your data) then you can calculate an empirical covariance matrix for the spatial process. Then you can perform a PCA on your estimate of Σ η, i.e. ˆΣ η Φ = ΦΛ. The eigenvectors, Φ are known as the empirical orthogonal functions (EOFs). K-L PCA when each observation has equal area of influence. Now let H = Φ p and α = Φ pη where Φ p is now n p containing the first p eigenvectors.

16 Choice of expansion matrix H Nice: No assumption of stationarity required. Σ α is diagonal due to orthogonality If p is large enough to account for larger scale spatial variation then Σ ξ = σξ 2 I is likely reasonable. EOFs give maps of principal spatial structures. Not so nice: Not obvious how to handle prediction since ˆΣ η isn t available at unobserved locations. Have to interpolate eigenvectors somehow or play with ˆΣ η to make it into a valid covariance model with known covariance function. Directly estimating covariance of η is perhaps difficult since we can t observe it directly. If we have replication, can get an estimate of Σ Y and σ 2 ɛ and base our EOFs on ˆΣ Y ˆσ 2 ɛ

17 Choice of expansion matrix H Example: Modeling Migratory Patterns

18 Choice of expansion matrix H The model for the process was as follows: η t = Φα t + ξ t, ξ t Gau(0, Σ ξ ) α t = Mα t 1 + ζ t, ζ t Gau(0, Σ ζ ) M is the state transition matrix.

19 Choice of expansion matrix H Example: Modeling Migratory Patterns: The EOF maps Spatial Map of loc. 22 perc. of variation in first Evector= 0.38 Spatial Map of loc. 22 perc. of variation in second Evector= 0.07 Spatial Map of loc. 22 perc. of variation in third Evector= 0.04 yg xg Time Series of loc. 22 perc. of variation in first Evector= 0.38 A[1, ] yg xg Time Series of loc. 22 perc. of variation in second Evector= 0.07 A[2, ] yg xg Time Series of loc. 22 perc. of variation in third Evector= 0.04 A[3, ] Index Index Index

20 Choice of expansion matrix H Example: Simulation using first 5 EOFs Original values for 2002 Simulated Values yg yg xg xg

21 Predictive Process Model (Banerjee et al., 2008) Univariate Predictive Process Modeling Spatial regression model (parent model) Predictive process Y (s) = x T (s)β + ω(s) + ɛ(s) (1) ω(s) GP(0, C(s, s )), ɛ(s) IID N(0, τ 2 ) Given w = [ω(s i )]m i=1, spatial interpolant (kriging) over S 1 ω(s) = c T (s)c 1 w ω(s) GP(0, C(.)), C(s, s ) = c T (s)c 1 c(s ) Approximate ω(s) in (1) by ω(s). Inference requires inversion of an m m matrix, m << n (Sherman-Morrison-Woodbury Formula). 1 S - n observed locations, S - m selected knots; w MVN(0, C )

22 Predictive process modeling Low-rank approximation Previous notation: the spatial process is approximated by η Hα Under the PP notation: the spatial process is approximated by ω(s) c T (s)c 1 w So, α = w and H = c T (s)c 1.

23 Properties of Predictive Process Models ω(s) is an orthogonal projection of ω(s) onto the linear subspace generated by w. min f E[ ω(s) f (w ) w ] Exact interpolant at the knots Predictive process model has less variability than parent model Minimize reverse KL divergence of posterior p(w a Y ) 2 and its approximations min q KL{q(w a Y), p(w a Y)}, q(y w a ) = q(y w) p(w a Y) p(w a )p(y w) p(w a )q(y w ) = q(w a Y) 2 w a = (w, w ). w is the value over S. w is the value over S

24 Selection of Knots Spatial design Space filling knot selection based on geometric criteria (Nychka and Saltzman, 1998) Spatial balance of design locations is more efficient than simple random sampling. (Stevens Jr and Olsen, 2004) Diggle and Lophaven (2006) suggest augmenting the lattice with close pairs or infills (See Simulation I later) Evaluation of knots performance Comparison of covariance function from parent model with that of predictive process model Details: 200 locations are uniformly generated over a [0, 10] [0, 10] rectangle. Knots consist of a equally spaced grid. Matern covariance with σ 2 = 1, change range and smooth parameters

25 Selection of Knots Comparison of Covariance Function Fix range parameter φ = 2, set ν = 0.5, 1, 1.5, 5 Fix smooth parameter ν = 0.5, set φ = 2, 4, 6, 12 What matters is the size of the range relative to the spacing of the grid for the knots

26 Simulation I Goal: Evaluate the approximation; Choice of m, spatial design Details 3000 irregularly scattered (observed) locations over a domain; stationary anisotropic Matern covariance with smoothness (ν = 0.5), rotation angle (ψ = 45 ), rate of spatial decay (λ 1 = 300, λ 2 = 50) Run predictive process model with m = 144, 256, 529, spatial designs (regular grid; lattice plus close pair configuration; lattice plus infill configuration). For comparison, run parent model. Comparing run time and estimation/prediction results Results 3 parallel chains: 1000 iterations for burnin, 4000 iterations for inference h (m = 144), 1.5 h (m = 256), 4.25 h (m = 529), 18 h (parent model)

27 Simulation I 3 Fig 3(b) Posterior mean surface, lattice plus close pair, 256 knots Fig 3(c) Posterior mean surface, lattice plus infill, 256 knots 3 Table 2: lattice plus close pair design; Prediction - empirical coverage of 95% prediction intervals for a set of 100 hold-out locations

28 Predictive Processes Conclusion from Simulation I Improvement in estimation with increasing m, irrespectively of spatial design Although 144 knots are adequate for capturing β, higher knot densities are required for capturing anisotropic covariance parameters and nugget variance τ 2 Estimation is more sensitive to the number of knots than to the underlying design; Predictions are much more robust Simulation II(Section 5.1.2) locations over the same domain; non-stationary anisotropic Matern covariance (Paciorek and Schervish, 2006); parent model is infeasible computationally Comparison among number of knots, spatial designs gives similar results as Simulation I

29 Extensions Non-Gaussian first-stage models Y (s) Poisson or Bernoulli with E[Y (s)] = µ(s) η(s) = g(µ(s)) = X T (s)β + ω(s) + ɛ(s) Replace ω(s) by predictive process ω(s) Spatiotemporal versions Y (s, t) = x T (s, t)β + ω(s, t) + ɛ(s, t) Replace ω(s, t) by ω(s, t) = c(s, t) T C 1 w 4 Multivariate predictive process modelling (See section 3 (Banerjee et al., 2008) for the model and section 5.2 for real data example. ) 4 w - random variables at m knots over D [0, T ] with covariance matrix C ; c(s, t) - covariances of ω(s, t) with w

30 Performance of low-rank methods Low-rank methods give computational efficiency Obvious question: What do we lose by using the low-rank representation? (How much statistical efficiency is lost?)

31 Performance of low-rank methods Limitations on low rank approximations: Stein (2013) Evaluation criteria: Kullback-Leibler (KL) divergence For two probability measures P and Q, the KL divergence of Q from P, denoted KL(P, Q), equals ( )] dp KL(P, Q) = E P [log = E P [log(dp)] E P [log(dq)]. dq The KL divergence is simply the expected difference in loglikelihood when using model Q instead of P, when in fact P is the correct measure.

32 Performance of low-rank methods Under a mean-zero Gaussian process assumption on the spatial process η; i.e., then P = N n (0, Σ 0 ), Q = N n (0, Σ 1 ), 2KL(P, Q) = tr(σ 1 1 Σ 0) log Σ 1 + log Σ 0 n. The assumption is that P is the true (unknown) model for η and Q is the model we choose.

33 Performance of low-rank methods A low-rank form for Σ 1 is assumed; i.e., Σ 1 = σ 2 ɛ I n + R, where R is n n positive semidefinite with rank at most p << n (computational cost O(np 2 )). Results: Even for optimal R, the KL divergence will be large if the fine scale variations are not well-described by white noise (this would be the ξ term from before). When the nugget (σ 2 ɛ ) is small: the smoother the process, the worse the low-rank approximation does.

34 Performance of low-rank methods Alternative: assume Σ 1 is block diagonal (i.e., assume observations in different blocks are independent; not appropriate for most spatial data); with n/p blocks of size p this requires O ( 1 3 np2) computations. (Surprising) Result: When the error/nugget variance (σ 2 ɛ ) is sufficiently small and observations sufficiently dense, the low-rank approximation performs disastrously and the independent block model much better.

35 Performance of low-rank methods Numerical results Comparison of the low rank and independent block approximations, when the true correlation function is Matérn (stationary/isotropic), for n = 1200: 1 Fixed range, smoothness; vary rank/block size (p) and nugget effect. 2 Fixed smoothness, nugget effect; vary spatial range and rank/block size.

36 Performance of low-rank methods 1. Fixed range, smoothness; vary rank/block size and nugget effect: M.L. Stein / Spatial Statistics ( ) Fig. 1. KL divergences of low rank (curves) and independent block (+symbols) approximations to 1200 observation with spacing 1 and autocovariance function K(x) = e 12 x plus a nugget. Black (curves/+symbols) corresponds t 1200 of 0, medium gray to 0.01 and light gray to 0.1. Low rank approximations are best possible. L divergences of low rank (curves) and independent block (+symbols) approximations to 1200 observations on a line ing 1 and autocovariance function K(x) = e 12 x Fig. 2. Same as Fig. 1 except that the exponential autocovariance function is replaced by the Whittle autocovarianc plus a nugget. Black (curves/+symbols) corresponds to a nugget x K1(12 x ). ium gray to 0.01 and light gray to 0.1. Low rank approximations are best possible. Left: smoothness ν = 0.5 (Exponential); right: smoothness ν = 1 (Whittle) more favorable to the low rank approximation by considering positive nuggets and values smoothness parameter greater than 1. However, in my experience, values of much large are not common for environmental processes. Next consider varying the range of an exponential covariance function. One way to look at th KL divergences of low rank (solid lines) and independent block (+ symbols); of the range parameter is to fix the nugget and consider K(x) = e x for different values o black corresponds to a nuggetspectral of 0, density medium corresponding gray to tok(x) 0.01, = e x and is f (!) light = ( gray, so increasing range (dec 2 +! 2 ) to 0.1. ) corresponds to increasing variation at low frequencies but decreasing variation at suf high frequencies. For evenly spaced observations, the relationship between the eigenvalue covariance matrix and the spectrum is similar to the periodic case considered in Section 2, w Main idea: low-rank is better decay of thewith eigenvalues large being closely nugget, tied to the large decay ofsmoothness. the spectrum. Thus, when there is a effect, we should expect the low rank approximation to perform increasingly well for suf large ranges, especially for larger values of the rank, which is exactly what we see in the le of Fig. 3. Specifically, with the nugget fixed at 0.1 (the largest value in Fig. 1), the independe approximation dominates for short ranges for a wide range of block sizes, but as the range in the KL divergence monotonically increases for the independent block approximation but ev

37 Performance of low-rank methods 2. Fixed smoothness, nugget effect; vary spatial range and rank/block size: M.L. Stein / Spatial Statistics ( ) 11 KL divergences of low rank (black) and independent block (gray) approximations; nugget effect of 0.1. Line types indicate the block size/rank; the solid, dashed, dotted and dashdot curves correspond to B = 3, 12, 50 and 200 respectively. Fig. 3. KL divergences of low rank (black) and independent block (gray) approximations to 1200 observations on a line with 1 spacing and autocovariance function K(x) = e x plus a nugget of 0.1 (left panel) and K(x) = 12 1 e x plus a 1200 nugget of 0.1 (right panel). Black curves correspond to low rank approximations (best possible) and gray to independent block approximations. Line types indicate B, the block size, with the rank of the corresponding low rank approximation equal to B 1; the solid, dashed, dotted and dash dot curves correspond to B = 3, 12, 50 and 200 respectively. Main idea: in the presence of a nugget effect, low-rank performs very well as the range increases (i.e., low-rank approximations can capture long-range dependence well), especially for large rank. observations converges to a diagonal matrix plus a rank 1 matrix (the matrix with all elements equal to 1), so the KL divergence of the low rank approximation will tend to 0 as!1for any B 2. In contrast, the KL divergence for the independent block approximation will not tend to 0 even for

38 Improvements Low rank + taper: Sang and Huang (2012) Low rank performs poorly for short range dependence, tapering performs poorly for long range dependence: use both! Recall the decomposition η = Hα + ξ Use low rank methods to work with Σ α = Cov(α). Use a tapered covariance structure for Σ ξ = Cov(ξ). Then, Cov 1 (η) = (HΣ α H + Σ ξ ) 1 can be computed with less computational burden than full-rank model.

39 Improvements Low rank + GMRF: Nychka et al. (2013) The spatial process is represented as a sum of p independent processes η(s) = p k=1 η k(s), where each η k (s) is defined through a basis function expansion M k η k (s) = c kj φ kj (s). j=1 The {c kj } are stochastic (Gaussian) coefficients and the {φ kj ( )} are fixed radial (compactly supported) basis functions with differing ranges of dependence (multiresolutional). The basis functions are centered on gridded locations; thus the coefficients {c kj } can be estimated using GMRF techniques.

40 Bibliography I Banerjee, S., Gelfand, A. E., Finley, A. O., and Sang, H. (2008). Gaussian predictive process models for large spatial data sets. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70(4): Cressie, N. and Johannesson, G. (2008). Fixed rank kriging for very large spatial data sets. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70(1): Diggle, P. and Lophaven, S. (2006). Bayesian geostatistical design. Scandinavian Journal of Statistics, 33(1): Nychka, D., Bandyopadhyay, S., Hammerling, D., Lindgren, F., and Sain, S. (2013). A multi-resolution gaussian process model for the analysis of large spatial data sets. Unpublished draft, -(-):. Nychka, D. and Saltzman, N. (1998). Design of air-quality monitoring networks. In Case studies in environmental statistics, pages Springer. Paciorek, C. J. and Schervish, M. J. (2006). Spatial modelling using a new class of nonstationary covariance functions. Environmetrics, 17(5): Sang, H. and Huang, J. Z. (2012). A full scale approximation of covariance functions for large spatial data sets. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 74(1): Stein, M. L. (2013). Limitations on low rank approximations for covariance matrices of spatial data. Spatial Statistics, (0):. Stevens Jr, D. L. and Olsen, A. R. (2004). Spatially balanced sampling of natural resources. Journal of the American Statistical Association, 99(465): Wikle, C. K. (2010). Innovation and intellectual property rights. In Gelfand, A., Fuentes, M., Guttorp, P., and Diggle, P., editors, Handbook of Spatial Statistics, Chapman & Hall/CRC Handbooks of Modern Statistical Methods. Taylor & Francis. Zimmerman, D. (2010). Innovation and intellectual property rights. In Gelfand, A., Fuentes, M., Guttorp, P., and Diggle, P., editors, Handbook of Spatial Statistics, Chapman & Hall/CRC Handbooks of Modern Statistical Methods. Taylor & Francis. Zimmerman, D. and Stein, M. (2010). Innovation and intellectual property rights. In Gelfand, A., Fuentes, M., Guttorp, P., and Diggle, P., editors, Handbook of Spatial Statistics, Chapman & Hall/CRC Handbooks of Modern Statistical Methods. Taylor & Francis.

Geostatistical Modeling for Large Data Sets: Low-rank methods

Geostatistical Modeling for Large Data Sets: Low-rank methods Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016 Outline Motivation Low-rank

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

On Gaussian Process Models for High-Dimensional Geostatistical Datasets On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May

More information

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models arxiv:1811.03735v1 [math.st] 9 Nov 2018 Lu Zhang UCLA Department of Biostatistics Lu.Zhang@ucla.edu Sudipto Banerjee UCLA

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan

More information

Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets

Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets Huang Huang and Ying Sun CEMSE Division, King Abdullah University of Science and Technology, Thuwal, 23955, Saudi Arabia. July

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets Thomas Romary, Nicolas Desassis & Francky Fouedjio Mines ParisTech Centre de Géosciences, Equipe Géostatistique

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing

More information

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Spatial smoothing using Gaussian processes

Spatial smoothing using Gaussian processes Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1 OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation

A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Supplementary materials for this article are available at 10.1007/s13253-016-0251-8. A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Ryan J. Parker, Brian J. Reich,andJoEidsvik Spatial

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

A multi-resolution Gaussian process model for the analysis of large spatial data sets.

A multi-resolution Gaussian process model for the analysis of large spatial data sets. National Science Foundation A multi-resolution Gaussian process model for the analysis of large spatial data sets. Doug Nychka Soutir Bandyopadhyay Dorit Hammerling Finn Lindgren Stephen Sain NCAR/TN-504+STR

More information

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt.

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt. SINGAPORE SHANGHAI Vol TAIPEI - Interdisciplinary Mathematical Sciences 19 Kernel-based Approximation Methods using MATLAB Gregory Fasshauer Illinois Institute of Technology, USA Michael McCourt University

More information

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time Space-time data Observations taken over space and over time Z(s, t): indexed by space, s, and time, t Here, consider geostatistical/time data Z(s, t) exists for all locations and all times May consider

More information

A full-scale approximation of covariance functions for large spatial data sets

A full-scale approximation of covariance functions for large spatial data sets A full-scale approximation of covariance functions for large spatial data sets Huiyan Sang Department of Statistics, Texas A&M University, College Station, USA. Jianhua Z. Huang Department of Statistics,

More information

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Sudipto Banerjee University of California, Los Angeles, USA Based upon projects involving: Abhirup Datta (Johns Hopkins University)

More information

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 STATISTICAL INTERPOLATION METHODS

More information

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data An Additive Gaussian Process Approximation for Large Spatio-Temporal Data arxiv:1801.00319v2 [stat.me] 31 Oct 2018 Pulong Ma Statistical and Applied Mathematical Sciences Institute and Duke University

More information

arxiv: v4 [stat.me] 14 Sep 2015

arxiv: v4 [stat.me] 14 Sep 2015 Does non-stationary spatial data always require non-stationary random fields? Geir-Arne Fuglstad 1, Daniel Simpson 1, Finn Lindgren 2, and Håvard Rue 1 1 Department of Mathematical Sciences, NTNU, Norway

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.

More information

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012 A multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain October 10, 2012 Abstract

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Statistica Sinica (2013): Preprint 1 A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Ganggang Xu 1, Faming Liang 1 and Marc G. Genton 2 1 Texas A&M University

More information

CBMS Lecture 1. Alan E. Gelfand Duke University

CBMS Lecture 1. Alan E. Gelfand Duke University CBMS Lecture 1 Alan E. Gelfand Duke University Introduction to spatial data and models Researchers in diverse areas such as climatology, ecology, environmental exposure, public health, and real estate

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Overview of Spatial Statistics with Applications to fmri

Overview of Spatial Statistics with Applications to fmri with Applications to fmri School of Mathematics & Statistics Newcastle University April 8 th, 2016 Outline Why spatial statistics? Basic results Nonstationary models Inference for large data sets An example

More information

Theory and Computation for Gaussian Processes

Theory and Computation for Gaussian Processes University of Chicago IPAM, February 2015 Funders & Collaborators US Department of Energy, US National Science Foundation (STATMOS) Mihai Anitescu, Jie Chen, Ying Sun Gaussian processes A process Z on

More information

Nonparametric Bayesian Methods

Nonparametric Bayesian Methods Nonparametric Bayesian Methods Debdeep Pati Florida State University October 2, 2014 Large spatial datasets (Problem of big n) Large observational and computer-generated datasets: Often have spatial and

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Elizabeth C. Mannshardt-Shamseldin Advisor: Richard L. Smith Duke University Department

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

Regionalization of Multiscale Spatial Processes Using a Criterion for Spatial Aggregation Error. Christopher K. Wikle

Regionalization of Multiscale Spatial Processes Using a Criterion for Spatial Aggregation Error. Christopher K. Wikle Regionalization of Multiscale Spatial Processes Using a Criterion for Spatial Aggregation Error Christopher K. Wikle University of Missouri TAMU Spatial Workshop, January 2015 Collaborators: Jonathan Bradley

More information

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Financial Econometrics / 49

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Financial Econometrics / 49 State-space Model Eduardo Rossi University of Pavia November 2013 Rossi State-space Model Financial Econometrics - 2013 1 / 49 Outline 1 Introduction 2 The Kalman filter 3 Forecast errors 4 State smoothing

More information

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Approaches for Multiple Disease Mapping: MCAR and SANOVA Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

More information

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling François Bachoc former PhD advisor: Josselin Garnier former CEA advisor: Jean-Marc Martinez Department

More information

Point-Referenced Data Models

Point-Referenced Data Models Point-Referenced Data Models Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Point-Referenced Data Models Spring 2013 1 / 19 Objectives By the end of these meetings, participants should

More information

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES JOHN STEPHENSON 1, CHRIS HOLMES, KERRY GALLAGHER 1 and ALEXANDRE PINTORE 1 Dept. Earth Science and Engineering, Imperial College,

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative

More information

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Chris Paciorek March 11, 2005 Department of Biostatistics Harvard School of

More information

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques Institut für Numerische Mathematik und Optimierung Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques Oliver Ernst Computational Methods with Applications Harrachov, CR,

More information

A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging

A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging Cheng Li DSAP, National University of Singapore Joint work with Rajarshi Guhaniyogi (UC Santa Cruz), Terrance D. Savitsky (US Bureau of Labor

More information

On fixed effects estimation in spline-based semiparametric regression for spatial data

On fixed effects estimation in spline-based semiparametric regression for spatial data Libraries Conference on Applied Statistics in Agriculture 015-7th Annual Conference Proceedings On fixed effects estimation in spline-based semiparametric regression for spatial data Guilherme Ludwig University

More information

Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters

Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters CWP-815 Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters Dave Hale Center for Wave Phenomena, Colorado School of Mines, Golden CO 80401, USA a) b) c) Figure

More information

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53 State-space Model Eduardo Rossi University of Pavia November 2014 Rossi State-space Model Fin. Econometrics - 2014 1 / 53 Outline 1 Motivation 2 Introduction 3 The Kalman filter 4 Forecast errors 5 State

More information

A Multi-resolution Gaussian process model for the analysis of large spatial data sets.

A Multi-resolution Gaussian process model for the analysis of large spatial data sets. 1 2 3 4 A Multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain August 13, 2013 5

More information

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter

More information

Hierarchical Modeling for Spatio-temporal Data

Hierarchical Modeling for Spatio-temporal Data Hierarchical Modeling for Spatio-temporal Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

Latent Gaussian Processes and Stochastic Partial Differential Equations

Latent Gaussian Processes and Stochastic Partial Differential Equations Latent Gaussian Processes and Stochastic Partial Differential Equations Johan Lindström 1 1 Centre for Mathematical Sciences Lund University Lund 2016-02-04 Johan Lindström - johanl@maths.lth.se Gaussian

More information

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,

More information

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced

More information

A BAYESIAN SPATIO-TEMPORAL GEOSTATISTICAL MODEL WITH AN AUXILIARY LATTICE FOR LARGE DATASETS

A BAYESIAN SPATIO-TEMPORAL GEOSTATISTICAL MODEL WITH AN AUXILIARY LATTICE FOR LARGE DATASETS Statistica Sinica 25 (2015), 61-79 doi:http://dx.doi.org/10.5705/ss.2013.085w A BAYESIAN SPATIO-TEMPORAL GEOSTATISTICAL MODEL WITH AN AUXILIARY LATTICE FOR LARGE DATASETS Ganggang Xu 1, Faming Liang 1

More information

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs Centre for Mathematical Sciences Lund University Engineering geology Lund University Results A non-stationary extension The model Estimation Gaussian Markov random fields Basics Approximating Mate rn covariances

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets Matthias Katzfuß Advisor: Dr. Noel Cressie Department of Statistics The Ohio State University

More information

Bayesian Dynamic Linear Modelling for. Complex Computer Models

Bayesian Dynamic Linear Modelling for. Complex Computer Models Bayesian Dynamic Linear Modelling for Complex Computer Models Fei Liu, Liang Zhang, Mike West Abstract Computer models may have functional outputs. With no loss of generality, we assume that a single computer

More information

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes Anandamayee Majumdar, Debashis Paul and Dianne Bautista Department of Mathematics and Statistics, Arizona State University,

More information

Summary STK 4150/9150

Summary STK 4150/9150 STK4150 - Intro 1 Summary STK 4150/9150 Odd Kolbjørnsen May 22 2017 Scope You are expected to know and be able to use basic concepts introduced in the book. You knowledge is expected to be larger than

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data Sue Welham Rothamsted Research Harpenden UK AL5 2JQ December 1, 2008 1 We will start by reviewing the principles

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation Iowa State March, 2013 Credits Steve Sain, Tamra Greasby, NCAR Tia LeRud,

More information

Fractal functional regression for classification of gene expression data by wavelets

Fractal functional regression for classification of gene expression data by wavelets Fractal functional regression for classification of gene expression data by wavelets Margarita María Rincón 1 and María Dolores Ruiz-Medina 2 1 University of Granada Campus Fuente Nueva 18071 Granada,

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments

More information

Stochastic Spectral Approaches to Bayesian Inference

Stochastic Spectral Approaches to Bayesian Inference Stochastic Spectral Approaches to Bayesian Inference Prof. Nathan L. Gibson Department of Mathematics Applied Mathematics and Computation Seminar March 4, 2011 Prof. Gibson (OSU) Spectral Approaches to

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation NORDSTAT, Umeå, June, 2012 Credits Steve Sain, NCAR Tia LeRud, UC Davis

More information

Spatially-Varying Covariance Functions for Nonstationary Spatial Process Modeling

Spatially-Varying Covariance Functions for Nonstationary Spatial Process Modeling Spatially-Varying Covariance Functions for Nonstationary Spatial Process Modeling Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School

More information

Spatial Modeling and Prediction of County-Level Employment Growth Data

Spatial Modeling and Prediction of County-Level Employment Growth Data Spatial Modeling and Prediction of County-Level Employment Growth Data N. Ganesh Abstract For correlated sample survey estimates, a linear model with covariance matrix in which small areas are grouped

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Groundwater permeability

Groundwater permeability Groundwater permeability Easy to solve the forward problem: flow of groundwater given permeability of aquifer Inverse problem: determine permeability from flow (usually of tracers With some models enough

More information

The Bayesian approach to inverse problems

The Bayesian approach to inverse problems The Bayesian approach to inverse problems Youssef Marzouk Department of Aeronautics and Astronautics Center for Computational Engineering Massachusetts Institute of Technology ymarz@mit.edu, http://uqgroup.mit.edu

More information

Tutorial on Fixed Rank Kriging (FRK) of CO 2 data. M. Katzfuss, The Ohio State University N. Cressie, The Ohio State University

Tutorial on Fixed Rank Kriging (FRK) of CO 2 data. M. Katzfuss, The Ohio State University N. Cressie, The Ohio State University Tutorial on Fixed Rank Kriging (FRK) of CO 2 data M. Katzfuss, The Ohio State University N. Cressie, The Ohio State University Technical Report No. 858 July, 20 Department of Statistics The Ohio State

More information

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Gunter Spöck, Hannes Kazianka, Jürgen Pilz Department of Statistics, University of Klagenfurt, Austria hannes.kazianka@uni-klu.ac.at

More information

Linear Methods for Prediction

Linear Methods for Prediction Chapter 5 Linear Methods for Prediction 5.1 Introduction We now revisit the classification problem and focus on linear methods. Since our prediction Ĝ(x) will always take values in the discrete set G we

More information

Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA

Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA BY MIKYOUNG JUN AND MICHAEL L. STEIN Presented by Sungkyu Jung April, 2009 Outline 1 Introduction 2 Covariance Models 3 Application: Level

More information

Modified Linear Projection for Large Spatial Data Sets

Modified Linear Projection for Large Spatial Data Sets Modified Linear Projection for Large Spatial Data Sets Toshihiro Hirano July 15, 2014 arxiv:1402.5847v2 [stat.me] 13 Jul 2014 Abstract Recent developments in engineering techniques for spatial data collection

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1 FUNCTIONAL DATA ANALYSIS Contribution to the International Handbook (Encyclopedia) of Statistical Sciences July 28, 2009 Hans-Georg Müller 1 Department of Statistics University of California, Davis One

More information

Linear Regression. Aarti Singh. Machine Learning / Sept 27, 2010

Linear Regression. Aarti Singh. Machine Learning / Sept 27, 2010 Linear Regression Aarti Singh Machine Learning 10-701/15-781 Sept 27, 2010 Discrete to Continuous Labels Classification Sports Science News Anemic cell Healthy cell Regression X = Document Y = Topic X

More information

Cross-sectional space-time modeling using ARNN(p, n) processes

Cross-sectional space-time modeling using ARNN(p, n) processes Cross-sectional space-time modeling using ARNN(p, n) processes W. Polasek K. Kakamu September, 006 Abstract We suggest a new class of cross-sectional space-time models based on local AR models and nearest

More information

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS Eric Gilleland Douglas Nychka Geophysical Statistics Project National Center for Atmospheric Research Supported

More information

STAT 518 Intro Student Presentation

STAT 518 Intro Student Presentation STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Bayesian spatial hierarchical modeling for temperature extremes

Bayesian spatial hierarchical modeling for temperature extremes Bayesian spatial hierarchical modeling for temperature extremes Indriati Bisono Dr. Andrew Robinson Dr. Aloke Phatak Mathematics and Statistics Department The University of Melbourne Maths, Informatics

More information

GAUSSIAN PROCESS REGRESSION

GAUSSIAN PROCESS REGRESSION GAUSSIAN PROCESS REGRESSION CSE 515T Spring 2015 1. BACKGROUND The kernel trick again... The Kernel Trick Consider again the linear regression model: y(x) = φ(x) w + ε, with prior p(w) = N (w; 0, Σ). The

More information

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes Anandamayee Majumdar, Debashis Paul and Dianne Bautista Department of Mathematics and Statistics, Arizona State University,

More information

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016 Spatial Statistics Spatial Examples More Spatial Statistics with Image Analysis Johan Lindström 1 1 Mathematical Statistics Centre for Mathematical Sciences Lund University Lund October 6, 2016 Johan Lindström

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

Dynamically updated spatially varying parameterisations of hierarchical Bayesian models for spatially correlated data

Dynamically updated spatially varying parameterisations of hierarchical Bayesian models for spatially correlated data Dynamically updated spatially varying parameterisations of hierarchical Bayesian models for spatially correlated data Mark Bass and Sujit Sahu University of Southampton, UK June 4, 06 Abstract Fitting

More information

Analysis of AneuRisk65 data: warped logistic discrimination

Analysis of AneuRisk65 data: warped logistic discrimination Electronic Journal of Statistics Vol. () ISSN: 935-7524 DOI:./ Analysis of AneuRisk65 data: warped logistic discrimination Daniel Gervini Department of Mathematical Sciences University of Wisconsin Milwaukee

More information