A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation

Size: px
Start display at page:

Download "A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation"

Transcription

1 Supplementary materials for this article are available at /s A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Ryan J. Parker, Brian J. Reich,andJoEidsvik Spatial data are increasing in size and complexity due to technological advances. For an analysis of a large and diverse spatial domain, simplifying assumptions such as stationarity are questionable and standard computational algorithms are inadequate. In this paper, we propose a computationally efficient method to estimate a nonstationary covariance function. We partition the spatial domain into a fine grid of subregions and assign each subregion its own set of spatial covariance parameters. This introduces a large number of parameters and to stabilize the procedure we impose a penalty to spatially smooth the estimates. By penalizing the absolute difference between parameters for adjacent subregions, the solution can be identical for adjacent subregions and thus the method identifies stationary subdomains. To apply the method to large datasets, we use a block composite likelihood which is natural in this setting because it also operates on a partition of the spatial domain. The method is applied to tropospheric ozone in the US, and we find that the spatial covariance on the west coast differs from the rest of the country. Key Words: Spatial statistics; Nonstationary covariance; Regularization; Penalized likelihood. 1. INTRODUCTION Spatial datasets continue to grow in size and complexity, with satellites, for example, being able to collect spatiotemporal data on a large scale. A fundamental task of a spatial analysis is estimating the spatial covariance function. A common simplifying assumption is stationarity, i.e., that the covariance function is the same for the entire spatial domain and the covariance of the process at two locations depends only on their relative locations. However, for data collected over a large and diverse geographic domain, this assumption is untenable as we might expect a different covariance in different subregions (e.g., in coastal and mountainous regions). Therefore, our interest is in modeling a nonstationary covariance function for large spatial datasets. Ryan J. Parker (B), SAS Institute, Cary, NC, USA ( ryan.parker@jmp.com). Brian J. Reich, North Carolina State University, Raleigh, NC, USA. Jo Eidsvik, Norwegian University of Science and Technology, Trondheim, Norway International Biometric Society Journal of Agricultural, Biological, and Environmental Statistics DOI: /s

2 R. J. Parker et al. As a motivating example, we conduct a spatial analysis of ground-level ozone in the Continental US. As shown in Fig. 2, ozone is monitored at hundreds of locations and our objective is to interpolate ozone throughout the entire domain. Ozone maps are useful for regulation, conducting epidemiological studies, and monitoring temporal changes. Emission sources, atmospheric chemistry, and climate vary substantially across this large domain, and thus the stationarity assumption is questionable. Because of their complex structure, a variety of techniques have been studied to model spatial data having a nonstationary covariance. One of these techniques involves deforming the spatial region so that the deformed space can be modeled with a stationary covariance model (Sampson and Guttorp 1992). Nychka and Saltzman (1998), among others, have proposed the use of empirical orthogonal functions (EOF) to model the nonstationary covariance function using eigenfunctions. Constructing the nonstationary process in terms of kernel convolutions was proposed by Higdon (1998). The kernels used in this technique allow for complex nonstationary structures, with Paciorek and Schervish (2006) using this approach to introduce a nonstationary covariance model that is a function of any stationary correlation function. Due to the simplicity of working with stationary covariance models, Fuentes (2002) proposes the use of a mixture of independent stationary processes, with mixture weights coming from a kernel function that operates on spatial location. This construction allows for the covariance function to be modeled as a weighted average of stationary covariance functions. A more recent approach from Bornn et al. (2012) proposes expanding the observation space into higher dimensions where the process is stationary, such as when only two dimensions of a three-dimensional environmental process are observed. Sampson (2010) provides a detailed review of nonstationary methods. Typically nonstationary models are fit using datasets that are large enough that traditional computing methods are either not tractable or replaceable with more efficient methods. One of these techniques uses a covariance taper to introduce sparsity into the covariance matrix so that more efficient sparse matrix methods can be used for estimation and prediction (Furrer et al. 2006; Kaufman et al. 2008). Fixed rank kriging (Cressie and Johannesson 2008) decomposes the covariance matrix in terms of basis functions that are more tractable to work with than the full covariance matrix. Banerjee et al. (2008) propose the use of predictive process models that allow for a low rank covariance matrix to be used for computing (see also Finley et al. 2009). Lindgren et al. (2011) show that there is a link between Gaussian processes and Gaussian Markov random fields (GMRF) that can be used to exploit fast matrix operations under the GMRF. The use of approximate likelihoods has helped speed up computations (Stein et al. 2004; Fuentes 2007). Composite likelihood methods have also shown to be successful for estimation and prediction, with the block composite likelihood (BCL, Eidsvik et al. 2014) in particular having computations that are linear in the number of observations. We propose a method that models the nonstationary covariance using a penalized likelihood (Sect. 3) as an alternative to costly Bayesian computing. Penalized likelihoods have been used previously to model a nonstationary covariance, although these techniques differ fromours.chang et al. (2010) formulate a constrained least squares approach for estimating the nonstationary covariance by using basis functions to represent the spatial process (Hsu et al. (2012) extend this for spatiotemporal data). Our technique is novel in that instead

3 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation of using a semiparametric approach as Chang et al. (2010) have done, we work directly with covariance parameters from familiar stationary covariance models. Using the flexible nonstationary covariance of Paciorek and Schervish (2006) (see Sect. 2), we discretize the domain into subregions that each have a covariance parameter so that the covariance is allowed to vary over space. Our regularization comes through penalizing the difference between parameter values in neighboring subregions. When penalizing the absolute differences in these parameters, our method is capable of detecting stationary subdomains formed by neighboring subregions with the same covariance parameters. We believe this approach offers a unique and new contribution when modeling spatial data having a nonstationary covariance. We further anticipate increased applicability of such rich nonstationary models as the data size and complexity continue to grow. Our technique is demonstrated through a simulation study (Sect. 5) and an analysis of tropospheric ozone (Sect. 6). We conclude with a discussion and identify areas of future research in Sect NONSTATIONARY COVARIANCE MODEL Let y t (s) be the response at spatial location s for replicate t. We consider observing t = 1,...,T replications each at n spatial locations s i = (s i1, s i2 ) T, with y ti = y t (s i ) denoting the observation at the ith location in replication t. The observations are assumed to be the realization of a Gaussian process with mean function μ(s i ; β) and covariance function C(s i, s j ; θ), with independence between replications. This results in the joint density of y t = (y t1,...,y tn ) T having a multivariate normal distribution with mean vector μ(β) and covariance matrix (θ). Here, β and θ are the model parameters. The log-likelihood of these observations is l(β, θ; y) = 1 2 T t=1 A common stationary covariance model for C is where the Mahalanobis distance D ij is { log (θ) + [ y t μ(β) ] T (θ) 1 [ y t μ(β) ]}. (1) C(s i, s j ; θ) = τ 2 I (i = j) + σ 2 R S ( D ij ), (2) D ij = (s i s j ) T 1 (s i s j ). In this model, τ 2 is the nugget (nonspatial variance), σ 2 is the partial sill (spatial variance), and controls the range of spatial correlation. The stationary correlation function R S ( D ij ) that depends on the range can take a variety of forms, such as the familiar exponential correlation R S ( D ij ) = exp ( D ij ). Our interest is in the case when (θ) has a nonstationary covariance structure (see, for example, Sampson 2010). We choose to consider the flexible nonstationary covariance model of Paciorek and Schervish (2006). In the Paciorek and Schervish model, the nugget

4 R. J. Parker et al. τ 2, partial sill σ 2, and nonstationary kernel matrices are allowed to vary over space. The covariance function is C(s i, s j ; θ) = τ 2 (s i )I (i = j) + σ(s i )σ (s j )R NS (s i, s j ). (3) The nonstationary correlation function, R NS (s i, s j ) = (s i ) 1 4 (s j ) [ (si ) + (s j ) ] 2 1 R S ( D ij ), 2 is defined for any stationary correlation function R S ( ). In the nonstationary case, the Mahalanobis distance D ij is D ij = (s i s j ) T ( 1 2 [ (si ) + (s j ) ]) 1 (s i s j ). Hence stationarity is a special case of this correlation function when τ 2, σ 2, and are the same at all locations. The nonstationary kernel matrices (s) that model the range of spatial dependence can be constructed to model complex anisotropic correlation structures (such as in Higdon et al or Neto et al. 2014). We, however, will consider the simpler isotropic form (s) = ( ) φ 2 (s) 0 0 φ 2 (s) that assumes the range does not have an angular component. The parameter φ(s) then determines the range of spatial correlation in the vicinity of s. Because the parameters in this model vary over space, the model allows for the estimation of an anisotropic covariance even with diagonal (s). For example, when the range parameter φ(s) varies over space, the correlation is different depending on the direction of the neighboring location. We do not estimate the angular component, however, so our parameterization may not estimate the true covariance well when the data exhibit an explicit angular component. Fitting this model when the parameters vary over space is difficult. Paciorek and Schervish (2006) propose the use of a Gaussian process for each spatially varying parameter. This approach is costly due to the O(n 3 ) operations required to update parameters at each iteration in Bayesian inference. Banerjee et al. (2008) apply the predictive process model to (3), but they assume that there are a small number of known subregions that have fixed covariance parameters instead of each site having a covariance parameter. As described below, our method can be thought of as a mixture between these two approaches; on the one hand, we not only reduce the number of covariance parameters by dividing the domain into a discrete set of regions, but we also allow these parameters to vary smoothly over space without knowing these subregions a priori.

5 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation 3. COVARIANCE REGULARIZATION To reduce the number of parameters in the nonstationary covariance (3), we create a grid of R subregions, R 1,...,R R, that partition the region of interest S (see Fig. 2b). We assume that each subregion r has a different set of covariance parameters τr 2, σ r 2, and φ r so that, e.g., τ 2 (s) = τr 2 if s is in subregion R r. With this model construction, we can estimate covariance parameters that vary over space while reducing the total number of parameters needed to represent the covariance. Because we want to consider a fine grid of subregions, we propose regularizing the covariance parameters to determine where we can reduce the total parameters needed to represent the nonstationary covariance. We penalize the difference of parameter values in neighboring subregions, and this penalty allows us to borrow information between the neighboring subregions. To perform covariance regularization, we first place the unconstrained covariance parameters in the 3 R matrix θ so that θ 1r = log (τr 2), θ 2r = log (σr 2), and θ 3r = log (φ r ). Now we maximize the penalized log-likelihood l P (β, θ; y, λ) = l(β, θ; y) 3 λ i r 1 r 2 θ ir1 θ ir2 q, (4) where r 1 r 2 denotes that subregions r 1 and r 2 are neighbors. This penalized model has a tuning parameter λ i for each covariance parameter type to control the similarity between parameters in neighboring subregions. When λ i = 0 then parameter i corresponds to the full nonstationary model without a penalty on the parameter. When λ i = then parameter i is forced to be the same in all regions, just as if we were using the stationary model. The choice of q determines the type of regularization. When q = 1 we penalize the L 1 norm of the differences in parameters similar to variable fusion (Land and Friedman 1997)or the fused lasso (Tibshirani et al. 2005; Tibshirani and Taylor 2011). This penalty allows for the solution θ ir1 = θ ir2 for some pairs of neighbors, forming a stationary subdomain. Hence this penalty is useful when identifying these subdomains is an objective of the analysis. On the other hand, when q = 2, we penalize the squared L 2 norm of the differences of parameter values. This penalty bears resemblance to a conditionally autoregressive (CAR, Besag et al. 1991) prior for θ i. This never gives estimates θ ir1 = θ ir2, but it provides for easier computation and a potentially smoother surface of covariance parameter estimates. 4. COMPUTING DETAILS Nonstationary models are typically fit to large datasets, and so evaluating the full likelihood in (1) is burdensome because of the O(n 3 ) operations required when working with the large covariance matrix (θ). Therefore we choose to use the block composite likelihood (BCL) of Eidsvik et al. (2014) to perform fast estimation for large n. We have chosen to use this technique because computation scales linearly with the number of observations n and is parallelizable. Also, the full likelihood is a special case of the BCL framework that can be used when n is small enough so that the O(n 3 ) operations are not prohibitive.

6 R. J. Parker et al. The BCL is formed by creating a grid of B blocks, B 1,...,B B, that partition S. Note that the grid of blocks B can be different from the grid of subregions R used in the covariance regularization (Fig. 2c). The log-composite likelihood is then composed of the sum of joint log-likelihoods for all neighboring blocks. That is, the BCL is l BCL (β, θ; y) = b 1 b 2 l [ β, θ; (y b1, y b2 ) T ], (5) where y b are the observations in block b. To use the BCL in our regularized model, we add the penalty term (4) and maximize l PBCL (β, θ; y, λ) = l BCL (β, θ; y) 3 λ i r 1 r 2 θ ir1 θ ir2 q. (6) To perform parameter estimation, Algorithm 1 of Eidsvik et al. (2014) has been modified to account for the penalty term in (6). Estimation requires we compute derivatives of (6) with respect to each parameter in θ, as the regularization parameter λ is tuned using a validation set (see Sect. 4.1). For L 2 regularization, the derivatives of the penalty on the squared differences are straightforward to compute. To estimate under L 1 regularization, however, we must modify the penalty to alleviate the need to take the derivative of the absolute value terms. This modification is done by introducing parameters α to replace the L 1 term with an L 2 term as used in Lin and Zhang (2006) and Storlie et al. (2011), among others. With these new parameters, we maximize l BCL (β, θ; y) 3 α 1 r 1 r 2 ir 1 r 2 (θ ir1 θ ir2 ) 2 3 λ 0i r 1 r 2 α ir1 r 2. (7) Now we will show that the maximizer of (7) is the maximizer of (6) when q = 1. First, we will show that solving (6) implies solving (7). Suppose that β and θ maximize (6)fora given λ. Letλ 0i = 1 4 λ2 i and α ir1 r 2 = λ 1/2 0i θ ir1 θ ir2.itfollowsthat l BCL ( β, θ; y) 3 = l BCL ( β, θ; y) 2 = l BCL ( β, θ; y) α 1 r 1 r 2 3 ir 1 r 2 ( θ ir1 θ ir2 ) 2 λ 1/2 0i 3 λ i r 1 r 2 θ ir1 θ ir2 r 1 r 2 θ ir1 θ ir2. 3 λ 0i r 1 r 2 α ir1 r 2 Hence (6) equals (7) at the solution of (6). To conclude, we will now show that solving (7) implies solving (6). The optimal α jkl s in (7) satisfy α 2 jkl (θ jk θ jl ) 2 λ 0 j = 0, and

7 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation thus α jkl = (θ jk θ jl ) 2 λ 0 j. Substituting α ijk s into (7), we have l BCL (β, θ; y) 3 = l BCL (β, θ; y) 2 α 1 r 1 r 2 3 ir 1 r 2 (θ ir1 θ ir2 ) 2 λ 1/2 0i 3 λ 0i r 1 r 2 θ ir1 θ ir2. r 1 r 2 α ir1 r 2 By letting λ i = 2λ 1/2 0i,wehave(7) equivalent to (6) when q = 1. Hence maximizing (7) for β and θ leads to the equivalent solution in (6). Therefore, we have shown that estimation in (6) and (7) is equivalent CHOOSING TUNING PARAMETERS λ, R, AND B In this model we must make choices about three tuning parameters: (1) penalty λ, (2) subregion grid R 1,...,R R, and (3) grid of blocks B 1,...,B B for the BCL. For a fixed subregion grid and grid of BCL blocks, we choose the penalty λ using the maximum loglikelihood of a holdout set y h, l(y h ; β, θ, y f ), where y f are the observations used to compute parameter estimates β and θ (Bien and Tibshirani 2011). This maximum log-likelihood is computed over a single random holdout set or with cross-validation. Candidate penalty parameters are proposed using the algorithm below. To choose λ, we use an algorithm in the spirit of gradient directed regularization (Friedman and Popescu 2003). We must first choose a descending grid for each λ i, such as g = (, 250, 100, 25, 5, 0) T, which we use in our simulation study in Sect. 5. Thisgrid allows us to fit models with the same parameter in each region (g j = ) all the way down to models without any penalty (g j = 0). To start we set each λ i = g 1 corresponding to the stationary model. Next, we fit three different models by one-at-a-time setting λ j = g 2 and leaving the other λ i = g 1. If we find improvement (a larger log-likelihood) with one of these three models, then we choose the fit with the largest improvement and repeat this process of fitting three different models with the elements of g. Once we no longer find improvement then the algorithm terminates. The subregion grid can also be chosen with the above holdout set ideas. One way to tune the subregion grid would be to first start with one subregion corresponding to stationarity and continuing to add subregions until no more improvement is found in the log-likelihood on the holdout set. Alternatively, features of the observation space may direct the researcher to compare competing subregions to find the best for their data. Ultimately this can be time consuming and so we propose using a fine grid of subregions so that the penalization can approximate the subdomains of stationarity without costly tuning over many competing subregion grids. Finally, we propose setting the grid of BCL blocks based on the recommendations of Eidsvik et al. (2014). Because these blocks are used as a way to speed up computing time, they propose choosing the smallest number of observations in each block while maintaining statistical efficiency. The size of the blocks may depend on the features of the correlation

8 R. J. Parker et al. structure, such as a slower decay in spatial correlation (longer spatial range) potentially needing larger block sizes. We have found that 50 observations per block work well in our simulation study and real data example. Also, under more complex anisotropic structures, block sizes may need to be rectangular in shape to account for the directional differences in the correlation structure. 5. SIMULATION STUDY In this section, we design and analyze the results of a simulation study to better understand the performance of the regularized nonstationary model DESIGN In this study, over N = 100 simulation replications, we generate data from a nonstationary covariance with the exponential correlation function. These samples are generated using a2 2 square grid of four subregions in the unit square [0, 1] [0, 1], with each subregion having a nugget, partial sill, and range parameter. We consider three different covariance structures so that one of the three parameters varies from subregion to subregion, while the other two parameters are the same in each subregion. For example, we will allow the nugget to vary from subregion to subregion but have the partial sill and range parameter equal in each subregion. This allows us to gain insight into how sensitive this approach is to estimation and prediction performance when varying a single covariance parameter. Data are generated over T = 10 independent replications on an n = = 3, 025 grid in the unit square. The mean at each location is μ ti = β 0 + β 1 x ti with β 0 = 0 and β 1 = 1. The covariates x ti are generated independent and identically distributed from a standard normal distribution. The three nonstationary covariance structures are listed in Table 1. We compare the performance of the actual covariance (Oracle) to that of estimates obtained from (1) assuming stationarity, (2) L 1 regularization (q = 1), and (3) L 2 regularization (q = 2). For the regularized fits, we will compare an R = 4 4 = 16 subregion grid to an R = 5 5 = 25 subregion grid. These two choices for R will allow us to compare the case when the chosen grid of subregions fits perfectly into the true covariance (R = 16) to the case when the subregions overlap the true covariance (R = 25). The models are Table 1. Nonstationary covariance structures used to simulate data with the nugget (left), partial sill (center), and range (right) varying by region. Design τ 2 σ 2 φ Varying nugget Varying partial sill Varying range

9 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation fit using n f = 2, 525 of the observation locations, with n h = 500 randomly selected for a validation set. For the nonstationary models, 500 of the remaining n f observations are randomly chosen and used as a holdout set for choosing the penalty λ. After holding out the validation set, we construct B = 51 blocks for the BCL so that we have approximately 50 observations in each block (these blocks are constructed using k-means clustering on the observation locations). When fitting the regularized models we will allow all parameters in each subregion to vary since we would not expect to know beforehand which were fixed to be the same in each subregion. This further allows us to evaluate how well our penalized model handles this situation EVALUATING PERFORMANCE We evaluate the performance of the methods in our simulation study as follows. The overall model fit for each method is evaluated by computing the log-likelihood of the data in the validation set. To compare the estimation of the overall covariance matrix, we compute the Frobenius norm, (θ) ( ˆθ) F, of the difference between the actual and estimated covariance matrix for each method. The root mean squared error (RMSE) of the regression coefficient estimates of β j, 1 N N ( ˆβ (i) j β j ) 2, is used to evaluate how well the methods estimate the effect of model covariates. The integrated RMSE of each covariance parameter vector, for example, 1 N N 1 R R r=1 ( ) 2, ˆτ r 2 (i) τr 2 for the nugget, allows us to identify how well the methods estimate each covariance parameter type. Finally, we use the prediction MSE and coverage of 90 % prediction intervals for data in the validation set to evaluate the prediction power of each method RESULTS The results for data generated with a varying nugget are shown in Table 2, varying partial sill in Table 3, and varying range in Table 4. Where appropriate, we present relative differences (indicated by (Rel.) ) that are relative to either the oracle or stationary model. These allow us to easily compare the proportional differences between results for each method. Using paired t tests, we find that the differences in the log-likelihood and Frobenius norm are statistically significant when comparing the nonstationary and stationary models, giving us confidence that we are better estimating the full covariance in these cases. The

10 R. J. Parker et al. Table 2. Simulation study results when the nugget varies over space. Oracle Stationary Nonstationary L 1 L 2 L 1 L 2 Log-likelihood difference (θ) ( θ) F (Rel.) 41.7 (1) 17.5 (0.42) 16.6 (0.40) 23.9 (0.57) 28.2 (0.68) Integrated RMSE τ 2 (Rel.) 0.80 (1) 0.07 (0.08) 0.08 (0.09) 0.15 (0.19) 0.10 (0.13) Integrated RMSE σ 2 (Rel.) 0.02 (1) 0.03 (1.74) 0.03 (1.59) 0.05 (2.73) 0.03 (1.56) Integrated RMSE φ (Rel.) (1) (3.45) (1.73) (2.02) (2.41) RMSE β 0 (Rel.) 0.03 (1) 0.04 (1.10) 0.04 (1.09) 0.04 (1.09) 0.04 (1.10) 0.04 (1.10) RMSE β 1 (Rel.) (1) (1.20) (1.02) (1.02) (1.03) (1.03) Prediction MSE (Rel.) (1) (1.004) (1) 1.33 (1) (1.001) (1.001) Prediction coverage (0.90) The relative factors shown in parenthesis highlight the proportional differences between the methods and the Oracle. The Oracle model uses the true values of the covariance parameters, so for metrics evaluating covariance parameters the differences are relative to the stationary model. differences in prediction MSE between the nonstationary and stationary models are also statistically significant in all cases. When the nugget varies (Table 2), we see that the nonstationary models outperform the stationary model for overall model fit (log-likelihood) and estimating the nonstationary covariance (Frobenius norm). The nonstationary models also do better estimating the slope, β 1, with the stationary model being 1.20 times larger than the oracle estimate, whereas the estimates from the nonstationary fits are only 1.02 to 1.03 times larger. The integrated RMSE of τ 2 clearly shows that the flexibility of the nonstationary models is needed to estimate this varying parameter, with the error from the nonstationary models being less than 20 % of the size of the error from the stationary model depending on the structure of the subregion grid and type of penalty. For the partial sill and range, the estimates from the nonstationary model are not as precise as the stationary model. The errors are not large from a practical standpoint, and this is reinforced by the overall estimate of the covariance matrix being much better for the nonstationary models. Each method does a good job predicting, and they all have the expected coverage for the 90 % prediction intervals. In this case, the nonstationary models with either the L 1 or L 2 penalty perform well. When the partial sill varies (Table 3), the nonstationary models continue to perform well. The L 1 and L 2 regularization outperform the stationary model in terms of the log-likelihood and covariance matrix estimate, and as we expect the estimate of the partial sill is better for all of the nonstationary models. When using the overlapping grid of R = 5 5, the nonstationary models have some difficulty estimating the range parameter when compared relative to the stationary model. Estimating the overall covariance is still better under these conditions though, indicating these individual parameter estimates do not adversely impact the full covariance estimate. For the R = 4 4 grid, we do not experience the same difficulty, with the estimated nugget outperforming the stationary model. The estimate of the range is slightly better for the stationary model compared to this subregion grid, but again this is

11 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Table 3. Simulation study results when the partial sill varies over space. Oracle Stationary Nonstationary L 1 L 2 L 1 L 2 Log-likelihood difference (θ) ( θ) F (Rel.) (1) 37.8 (0.16) 50.7 (0.21) (0.71) (0.59) Integrated RMSE τ 2 (Rel.) 0.02 (1) 0.02 (0.71) 0.02 (0.80) 0.03 (1.27) 0.02 (0.94) Integrated RMSE σ 2 (Rel.) 0.55 (1) 0.04 (0.08) 0.05 (0.09) 0.18 (0.32) 0.13 (0.23) Integrated RMSE φ (Rel.) (1) (1.06) (1.46) (14.1) (6.83) RMSE β 0 (Rel.) 0.04 (1) 0.07 (1.94) 0.04 (1.13) 0.04 (1.11) 0.05 (1.43) 0.05 (1.49) RMSE β 1 (Rel.) (1) (1.28) (1.04) (1.04) (1.07) (1.08) Prediction MSE (Rel.) (1) 0.96 (1.003) (1.00) (1.00) 0.96 (1.003) 0.96 (1.003) Prediction coverage (0.90) The relative factors shown in parenthesis highlight the proportional differences between the methods and the Oracle. The Oracle model uses the true values of the covariance parameters, so for metrics evaluating covariance parameters the differences are relative to the stationary model. Table 4. Simulation study results when the range varies over space. Oracle Stationary Nonstationary L 1 L 2 L 1 L 2 Log-likelihood difference (θ) ( θ) F (Rel.) (1) 11.8 (0.11) 17.3 (0.16) 52.7 (0.50) 60.3 (0.57) Integrated RMSE τ 2 (Rel.) 0.16 (1) 0.01 (0.05) 0.02 (0.28) 0.01 (0.09) 0.11 (0.69) Integrated RMSE σ 2 (Rel.) 0.09 (1) 0.01 (0.13) 0.02 (0.28) 0.02 (0.26) 0.07 (0.80) Integrated RMSE φ (Rel.) (1) (0.12) (0.15) (0.26) 0.01 (0.32) RMSE β 0 (Rel.) 0.02 (1) 0.05 (3.01) 0.02 (1.19) 0.02 (1.22) 0.03 (1.53) 0.03 (1.63) RMSE β 1 (Rel.) (1) (1.22) (1.02) (1.02) (1.04) (1.05) Prediction MSE (Rel.) (1) (1.019) (1.00) (1.00) (1.01) (1.009) Prediction coverage (0.90) The relative factors shown in parenthesis highlight the proportional differences between the methods and the Oracle. The Oracle model uses the true values of the covariance parameters, so for metrics evaluating covariance parameters the differences are relative to the stationary model. not of practical importance given the size of the errors and the much better estimate of the overall covariance. Prediction performance continues to be very similar across all methods. When the range varies (Table 4), the nonstationary models perform the best compared to when the other parameters vary. Each of the covariance parameters is estimated very well in this case when compared to the stationary model regardless of the subregion grid or penalty used. In the varying nugget and varying partial sill cases, one of the nonvarying parameters would be estimated better with the stationary model, but now we have that both the L 1 and L 2 regularization are better estimating each of these parameters. The prediction MSE continues to be similar in all cases, but this case also has the largest difference in prediction MSE between the stationary model and the nonstationary models, highlighting the importance of modeling the range properly for prediction.

12 R. J. Parker et al (a) Varying Nugget (b) Varying Partial Sill (c) Varying Range Figure 1. The relative prediction MSE (stationary / L 2 prediction MSE) by location, grouped into an grid of locations, between the stationary model and the nonstationary model with L 2 regularization. These highlight the differences between prediction MSE by location, with southwest quadrant having larger differences relative to the rest of the space. All cases show this effect, with the largest effect by varying the nugget, then varying the range, and finally a larger but smaller effect for varying the partial sill. Finally, we explore the prediction performance by location. In Fig. 1, we plot the relative prediction MSE between the stationary model and the nonstationary model with L 2 regularization. These show that the prediction performance varies by location in our study. The southwest quadrant in particular shows larger prediction MSE in the stationary model relative to the nonstationary fit, with the parameter values in this region being smaller than the others around it. Although the prediction error is similar on the whole, we do find that specific locations can have improved prediction performance by using the nonstationary model over the stationary model. These results show that the nonstationary model estimates the true covariance much better than the stationary model, as we would expect, especially for the parameters that vary over space. For this blocked covariance structure, the nonstationary model also outperforms the stationary model when the grid of subregions does not perfectly fit into the true covariance. Nonstationary models (including the oracle model) do not notably improve prediction MSE compared to the stationary model in any of the three cases we considered. As others have found, such as Reich et al. (2011, Table 1), nonstationary models do not tend to dramatically improve prediction performance (further results are given in the supplemental materials). We not only find similar results overall, but also find that there are locations where prediction performance is improved using the nonstationary model. Also, we find that the MSE for estimating mean parameters β is smaller for the nonstationary models. Therefore when the objective is parameter estimation or understanding how the covariance changes across space, then the nonstationary method performs well, but if the objective is only spatial prediction, then the stationary model appears to be sufficient with the exception of the locational differences we found. In terms of computation, the time needed to estimate the parameters in the nonstationary model is higher than the stationary model. In this study we are able to estimate the stationary model in roughly 2 min. The nonstationary model, however, requires somewhere between 35 and 60 min for L 1 regularization and min for L 2 regularization. So computing time should also be a factor when deciding between models.

13 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation 6. ANALYZING THE SPATIAL COVARIANCE OF TROPOSPHERIC OZONE To demonstrate how the regularized nonstationary model performs on real data, we examine monitored ozone data for July 2005 along with model output for ozone from CMAQ version (Appel et al. 2013). CMAQ is a deterministic model that combines meteorology and emissions data with atmospheric chemistry equations to estimate daily ozone levels. The ozone is measured hourly, and we analyze the daily maximum 8-h average measured in parts-per-billion (ppb). We restrict our analysis to the n = 709 sites that have data for all T = 31 days of the month. In this analysis, all latitude and longitude coordinates have been projected to kilometers (km) using the Lambert Conformal Conic (LCC) projection with parameters defined by CMAQ. More details about this dataset are provided in Reich et al. (2014). In Fig. 2, we plot the observation locations (a), subregion grid of parameters for the regularized nonstationary model (b), and the BCL grid to speed up computing (c). The subregion grid (Fig. 2b) has R = 100 subregions. These subregions were created with k- means clustering on the observation locations, with the shape of each subregion formed by computing the Delaunay triangulation at the location of the 100 centroids from k-means. We do this because of the irregularly spaced observation locations so that there are not too many observation locations within any subregion. The BCL grid (Fig. 2c) was created in a similar way, except that the number of blocks was chosen to be B = 14 so that approximately 50 observation locations fall within each block. For these data, we fit models with two different mean structures. The first uses the CMAQ covariate available at each observation location so that the mean on day t at location s is μ t (s) = β 0 + β 1 CMAQ t (s). (8) This will allow us to estimate the residual nonstationary covariance after accounting for the CMAQ values, which may prove useful to modelers working to improve CMAQ. Alternatively, we will consider a polynomial on the location s so that we can remove any large scale trends before modeling the nonstationary covariance. That is, we assume the mean is μ t (s) = β 0 + β 1 s 1 + β 2 s 2 + β 3 s β 4s β 5s 1 s 2, (9) which is constant for each day. This mean structure is of interest because it does not remove the effect of CMAQ, allowing us to estimate the marginal nonstationary covariance structure to gain insight into the effects of long-range transport or common point sources. Each of these mean structures is fit with a stationary covariance, L 1 regularization, and L 2 regularization, giving us six models for comparison. We use the exponential covariance model, as Reich et al. (2014) demonstrate the estimated smoothness parameter for the Matèrn covariance is approximately 0.5, equivalent to the simpler exponential model. To evaluate model performance, we create a validation set by holding out all 31 days of data for 70 sites, which is approximately 10 % of our 709 observation locations. After

14 R. J. Parker et al. (a) Observation locations (b) Subregion grid (c) BCL grid Figure 2. Location of observations s 1,...,s n (a), the subregion grid having a covariance parameter in each subregion R 1,...,R R (b), and the BCL grid B 1,...,B B (c).

15 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Table 5. A comparison between the stationary (S), L 1 nonstationary, and L 2 nonstationary covariance models. Mean Cov LL All Excluding CA Only CA Pred MSE Pred SD Pred MSE Pred SD Pred MSE Pred SD CMAQ S L L Poly S L L This table compares log-likelihood of the validation set (LL) along with prediction error (Pred MSE) and prediction standard deviations (Pred SD) over sites in the validation set for the entire United States, excluding California, and only in California. Note the prediction MSE and SD are computed on a centered and scaled version of the original data so that the observations have mean zero and variance one. fitting each model to the remaining data, we compute the log-likelihood (LL), prediction mean squared error (Pred MSE), and prediction standard deviation (Pred SD) for the data in the validation set. These are computed for all of the observation locations along with just locations in California. The results are presented in Table 5. These results show that the nonstationary models are preferred to the stationary model, as the log-likelihood is larger for the L 1 and L 2 regularized models when compared to the stationary model. The overall prediction MSE is not significantly different between the stationary and nonstationary models, although this is different when looking at the west coast. In California specifically, we find that the L 2 model does as much as 15 % better for the model with CMAQ and 11 % better for the polynomial mean models. As the parameter estimates suggest in Fig. 3, this region of the country is where we find more nonstationary features, and these results show that we can indeed perform better for these more complex structures. These plots of the parameter estimates also highlight an attractive feature of the L 1 regularization: the ability to fuse parameters in neighboring subregions to detect subdomains of stationarity. The nugget (a) and range (c) parameters are estimated to be the same over the entire country for the L 1 penalty, with the partial sill (b) estimates being fused for large regions, especially in the east. Instead of a unique parameter in each of the 100 subregions, we have a much smaller set to work with due to this parameter fusion. Plots of the predictions using the polynomial mean (9) for each covariance model along with differences relative to the stationary model are shown in Fig. 4. Although the predictions are very similar between the models for the entire United States (a), the prediction standard deviations (b) vary between the stationary and regularized models. The southwest region in particular is noteworthy, as the prediction variance is smaller under the stationary model. Predictions for only California are shown in (c) along with associated prediction standard deviations (d). These highlight that the nonstationary models do a better job of detecting lower ozone levels in the southeastern region of the state. One other key difference is near the borders of New Mexico, Texas, and Mexico. We do not have many observations in this region, and the prediction uncertainty under the nonstationary model is high relative to the stationary model (Fig. 4b).

16 R. J. Parker et al. CMAQ: L1 Polynomial: L (a) Nugget CMAQ: L1 Polynomial: L (b) Partial sill CMAQ: L Polynomial: L1 (c) Range Figure 3. Estimated parameter values for the nugget (a), partial sill (b), and range (c)forthel 1 and L 2 nonstationary covariance models along with the CMAQ and polynomial mean functions. 7. DISCUSSION In this paper, we have developed a computationally efficient method for estimating a nonstationary spatial covariance function. Using a penalty to smooth the covariance parameters over space provides stability and identifies stationary subdomains in the L 1 case. For both simulated and real data, we find this method improves estimation of the covariance function compared to the stationary model. Future work includes allowing for anisotropy and Matèrn covariance. These extensions would increase the computational burden, but would not require fundamental changes. Computation may be improved using parallel computing. Since the BCL sums over pairs of blocks, it is embarrassingly parallelizable. Eidsvik et al. (2014) achieve 100-fold speedups via parallelization in some stationary cases and similar gains should be possible in our nonstationary setting. We have assumed in this paper that the data consist of several independent realizations from the spatial process. The method can also be applied to a single realization. Also, we conjecture that our method will produce reasonable estimates (with questionable uncertainty measures) even if the realizations are dependent over time with separable spatiotemporal covariance so that each realization has the same spatial covariance. In the case of strong temporal dependence, a possibility is to analyze the differences between

17 Author's personal copy A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Stationary L1 S L1 Difference S Difference (a) Stationary L US Predictions S/L1 Ratio S/ Ratio (b) US Prediction SD Stationary L1 S L1 Difference S Difference (c) CA Predictions Stationary L S/L1 Ratio S/ Ratio (d) CA Prediction SD Figure 4. Predictions and associated prediction standard deviations (in terms of ppb) for the United States (a, b) and California (c, d) using the stationary covariance along with the L 1 and L 2 regularized nonstationary covariance models. Also included are the differences in predictions (a and c) as well as the prediction SD ratios (b and d). adjacent time points which may be approximately independent. However, we concede that our approach is inappropriate for estimating nonstationary and nonseparable spatiotemporal covariance functions, and we leave this as future work. [Received June Accepted April 2016.] REFERENCES Appel, K., Pouliot, G., Simon, H., Sarwar, G., Pye, H., Napelenok, S., Akhtar, F., and Roselle, S. (2013), Evaluation of dust and trace metal estimates from the Community Multiscale Air Quality (CMAQ) model version 5.0, Geoscientific Model Development, 6, Banerjee, S., Gelfand, A. E., Finley, A. O., and Sang, H. (2008), Gaussian predictive process models for large spatial data sets, Journal of the Royal Statistical Society: Series B, 70, Besag, J., York, J., and Mollié, A. (1991), Bayesian image restoration, with two applications in spatial statistics, Annals of the Institute of Statistical Mathematics, 43, Bien, J. and Tibshirani, R. J. (2011), Sparse estimation of a covariance matrix, Biometrika, 98,

18 R. J. Parker et al. Bornn, L., Shaddick, G., and Zidek, J. V. (2012), Modeling nonstationary processes through dimension expansion, Journal of the American Statistical Association, 107, Chang, Y.-M., Hsu, N.-J., and Huang, H.-C. (2010), Semiparametric estimation and selection for nonstationary spatial covariance functions, Journal of Computational and Graphical Statistics, 19, Cressie, N. and Johannesson, G. (2008), Fixed rank kriging for very large spatial data sets, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 70, Eidsvik, J., Shaby, B. A., Reich, B. J., Wheeler, M., and Niemi, J. (2014), Estimation and prediction in spatial models with block composite likelihoods, Journal of Computational and Graphical Statistics, 23, Finley, A. O., Sang, H., Banerjee, S., and Gelfand, A. E. (2009), Improving the performance of predictive process modeling for large datasets, Computational Statistics & Data Analysis, 53, Friedman, J. and Popescu, B. E. (2003), Gradient directed regularization for linear regression and classification, Tech. rep., Statistics Department, Stanford University. Fuentes, M. (2002), Spectral methods for nonstationary spatial processes, Biometrika, 89, (2007), Approximate likelihood for large irregularly spaced spatial data, Journal of the American Statistical Association, 102, Furrer, R., Genton, M. G., and Nychka, D. (2006), Covariance tapering for interpolation of large spatial datasets, Journal of Computational and Graphical Statistics, 15. Higdon, D. (1998), A process-convolution approach to modelling temperatures in the North Atlantic Ocean, Environmental and Ecological Statistics, 5, Higdon, D., Swall, J., and Kern, J. (1999), Non-stationary spatial modeling, in Bayesian Statistics 6, pp Hsu, N.-J., Chang, Y.-M., and Huang, H.-C. (2012), A group lasso approach for non-stationary spatial-temporal covariance estimation, Environmetrics, 23, Kaufman, C. G., Schervish, M. J., and Nychka, D. W. (2008), Covariance tapering for likelihood-based estimation in large spatial data sets, Journal of the American Statistical Association, 103, Land, S. R. and Friedman, J. H. (1997), Variable fusion: A new adaptive signal regression method, Tech. rep., Department of Statistics, Carnegie Mellon University, Pittsburgh. Lin, Y. and Zhang, H. H. (2006), Component selection and smoothing in smoothing spline analysis of variance models, Annals of Statistics, 34, Lindgren, F., Rue, H., and Lindström, J. (2011), An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73, Neto, J. H. V., Schmidt, A. M., and Guttorp, P. (2014), Accounting for spatially varying directional effects in spatial covariance structures, Journal of the Royal Statistical Society: Series C (Applied Statistics), 63, Nychka, D. and Saltzman, N. (1998), Design of air-quality monitoring networks, in Case studies in environmental statistics, Springer, pp Paciorek, C. J. and Schervish, M. J. (2006), Spatial modelling using a new class of nonstationary covariance functions, Environmetrics, 17, Reich, B. J., Chang, H. H., and Foley, K. M. (2014), A spectral method for spatial downscaling, Biometrics, 70, Reich, B. J., Eidsvik, J., Guindani, M., Nail, A. J., and Schmidt, A. M. (2011), A class of covariate-dependent spatiotemporal covariance functions, The annals of applied statistics, 5, Sampson, P. D. (2010), Constructions for Nonstationary Spatial Processes, in Handbook of Spatial Statistics, eds. Gelfand, A. E., Diggle, P. J., Fuentes, M., and Guttorp, P., CRC Press, chap. 9. Sampson, P. D. and Guttorp, P. (1992), Nonparametric estimation of nonstationary spatial covariance structure, Journal of the American Statistical Association, 87, Stein, M. L., Chi, Z., and Welty, L. J. (2004), Approximating likelihoods for large spatial data sets, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 66, Storlie, C. B., Bondell, H. D., Reich, B. J., and Zhang, H. H. (2011), Surface estimation, variable selection, and the nonparametric oracle property, Statistica Sinica, 21, 679.

19 A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation Tibshirani, R., Saunders, M., Rosset, S., Zhu, J., and Knight, K. (2005), Sparsity and smoothness via the fused lasso, Journal of the Royal Statistical Society: Series B, 67, Tibshirani, R. J. and Taylor, J. (2011), The solution path of the generalized lasso, Annals of Statistics, 39,

Spatial smoothing using Gaussian processes

Spatial smoothing using Gaussian processes Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1 OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

arxiv: v4 [stat.me] 14 Sep 2015

arxiv: v4 [stat.me] 14 Sep 2015 Does non-stationary spatial data always require non-stationary random fields? Geir-Arne Fuglstad 1, Daniel Simpson 1, Finn Lindgren 2, and Håvard Rue 1 1 Department of Mathematical Sciences, NTNU, Norway

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan

More information

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing

More information

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets Thomas Romary, Nicolas Desassis & Francky Fouedjio Mines ParisTech Centre de Géosciences, Equipe Géostatistique

More information

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter Chris Paciorek Department of Biostatistics Harvard School of Public Health application joint

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

Latent Gaussian Processes and Stochastic Partial Differential Equations

Latent Gaussian Processes and Stochastic Partial Differential Equations Latent Gaussian Processes and Stochastic Partial Differential Equations Johan Lindström 1 1 Centre for Mathematical Sciences Lund University Lund 2016-02-04 Johan Lindström - johanl@maths.lth.se Gaussian

More information

Bayesian spatial quantile regression

Bayesian spatial quantile regression Brian J. Reich and Montserrat Fuentes North Carolina State University and David B. Dunson Duke University E-mail:reich@stat.ncsu.edu Tropospheric ozone Tropospheric ozone has been linked with several adverse

More information

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

On Gaussian Process Models for High-Dimensional Geostatistical Datasets On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May

More information

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets Sudipto Banerjee University of California, Los Angeles, USA Based upon projects involving: Abhirup Datta (Johns Hopkins University)

More information

Geostatistical Modeling for Large Data Sets: Low-rank methods

Geostatistical Modeling for Large Data Sets: Low-rank methods Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016 Outline Motivation Low-rank

More information

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES JOHN STEPHENSON 1, CHRIS HOLMES, KERRY GALLAGHER 1 and ALEXANDRE PINTORE 1 Dept. Earth Science and Engineering, Imperial College,

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

What s for today. Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model

What s for today. Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model What s for today Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model c Mikyoung Jun (Texas A&M) Stat647 Lecture 11 October 2, 2012 1 / 23 Nonstationary

More information

Generalized Elastic Net Regression

Generalized Elastic Net Regression Abstract Generalized Elastic Net Regression Geoffroy MOURET Jean-Jules BRAULT Vahid PARTOVINIA This work presents a variation of the elastic net penalization method. We propose applying a combined l 1

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

Low-rank methods and predictive processes for spatial models

Low-rank methods and predictive processes for spatial models Low-rank methods and predictive processes for spatial models Sam Bussman, Linchao Chen, John Lewis, Mark Risser with Sebastian Kurtek, Vince Vu, Ying Sun February 27, 2014 Outline Introduction and general

More information

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation A Framework for Daily Spatio-Temporal Stochastic Weather Simulation, Rick Katz, Balaji Rajagopalan Geophysical Statistics Project Institute for Mathematics Applied to Geosciences National Center for Atmospheric

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Gaussian Process Regression Model in Spatial Logistic Regression

Gaussian Process Regression Model in Spatial Logistic Regression Journal of Physics: Conference Series PAPER OPEN ACCESS Gaussian Process Regression Model in Spatial Logistic Regression To cite this article: A Sofro and A Oktaviarina 018 J. Phys.: Conf. Ser. 947 01005

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract Statistical Models for Monitoring and Regulating Ground-level Ozone Eric Gilleland 1 and Douglas Nychka 2 Abstract The application of statistical techniques to environmental problems often involves a tradeoff

More information

The Matrix Reloaded: Computations for large spatial data sets

The Matrix Reloaded: Computations for large spatial data sets The Matrix Reloaded: Computations for large spatial data sets Doug Nychka National Center for Atmospheric Research The spatial model Solving linear systems Matrix multiplication Creating sparsity Sparsity,

More information

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation

More information

The Matrix Reloaded: Computations for large spatial data sets

The Matrix Reloaded: Computations for large spatial data sets The Matrix Reloaded: Computations for large spatial data sets The spatial model Solving linear systems Matrix multiplication Creating sparsity Doug Nychka National Center for Atmospheric Research Sparsity,

More information

Nonstationary cross-covariance models for multivariate processes on a globe

Nonstationary cross-covariance models for multivariate processes on a globe Nonstationary cross-covariance models for multivariate processes on a globe Mikyoung Jun 1 April 15, 2011 Abstract: In geophysical and environmental problems, it is common to have multiple variables of

More information

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling François Bachoc former PhD advisor: Josselin Garnier former CEA advisor: Jean-Marc Martinez Department

More information

Fusing point and areal level space-time data. data with application to wet deposition

Fusing point and areal level space-time data. data with application to wet deposition Fusing point and areal level space-time data with application to wet deposition Alan Gelfand Duke University Joint work with Sujit Sahu and David Holland Chemical Deposition Combustion of fossil fuel produces

More information

Reduction of Model Complexity and the Treatment of Discrete Inputs in Computer Model Emulation

Reduction of Model Complexity and the Treatment of Discrete Inputs in Computer Model Emulation Reduction of Model Complexity and the Treatment of Discrete Inputs in Computer Model Emulation Curtis B. Storlie a a Los Alamos National Laboratory E-mail:storlie@lanl.gov Outline Reduction of Emulator

More information

Some Topics in Convolution-Based Spatial Modeling

Some Topics in Convolution-Based Spatial Modeling Some Topics in Convolution-Based Spatial Modeling Catherine A. Calder 1 and Noel Cressie Department of Statistics The Ohio State University 1958 Neil Ave. Columbus, OH 43210 USA Appears in the Proceedings

More information

Flexible Spatio-temporal smoothing with array methods

Flexible Spatio-temporal smoothing with array methods Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS046) p.849 Flexible Spatio-temporal smoothing with array methods Dae-Jin Lee CSIRO, Mathematics, Informatics and

More information

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable

More information

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Chris Paciorek March 11, 2005 Department of Biostatistics Harvard School of

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets

Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets Huang Huang and Ying Sun CEMSE Division, King Abdullah University of Science and Technology, Thuwal, 23955, Saudi Arabia. July

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation NORDSTAT, Umeå, June, 2012 Credits Steve Sain, NCAR Tia LeRud, UC Davis

More information

Estimation of non-stationary spatial covariance structure

Estimation of non-stationary spatial covariance structure Estimation of non-stationary spatial covariance structure DAVID J NOTT Department of Statistics, University of New South Wales, Sydney 2052, Australia djn@mathsunsweduau WILLIAM T M DUNSMUIR Division of

More information

A COVARIANCE PARAMETER ESTIMATION METHOD FOR POLAR-ORBITING SATELLITE DATA

A COVARIANCE PARAMETER ESTIMATION METHOD FOR POLAR-ORBITING SATELLITE DATA Statistica Sinica 25 (2015), 41-59 doi:http://dx.doi.org/10.5705/ss.2013.227w A COVARIANCE PARAMETER ESTIMATION METHOD FOR POLAR-ORBITING SATELLITE DATA Michael T. Horrell and Michael L. Stein University

More information

Overview of Spatial Statistics with Applications to fmri

Overview of Spatial Statistics with Applications to fmri with Applications to fmri School of Mathematics & Statistics Newcastle University April 8 th, 2016 Outline Why spatial statistics? Basic results Nonstationary models Inference for large data sets An example

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe Scandinavian Journal of Statistics, Vol. 38: 726 747, 2011 doi: 10.1111/j.1467-9469.2011.00751.x Published by Blackwell Publishing Ltd. Non-stationary Cross-Covariance Models for Multivariate Processes

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Journal of Data Science 9(2011), 549-564 Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty Masaru Kanba and Kanta Naito Shimane University Abstract: This paper discusses the

More information

P -spline ANOVA-type interaction models for spatio-temporal smoothing

P -spline ANOVA-type interaction models for spatio-temporal smoothing P -spline ANOVA-type interaction models for spatio-temporal smoothing Dae-Jin Lee 1 and María Durbán 1 1 Department of Statistics, Universidad Carlos III de Madrid, SPAIN. e-mail: dae-jin.lee@uc3m.es and

More information

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz   bruno Bruno Sansó Department of Applied Mathematics and Statistics University of California Santa Cruz http://www.ams.ucsc.edu/ bruno Climate Models Climate Models use the equations of motion to simulate changes

More information

Gaussian Processes 1. Schedule

Gaussian Processes 1. Schedule 1 Schedule 17 Jan: Gaussian processes (Jo Eidsvik) 24 Jan: Hands-on project on Gaussian processes (Team effort, work in groups) 31 Jan: Latent Gaussian models and INLA (Jo Eidsvik) 7 Feb: Hands-on project

More information

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs Centre for Mathematical Sciences Lund University Engineering geology Lund University Results A non-stationary extension The model Estimation Gaussian Markov random fields Basics Approximating Mate rn covariances

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y

More information

A multi-resolution Gaussian process model for the analysis of large spatial data sets.

A multi-resolution Gaussian process model for the analysis of large spatial data sets. National Science Foundation A multi-resolution Gaussian process model for the analysis of large spatial data sets. Doug Nychka Soutir Bandyopadhyay Dorit Hammerling Finn Lindgren Stephen Sain NCAR/TN-504+STR

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

Spatial Modelling Using a New Class of Nonstationary Covariance Functions

Spatial Modelling Using a New Class of Nonstationary Covariance Functions Spatial Modelling Using a New Class of Nonstationary Covariance Functions Christopher J. Paciorek and Mark J. Schervish 13th June 2005 Abstract We introduce a new class of nonstationary covariance functions

More information

On block bootstrapping areal data Introduction

On block bootstrapping areal data Introduction On block bootstrapping areal data Nicholas Nagle Department of Geography University of Colorado UCB 260 Boulder, CO 80309-0260 Telephone: 303-492-4794 Email: nicholas.nagle@colorado.edu Introduction Inference

More information

A short introduction to INLA and R-INLA

A short introduction to INLA and R-INLA A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk

More information

A Multi-resolution Gaussian process model for the analysis of large spatial data sets.

A Multi-resolution Gaussian process model for the analysis of large spatial data sets. 1 2 3 4 A Multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain August 13, 2013 5

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,

More information

Analysis Methods for Supersaturated Design: Some Comparisons

Analysis Methods for Supersaturated Design: Some Comparisons Journal of Data Science 1(2003), 249-260 Analysis Methods for Supersaturated Design: Some Comparisons Runze Li 1 and Dennis K. J. Lin 2 The Pennsylvania State University Abstract: Supersaturated designs

More information

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data An Additive Gaussian Process Approximation for Large Spatio-Temporal Data arxiv:1801.00319v2 [stat.me] 31 Oct 2018 Pulong Ma Statistical and Applied Mathematical Sciences Institute and Duke University

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

VARIANCE ESTIMATION AND KRIGING PREDICTION FOR A CLASS OF NON-STATIONARY SPATIAL MODELS

VARIANCE ESTIMATION AND KRIGING PREDICTION FOR A CLASS OF NON-STATIONARY SPATIAL MODELS Statistica Sinica 25 (2015), 135-149 doi:http://dx.doi.org/10.5705/ss.2013.205w VARIANCE ESTIMATION AND KRIGING PREDICTION FOR A CLASS OF NON-STATIONARY SPATIAL MODELS Shu Yang and Zhengyuan Zhu Iowa State

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013 BSquare: An R package for Bayesian simultaneous quantile regression Luke B Smith and Brian J Reich North Carolina State University May 21, 2013 BSquare in an R package to conduct Bayesian quantile regression

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

A Spatio-Temporal Downscaler for Output From Numerical Models

A Spatio-Temporal Downscaler for Output From Numerical Models Supplementary materials for this article are available at 10.1007/s13253-009-0004-z. A Spatio-Temporal Downscaler for Output From Numerical Models Veronica J. BERROCAL,AlanE.GELFAND, and David M. HOLLAND

More information

Multi-resolution models for large data sets

Multi-resolution models for large data sets Multi-resolution models for large data sets Douglas Nychka, National Center for Atmospheric Research National Science Foundation Iowa State March, 2013 Credits Steve Sain, Tamra Greasby, NCAR Tia LeRud,

More information

Tomoko Matsuo s collaborators

Tomoko Matsuo s collaborators Tomoko Matsuo s collaborators Thermosphere-Ionosphere GCM Modeling A. Richmond (NCAR-HAO) T. Fuller-Rowell and M. Codrescu (NOAA) Polar Ionosphere Data Assimilation A. Richmond, G. Lu, and B. Emery (NCAR-HAO)

More information

Statistica Sinica Preprint No: SS wR2

Statistica Sinica Preprint No: SS wR2 Statistica Sinica Preprint No: SS-13-227wR2 Title A covariance parameter estimation method for polar-orbiting satellite data Manuscript ID SS-13-227wR2 URL http://www.stat.sinica.edu.tw/statistica/ DOI

More information

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts Stan Yip Exeter Climate Systems, University of Exeter c.y.yip@ex.ac.uk Joint work with Sujit Sahu

More information

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression

A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression A Blockwise Descent Algorithm for Group-penalized Multiresponse and Multinomial Regression Noah Simon Jerome Friedman Trevor Hastie November 5, 013 Abstract In this paper we purpose a blockwise descent

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Statistica Sinica (2013): Preprint 1 A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets Ganggang Xu 1, Faming Liang 1 and Marc G. Genton 2 1 Texas A&M University

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models arxiv:1811.03735v1 [math.st] 9 Nov 2018 Lu Zhang UCLA Department of Biostatistics Lu.Zhang@ucla.edu Sudipto Banerjee UCLA

More information

Comparing Non-informative Priors for Estimation and. Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and. Prediction in Spatial Models Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Vigre Semester Report by: Regina Wu Advisor: Cari Kaufman January 31, 2010 1 Introduction Gaussian random fields with specified

More information

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter

More information

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Approaches for Multiple Disease Mapping: MCAR and SANOVA Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

More information

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012 A multi-resolution Gaussian process model for the analysis of large spatial data sets. Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain October 10, 2012 Abstract

More information

A Gaussian state-space model for wind fields in the North-East Atlantic

A Gaussian state-space model for wind fields in the North-East Atlantic A Gaussian state-space model for wind fields in the North-East Atlantic Julie BESSAC - Université de Rennes 1 with Pierre AILLIOT and Valï 1 rie MONBET 2 Juillet 2013 Plan Motivations 1 Motivations 2 Context

More information

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical

More information

Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters

Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters CWP-815 Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters Dave Hale Center for Wave Phenomena, Colorado School of Mines, Golden CO 80401, USA a) b) c) Figure

More information

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage Lingrui Gan, Naveen N. Narisetty, Feng Liang Department of Statistics University of Illinois at Urbana-Champaign Problem Statement

More information

Spatio-temporal modelling of daily air temperature in Catalonia

Spatio-temporal modelling of daily air temperature in Catalonia Spatio-temporal modelling of daily air temperature in Catalonia M. Saez 1,, M.A. Barceló 1,, A. Tobias 3, D. Varga 1,4 and R. Ocaña-Riola 5 1 Research Group on Statistics, Applied Economics and Health

More information

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 STATISTICAL INTERPOLATION METHODS

More information

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS Srinivasan R and Venkatesan P Dept. of Statistics, National Institute for Research Tuberculosis, (Indian Council of Medical Research),

More information

The Bayesian approach to inverse problems

The Bayesian approach to inverse problems The Bayesian approach to inverse problems Youssef Marzouk Department of Aeronautics and Astronautics Center for Computational Engineering Massachusetts Institute of Technology ymarz@mit.edu, http://uqgroup.mit.edu

More information

STA414/2104 Statistical Methods for Machine Learning II

STA414/2104 Statistical Methods for Machine Learning II STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Modelling for Multivariate Spatial Data Hierarchical Modelling for Multivariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Point-referenced spatial data often come as

More information

Single Index Quantile Regression for Heteroscedastic Data

Single Index Quantile Regression for Heteroscedastic Data Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University SMAC, November 6, 2015 E. Christou, M. G. Akritas (PSU) SIQR

More information

Forward Selection and Estimation in High Dimensional Single Index Models

Forward Selection and Estimation in High Dimensional Single Index Models Forward Selection and Estimation in High Dimensional Single Index Models Shikai Luo and Subhashis Ghosal North Carolina State University August 29, 2016 Abstract We propose a new variable selection and

More information

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract Journal of Data Science,17(1). P. 145-160,2019 DOI:10.6339/JDS.201901_17(1).0007 WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION Wei Xiong *, Maozai Tian 2 1 School of Statistics, University of

More information

Permutation-invariant regularization of large covariance matrices. Liza Levina

Permutation-invariant regularization of large covariance matrices. Liza Levina Liza Levina Permutation-invariant covariance regularization 1/42 Permutation-invariant regularization of large covariance matrices Liza Levina Department of Statistics University of Michigan Joint work

More information

Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA

Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA BY MIKYOUNG JUN AND MICHAEL L. STEIN Presented by Sungkyu Jung April, 2009 Outline 1 Introduction 2 Covariance Models 3 Application: Level

More information