A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys

Size: px
Start display at page:

Download "A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys"

Transcription

1 A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys Yong Liu, Yong Chen, and Jiahua Cheng 1873 Liu, Y., Chen, Y., and Cheng, J A comparative study of optimization methods and conventional methods for sampling design in fisheryindependent surveys. ICES Journal of Marine Science, 66: We have introduced and evaluated a procedure, the constrained spatial simulated annealing method, for developing an optimal sampling design for fishery-independent surveys. We used two criterion functions, minimization of the mean of the shortest distance (MMSD) and uniform distribution of point pairs for variogram estimation (WM), and three arrangements of the two criteria, all WM, all MMSD, and a combination of MMSD (2/3 of samples) and WM (1/3), to construct three optimized sampling designs (denoted as Designs I, II, and III, respectively). These three designs were compared in a simulation study with systematic sampling (Design IV) and stratified random sampling designs (Design V), commonly used in fishery-independent surveys. Three levels of sample size (small, medium, and large) were considered in the simulation study developed using a geostatistical approach. The results showed that for parameter estimation of the spatial covariance function, Design III was better than the other designs at relatively small sample size and Design II performed better than the other designs at relatively large sample size. For estimating fish stock abundance, the performance of the designs considered in this study can be ranked as follows: Design II. Design IV. Design III. Design V. Design I. It is clearly important to evaluate and improve sampling design based on historical survey data. Such a study allows us to identify an optimal sampling design to balance the quality of the data collected and the costs of the sampling programme, leading to the development and optimization of a sustainable and fishery-independent monitoring programme. Keywords: fishery-independent survey, geostatistics, optimization method, sampling design, simulation. Received 8 December 2008; accepted 15 April 2009; advance access publication 14 June Y. Liu and J. Cheng: Key and Open Laboratory of Marine and Estuarine Fisheries Certificated by the Ministry of Agriculture, East China Sea Fisheries Institute, Chinese Academy of Fishery Sciences, Shanghai , China. Y. Chen: School of Marine Sciences, University of Maine, Orono, ME 04469, USA. Correspondence to Y. Liu: tel: þ ; fax: þ ; liuyong7707@yahoo.com.cn. Introduction Fishery-independent surveys are critical to fish stock assessment and management, because they provide reliable information about fish stocks at a defined temporal and spatial scale (Jardim and Ribeiro, 2007). The cost of such a survey is often so high that it is important and necessary to optimize the sampling design and maximize the use of data collected for the stock or stocks being investigated (Simmonds and Fryer, 1996). Sampling design affects the methods of data collection and the quality, quantity, and spatial and temporal scales of the data collected. It also greatly influences the choice of methods that can be used for data analyses (Cochran, 1977; Rivoirard et al., 2000; Stein and Ettema, 2003). The sampling design used in fishery-independent surveys can be divided into two groups: (i) the design-based approach used in classical survey and sampling programmes, and (ii) the modelbased approach following primarily geostatistics (Petitgas, 2001; Jardim and Ribeiro, 2007). These two approaches differ greatly in their assumptions about the properties of target populations and sampling locations. The design-based approach assumes that the values of variables to be measured at sampling locations are fixed, and the random selection of sampling locations provides necessary randomization, whereas the model-based approach assumes that the randomization occurring in natural processes is sufficient, and that sampling locations can be fixed (Hansen et al., 1983; Thompson, 1992; Brus and de Gruijter, 1997). Random sampling is a classical survey method, but may have its limitation in fishery-independent surveys because of some unique characteristics of fish populations. For example, fish are likely to aggregate spatially, which does not satisfy the assumption of random distribution of individuals in a survey area for random sampling. Possible spatial correlations between samples often existing in the fishery-independent survey of fish populations may bias variance estimators (Harbitz and Aschan, 2003). Another fact is that the survey design may not be implemented exactly at sea for logistical reasons such as poor weather and untrawlable substrata (Petitgas, 2001). Therefore, a random sampling design (i.e. only spatially random, not temporally random) cannot always be followed strictly. If succeeding stations are too close, sample processing and handling may also be complicated (Harbitz et al., 1998). Recent studies have increasingly focused on spatial statistics for estimating fish stock abundance (Petitgas, 1993; Rivoirard et al., 2000; Grabowski et al., 2005). In theory, when there are spatial correlations between observations and spatial structure can be appropriately considered in modelling, abundance estimates obtained with a systematic sampling design tend to be more precise than # 2009 International Council for the Exploration of the Sea. Published by Oxford Journals. All rights reserved. For Permissions, please journals.permissions@oxfordjournals.org

2 1874 Y. Liu et al. others derived based on random sampling (Cochran, 1977; Rivoirard et al., 2000). The data from non-random survey designs can be analysed using spatial statistics, and the distribution of fisheries resources can be inferred (Petitgas, 2001). A geostatistical design approach involves selecting and optimizing (e.g. minimizing or maximizing) a suitable objective function (Diggle and Lophaven, 2006), developed based on the selected design criterion (e.g. the mean or maximum value of variance in the estimation; van Groenigen et al., 1999). Most approaches assume that all parameters defining the covariance structure of data are known, and the objective is to identify the best design that can minimize the variance in the estimation (McBratney and Webster, 1981; Domburg et al., 1997; van Groenigen et al., 1999; Marchant and Lark, 2007). Such approaches may not be applicable to fisheries because of limited knowledge of the covariance structure of fisheries data, which is essential for these approaches. However, there are other approaches that do not need the information about data covariance structure (Russo, 1984; Warrick and Myers, 1987; Abt et al., 1999; Bogaert and Russo, 1999; Muller and Zimmerman, 1999; Lark, 2002). One such approach is to focus on the goodness of semi-variogram estimation, critical for geostatistical analysis. A conventional procedure for the estimation of semi-variogram model parameters is to group all distances between every pair of points (if considering anisotropy, the direction should be consistent within a group), from which the average distance in each distance group can be derived and used for estimating the parameters. Design criteria focus on the identification of an approach to deal with pairwise samples within each group (e.g. minimizing the variance of the distances; Russo, 1984) to achieve a proper distribution of pairwise samples among all distance groups (Warrick and Myers, 1987). Another type of design is purely geometric, such as favouring a regular spatial distribution of sampling locations within the constraints imposed by a particular application (Royle and Nychka, 1998) or maximizing the spatial spread of sampling locations (van Groenigen and Stein, 1998). The sampling design criteria discussed above have been used in research areas as diverse as mining engineering, soil and crop science, hydrology, and ecology. As far as we know, they have not been used before in fisheries. One of the objectives of this study is to compare the performance of different sampling designs. Suitable measures need to be identified for the comparisons. Ideally, a performance measure should include an evaluation of accuracy, which is the degree of closeness of a measured or calculated quantity to its actual (true) value, and precision, which is the extent to which further measurements or calculations show the same or similar results (Taylor, 1997). In this study, a statistic, the relative standard error (RSE), was used to summarize and compare both the accuracy and precision of parameter estimation for the covariance function (Chen, 1996; Andrew and Chen, 1997). We also used an index, the relative bias (RB), to compare the accuracy of stocksize estimation (Paloheimo and Chen, 1996). Our study aimed at proposing geostatistics-based design criteria for sampling design, comparing them with the conventional sampling designs used in fishery-independent surveys, and identifying an optimal approach for fishery-independent survey design. We developed a constrained spatial simulated annealing (CSSA) method (van Groenigen et al., 1999) for optimizing sampling schemes. We used the following three criteria: (i) minimization of the mean of the shortest distance (MMSD; van Groenigen and Stein, 1998), (ii) uniform distribution of pairwise points for variogram estimation (WM; Warrick and Myers, 1987), and (iii) a combination of MMSD (2/3 of samples) and WM (1/3) to construct three optimized sampling schemes. The above three optimized sampling schemes were compared with the two conventional sampling designs, stratified random sampling and systematic sampling schemes, for estimating sampling parameters and fish abundance in a simulation study. The simulation study includes simulating the fish distribution using parameters estimated from a real fishery-independent survey, which were considered as true values in the simulation, then sampling the true distribution with the five sampling schemes listed above. Using the estimated parameters for the five sampling designs, we evaluated and compared the results in terms of estimated fish abundance and identified an optimal sampling design. The study provides new insights into survey designs for fishery-independent surveys that are critical to fish stock assessment and management. Material and methods Study area and data collection A fishery-independent bottom trawl survey in the East China Sea has been conducted for many years. The survey area covers the area between 27 and 348N and from 120 to 1278E. The survey design follows a systematic sampling design with sampling stations evenly distributed in the survey area with a 30 0 interval between each pair of stations. Based on previous studies (Liu et al., 2004, 2006), two Setipinna taty stocks were identified, a large one in the north and a small one in the south. The northern stock, which is distributed from 27 to 308N and from 120 to 1278E, was used for the simulation study. The original spatial autocorrelation parameters for the stock were calculated from the survey data collected during the period in the survey area. Sampling design Systematic sampling design Our design was consistent with that for the actual survey from which the simulation data were collected. Based on this design, sampling stations were evenly distributed with an equal interval between each pair throughout the survey area. We removed some opportunistic and irregular stations that appeared in the actual surveys and added some stations to fill the survey area defined in the simulation more evenly (Figure 1). The systematic sampling design is referred to as Design IV in this study. Stratified random sampling design This design is widely used in fishery-independent surveys and is supported by classic statistics theory (Cochran, 1977; Manly et al., 2002; Miller et al., 2007). In fishery-independent bottom trawl surveys, strata are often defined based on environmental variables shown to be important in influencing fish distribution, such as depth and geographic area. Sampling efforts were allocated among strata, generally proportional to the stratum area (Chen et al., 2006). The sampling stations in each stratum were assigned randomly (Cochran, 1977). In this simulation study, the survey area was divided into four strata according to depth:,40, 40 60, 60 80, and.80 m (Figure 1). The number of sampling stations in each stratum was determined by both the stratum area and the distance to the coast because an analysis of historical data suggested that both variables were important in determining the spatial distribution of S. taty (Liu et al., 2004, 2006). The stratified random survey design is referred to as Design V in this study.

3 Optimization methods and conventional sampling design in fishery-independent surveys 1875 Figure 1. Illustration of the spatial distribution of sampling stations from the five sampling designs (columns of panels) for the three different levels (rows of panels) of sample sizes (i.e. 40, 82, and 160). Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Optimized sampling design by constraint spatial simulated annealing In terms of the objective functions for optimization, an optimal design can normally be obtained through two steps: defining a criterion, then finding the set of sampling locations that can minimize the objective function defined in the criterion (Diggle and Lophaven, 2006). Here, we used two criteria, the criterion defined by Warrick and Myers (1987; referred to as the WM criterion) and the MMSD criterion (van Groenigen and Stein, 1998). The WM criterion (Warrick and Myers, 1987) aims to obtain an even distribution of the number of coupled points among all classes/groups of distances to achieve the precise variogram estimation. It does not need any assumption about spatial autocorrelation structure, but depends on the distances between sampling points: f WM ðsþ ¼ Xnc i¼1 ½aw i ð f i f i Þ 2 þ bm i Š; ð1þ where S is a set of sampling points, n c the number of distance classes, f the number of coupled stations in a given distance class, f * the expected number of coupled stations in a given distance class, a, b, and w the user-defined weighting coefficients, and M the other possible accountable objections in distance classes. In this study, for simplicity, we considered just a simple situation for which b was 0 and a and w were both set equal to 1. For a particular number of samples (n) and number of distance classes (n c ), the expected number of coupled stations for a uniform distribution is f i nðn 1Þ ¼ : ð2þ 2n c The MMSD criterion (van Groenigen and Stein, 1998) aims for an even distribution of sampling points over the entire survey area. The even distribution can be obtained when the expectation of the distances between a point chosen arbitrarily within the survey area and its nearest sampling point is minimal. With N points of a finely meshed grid as reference points, the criterion is to minimize the average of the distance of all grid points to their nearest sampling points: f MMSD ðsþ ¼ 1 N X N i dðx i ; SÞ; where x is the grid point and d(x, S) is the distance between grid point x and the closest sampling point in sampling set S. The number of grid points should be much larger than the number of sample points to achieve a reliable value of f MMSD (Simbahan and Dobermann, 2006). ð3þ

4 1876 Y. Liu et al. Next, it is necessary to find an effective way to optimize the criterion function. Simulated annealing (SA) is a generic probabilistic approach for finding an approximation to the global optimum of a given objective function f(s) (Kirkpatrick et al., 1983; Simbahan and Dobermann, 2006). The key aspect of this method is to determine whether the new solution f(s iþ1 ), obtained from a random perturbation of one of the variables in S i, can take the place of the previous f(s i ). The core of SA is the Metropolis selection criterion (Metropolis et al., 1953), written as P c ðs i! S iþ1 Þ¼1; if fðs iþ1 ÞfðS i Þ P c ðs i! S iþ1 Þ¼exp fðsiþ1þ fðsiþ c ; if fðs iþ1 Þ. fðs i Þ ; ð4þ where c denotes a positive control parameter, decreased with the optimization progress. If S iþ1 is accepted, it is taken as the starting point for the next solution S iþ2, and the process continues. The positive control parameter c is also referred to as the annealing temperature. There are many cooling schedules to control the value of parameter c. In this study, this parameter is calculated by the following equation: c kþ1 ¼ ac k ; k ¼ 1; 2;...; ð5þ where a is a user-determined parameter (,1 to keep c decreasing), typically having a value slightly,1 (e.g ), and k the number of optimization processes that have been performed. If the SA optimization method is adopted for spatial sampling, it is referred to as spatial SA (SSA; van Groenigen and Stein, 1998). The SSA method was further improved to become a CSSA method. Sampling points were treated as continuous variables rather than chosen from a discrete grid (van Groenigen and Stein, 1998). For the SSA method, the distance from one randomly chosen sampling point to another is between 0 and h max, which is the length of the sampling region. In the modified CSSA method, h max is not always the same and can decrease with progress. In this way, the efficiency of finding a better sampling scheme can be improved greatly. Three sampling designs were then developed based on CSSA. The first was optimized based on the WM criterion (Figure 1; referred to as Design I). The second sampling design was optimized based on the MMSD criterion (Figure 1; Design II). The third sampling design was optimized based on both the WM and MMSD criteria, with a sequential MMSD þ WM optimization strategy (Figure 1; Design III). Two-thirds of the sampling points were optimized according to the MMSD criterion. After the convergence of the MMSD algorithm, the remaining one-third of the points were optimized according to the WM criterion, with the MMSD-optimized sampling points treated as prior information (Simbahan and Dobermann, 2006). For the convenience of comparison among the five designs, the WM and MMSD criterion values were also calculated for Designs IV and V without the procedure of the optimizations. Geostatistical model framework and simulation study A Gaussian stochastic process is widely used to quantify the distribution of geostatistical data. However, the original observations usually cannot satisfy the Gaussian assumption. Therefore, we used a log-gaussian geostatistical model, requiring log-transformation of the original data, a typical Box Cox Gaussian transformation (Box and Cox, 1964). A Gaussian spatial process, {S(x): x[r 2 }, is a stochastic process with the property that for any collection of locations x 1,..., x n with each x i [R 2, the joint distribution of S ¼ fs(x 1 ),..., S(x n )g is a multivariate Gaussian distribution. With mean vector m S, calculated as m(x) ¼ E[S(x)], and covariance matrix G as G ij ¼ g(x i, x j ) ¼ CovfS(x i ), S(x j )g, we have SMVN(m S, G) (Diggle and Ribeiro, 2007). A stationary Gaussian process was applied in this study, and it is defined by two statistics: the mean of every location x, which is a constant m(x) ¼ m, and the covariance depending only on the distance between x and x 0, g(x, x 0 ) ¼ g(h), where h is the Euclidean distance between locations x and x 0. The variance in the stationary process is a constant s 2 ¼ g(0), and the correlation function is defined as r(h) ¼ g(h)/s 2, which we describe with an exponential function. The exponential function can be written as r(h) ¼ exp(2h/f), where f is the correlation range parameter with the property r(h) 0.05 when h ¼ 3f. In practice, there is always another type of independent variance in observed data, which is usually referred to as the nugget effect t 2. In geostatistical terminology, t 2 þ s 2 is the total sill, s 2 the partial sill, t 2 the nugget effect, and 3f the practical range (Isaaks and Srivastava, 1989). Geostatistical parameter estimation The geostatistical multivariate Gaussian model can be written (Diggle and Ribeiro, 2007) as Y NðDb; s 2 RðfÞþt 2 IÞ; where D is an n p matrix of covariates, b the corresponding vector of regression parameters, and R the correlation function, which depends on correlation range parameter f. The product of the two parameters D and b describes a linear specification for the spatial trend m(x). The log-likelihood function is Lðb; t 2 ; s 2 ; fþ ¼ 0:5fn logð2pþþlogfjðs 2 RðfÞþt 2 IÞjg þðy DbÞ T ðs 2 RðfÞþt 2 IÞ 1 ðy DbÞg : Model parameters can be estimated through maximization of the likelihood function. Unconditional simulation A key aspect of simulation is to simulate a realization. Y can be represented by Y ¼ m þ S þ tt, where m ¼ E[Y], T ¼ (T 1,..., T n )is a set of mutually independent random variables following a standard Gaussian distribution N(0,1), and S ¼ (S 1,..., S n ) follow a zero-mean multivariate Gaussian distribution, SMVN(0, P ). The realization of S can be obtained through linear transformation: S ¼ AZ, where A is such a matrix that AA 0 ¼ P, and Z ¼ (Z 1,..., Z n ) follow a standard normal distribution N(0,1). Matrix A can be derived using two methods: Cholesky factorization and singular value decomposition. However, when n becomes large, these two methods become less efficient at simulation. It is necessary to use other methods, such as the ingenious algorithm described by Wood and Chan (1994), circulated embedding in conjunction with fast Fourier transform methods, or a Markov Chain Monte Carlo method (Diggle and Ribeiro, 2007). ð6þ ð7þ

5 Optimization methods and conventional sampling design in fishery-independent surveys 1877 Figure 2. The flowchart of the simulation study. Conditional simulation Conditional simulation is the simulation of a spatial process S(x) at location x i * : i ¼ 1,..., N, conditional on observed values S(x i )at location x i : i ¼ 1,..., n. The conditional distribution of the values of the process S(x) at any set of locations, e.g. S* ¼ fs(x i * ),..., S(x N * )g, given the data Y, follows a multivariate Gaussian distribution with a variance matrix independent of Y (Diggle and Ribeiro, 2007). Simulation procedure The simulation started with the sampling design. As described above, five sampling designs were considered, each with three different numbers of sampling stations: 40, 82, and 160. They are denoted by G N D : D ¼ 1,..., 5, corresponding to Design I,..., Design V, respectively, and N ¼ 1, 2, and 3, corresponding to sample sizes of 40, 82, and 160, respectively. These three sample sizes were used to test the impacts of sample size on the simulation and are referred to as the small, medium, and large sample sizes in this study. The parameters of spatial distribution characteristics of each year were estimated from the survey data of S. taty and denoted as u y : y ¼ 2000,..., In all, 200 unconditional simulation runs were conducted for each set of parameters, with each simulation run having 3685 regulated grid-point realizations within the study area, and were denoted as Y y S : S ¼ 1,..., 200. The next step was to draw a sample from the above simulation realizations according to the sampling designs, and the parameters were estimated for each sample (denoted as eu y SD ). The last step was to simulate 100 runs for the estimated spatial structure parameters with 952 regulated grid-point realizations within the study area and to average the results of 100 simulations to yield the final simulation result, denoted as ~Y y SD. The simulation process is illustrated in Figure 2. Analysis of results The simulation results derived for the five designs were compared using two measures. The first measure, RSE, evaluates the accuracy and precision of estimated parameters for covariance functions by comparing the true values estimated from 8 years of original data with values estimated from the data simulated by sampling from unconditionally simulated data, with different sampling schemes (200 unconditional simulations for each year). The RSE is calculated as qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi P n estimated i¼1ðvi V true Þ 2 =n RSE ¼ V true ; ð8þ where V i estimated is the value of parameters estimated from the ith unconditional simulation (for a given sampling design in a given year), V true the parameter value derived from the original data (for a given year), and n the number of simulation runs. The RSEs of 8 years were pooled to calculate the mean and s.d. for comparing the accuracy and precision of parameter estimates among the five designs for each of the three sample sizes considered. The second measure, RB, evaluates the accuracy of stock abundance estimates by comparing the assumed true value (stock sizes estimated from the unconditional simulation data) with the stock sizes estimated by averaging the 100 estimates of stock size derived from 100 sets of data, i.e. the conditional simulations based on the sampled data from unconditional simulation. We used the RB to compare the accuracy of estimated stock size because the unconditional simulation resulted in different total stock sizes, so the absolute bias between true and estimated values could not be

6 1878 Y. Liu et al. Table 1. Exponential covariance function parameters estimated for Setipinna taty for the survey years Year b s 2 f t The parameters are defined in Equations (6) and (7). compared. The RB is calculated as V estimated V true RB ¼ V true ; ð9þ where V estimated is the average of 100 conditional estimates of stock size for a given sampling design corresponding to an assumed true distribution and V true the assumed true value of stock size derived from the unconditional simulation for a given year. For each year, each unconditional simulation yielded a value of RB, and 200 values were derived in total. Their means were calculated for each year, and the mean and s.d. of the values of 8 years were calculated to compare the difference in accuracy of stockabundance estimates among the five sampling designs under the three different sample sizes considered. R software (R Development Core Team, 2008) was used in this study, with add-on packages geor (Ribeiro and Diggle, 2001) and RandomFields (Schlather, 2001). Results Exponential covariance function parameters for the survey Using 8 years of survey data collected by the East China Sea Fisheries Research Institute, we estimated parameters quantifying the characteristics of spatial distribution for S. taty in the north part of the East China Sea, using Equations (6) and (7) (Table 1). These parameters were used to simulate the fish distribution in the simulation study. Five sampling designs For Design I (WN design), the coefficient of variation for the number of paired sampling stations in different distance classes used in variogram estimation was small, ranging from 0.02 to 0.35%, suggesting a fairly uniform distribution of paired sampling stations in the survey area. However, the WM design was not optimal for sampling because sample stations were clustered (Figure 1), and the evenness of the distribution of sampling stations was the poorest of all the designs, as demonstrated by the largest MMSD criterion values for Design I (Table 2). Design II (MMSD design) led to good and even coverage of the survey area (Figure 1) and had the least average distance of all grid points to their nearest sampling point (f MMSD ) among all the sampling designs for each sample size (Table 2). However, the number of paired sampling stations was small at short-lag intervals. For example, the number of paired sampling stations in the shortest lag class (0 1) identified based on the MMSD sampling design was the least of all the sampling designs for each sample Table 2. Values of the objective function and frequency distribution of paired points in different classes for the five sampling designs. Parameter Design I Design II Design III Design IV Design V Size f MMSD f WM f* Lag Distance Number of paired points in each lag distance class Mean s.d CV (%) Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design.

7 Optimization methods and conventional sampling design in fishery-independent surveys 1879 Table 3. Mean and s.d. of the RSE for the three parameters for covariance function (b, s 2, f) estimated in the simulation for the eight survey years. Design Sample size Estimated parameter I II III IV V 40 b Mean s.d s 2 Mean s.d f Mean s.d b Mean s.d s 2 Mean s.d f Mean s.d b Mean s.d s 2 Mean s.d f Mean s.d The smallest average RSE for each parameter among the five designs is emboldened, and the largest is italicized and underlined. Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3)þWM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. size (Table 2). If the sample size was small, the number of paired sampling stations at short-lag intervals would be too few to estimate the parameters of the variogram precisely. Values of f MMSD for Design III (MMSDþWM design) were higher than those for Design II and were the least of all other designs except for Design IV (Table 2). Moreover, the f WM value was also larger than that for Design I and was the least of all other designs (Table 2). Both criterion values in the split dataset were inferior to their whole dataset counterparts, an expected trade-off because both spatial coverage and modelling of the variogram were included. The f MMSD value of the systematic design (Design IV) was the second smallest, just slightly higher than that for the MMSD design (Design II). However, the stratified random design (Design V) had the second largest value of f MMSD, just smaller than that for the WM design (Design I; Table 2). The f MMSD values for the five designs for all sample sizes had the following order of magnitude, i.e. f MMSD (Design II), f MMSD (Design IV), f MMSD (Design III), f MMSD (Design V), f MMSD (Design I). Comparing the RSEs of estimated covariance function parameters The smallest values of RSE (implying the highest accuracy and precision) were derived from Designs II, III, and IV. The frequency of the smallest values appearing in Designs II, III, and IV was 5 (out of 9), 3 (out of 9), and 1 (out of 9), respectively (Table 3). The largest value appeared in Designs I, II, and IV, and the corresponding frequency was 6 (out of 9), 2 (out of 9), and 1 (out of 9), respectively. Design I seemed to perform poorly at estimating the parameters, with the highest frequency of the largest RSE value (i.e. lowest accuracy and precision in 6 out of 9 times), and never the smallest RSE value (the highest accuracy and precision). Design II seemed to be good at estimating parameters for the large sample size, with three estimates having the highest accuracy and precision, but poor for small sample size, with two estimates having the lowest accuracy and precision. Design III seemed to be good for relatively small sample sizes, with two estimates having the highest accuracy and precision for the small sample size and one for the medium sample size. This design yielded no estimate with the lowest accuracy and precision. Design IV had the highest and lowest accuracy and precision for one estimate each in the medium sample size, and Design V had the estimates with neither the highest nor the lowest accuracy and precision in the simulation. For the small sample size of 40, Design III was good for parameter estimation: two highest accuracy and precision for parameters s 2 and f, respectively, and the second highest accuracy and precision for parameter b just below the highest of Design II. For the medium sample size of 82, Designs II, III, and IV had the highest accuracy and precision, respectively, for s 2, f, and b, but Design IV had low accuracy and precision for f. The estimation accuracy and precision for Designs II and III tended to be similar and better than the other designs. For the large sample size of 160, the best estimation of all parameters was by Design II, and the poorest by Design I. The mean RSE of parameter b tended to decrease with sample size for all designs, except for Design IV. The mean RSE values of parameters s 2 and f also decreased with sample size for all designs except Design I. Comparing the RB of estimated stock sizes Design I yielded the estimates with the largest RB value under all three sample sizes considered, implying that it had the lowest accuracy (Table 4). The estimates for Design II were best when sample sizes were 40 and 160, and second best when sample size was 82. The estimation for Design IV was best when sample size was 82. For the accuracy of stock size estimates, Design II was best, followed by Designs IV, III, V, and I.

8 1880 Y. Liu et al. Table 4. Mean and s.d. of RB of the estimated stock size in the simulation. Sampling design Sample size I II III IV V 40 Mean s.d Mean s.d Mean s.d The smallest average RB for each parameter among five designs is emboldened, and the largest is italicized and underlined. Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3)þWM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Similar to the RSE, the mean RB value of stock size estimates tended to decrease with sample size for each design. Discussion The classical survey design for estimating population abundance is simple random sampling with a good statistical property. For fishery-independent sampling, because of the many unique characteristics of fish stocks (e.g. fish aggregation and migration, and logistic constraints of fisheries surveys such as unsuitable trawling substrata and poor weather), simple random sampling may not be an optimal sampling design for fisheries. Stratified random sampling is often used in fishery-independent surveys. Its design may, however, include some subjectivity such as the choices of variables used to determine strata, the number of strata, and the criteria used to allocate sampling effort among strata. Arbitrary selections also occur in the systematic sampling design. For example, the location of the survey starting point, the interval span of two points, and the location of the endpoint may all be determined arbitrarily. The optimized sampling design (i.e. Designs I, II, and III) proposed in this study tends to be more objective, and the results are more stable. As the original parameters were estimated from the data collected in the survey based on systematic sampling design, one would ask whether this might influence the comparisons, resulting in biases favouring the systematic sampling design. The answer is definitely negative. Here, we only used the parameters to produce the true distribution within the study area for simulation, and it had no relationship with the data derived from systematic sampling because the process of producing the distribution was based on the unconditional simulation method, which is a randomized procedure. Therefore, the parameters estimated from the data collected in systematic sampling should have had no impact on our results. Design III, the combined MMSDþWM-optimized sampling design, estimated the parameters for the covariance function better than other designs when sample size was small. However, Design II, the MMSD-optimized sampling design, performed better than other designs when sample size was large, but poorest when sample size was small. According to Warrick and Myers (1987), the use of the WM criterion in sampling design could enhance the reliability of estimation for the variogram. However, when the WM-optimized sampling design was used, the sampling locations chosen tended to aggregate at two sites, which was similar to the results found in other studies (Warrick and Myers, 1987; Simbahan and Dobermann, 2006). Sampling stations defined based on this design tend to be limited in small areas, and the sampling results may not be a good representation of the fish population in the survey area. Hence, the accuracy and precision of parameter estimation for the WM-optimized sampling scheme is low and became worse with increasing sample size, compared with other designs, because of the absence of an apparent increase in area coverage with increased sample size. This is different from other sampling designs for which an increase in sampling density tends to lead to improved accuracy and precision. The combined MMSDþWM-optimized sampling scheme (i.e. Design III), however, performs well because it combines the advantages of both spatial coverage and variogram estimation. Design III only performed better than other designs when sample size was small; when sample size was large, the performance of Design II was the best. On the other hand, Design II gave a poor performance in estimating parameters for a small sample size. For a small sample size, the short paired distances were too few to yield an accurate estimate for the short distance classes of the variogram, resulting in poor estimates. With an increase in sample size, the number of short paired distances increases, improving the parameter estimation precision. In addition, the sampling site distribution of Design II is more evenly distributed than that of other designs (see f MMSD values in Table 2). This may be the reason for Design II performing better than other designs when sample size was large. With a set of well-estimated parameters, we can derive a good prediction of the spatial distribution and population abundance of a fish stock, which is often the central task of a fishery-independent survey programme and provides crucial information for fisheries management. Our study shows that the performance of five sampling designs in estimating stock size has the following ranking: Design II. Design IV. Design III. Design V. Design I, coinciding with the order of MMSD criterion values for sampling designs. This implies a tendency for increasing accuracy with evenness of sampling station spatial distribution. This tendency is in general agreement with that found in other studies. Cochran (1977) reported that, based on a few examples, systematic surveys were more accurate than stratified random surveys. Ripley (1981) considered the error variance of the sample mean for a stationary isotropic random process and showed that systematic sampling performed better than stratified random sampling and that both stratified random and systematic surveys performed better than a simple random survey. Simmonds and Fryer (1996) compared eight sampling strategies from a series of simulations and found that more-precise estimates of the surface mean could be obtained using stratified random or systematic sampling than simple random sampling. Jardim and Ribeiro (2007) compared five designs constructed by an informal method with the actually used stratified random design and found that, except for the high density design used as a benchmark, the design combining regular (i.e. systematic sampling design) and stratified random design performed best, which resulted in more evenly distributed sampling stations in the survey area than the other designs. One exception to the above tendency was at a medium sample size. For a medium sample size, Design IV was better than Design II, with the other designs following the same order in ranking of their performance.

9 Optimization methods and conventional sampling design in fishery-independent surveys 1881 Table 5. Rates of changes in accuracy [described by ACR calculated in Equation (10)] for the estimation of parameters of covariance function and stock size with increased sample sizes for the five sampling designs. Sampling design Estimated object Changes in sample size I II III IV V Parameter b E E E E E E E E E E204 Parameter s E E E E E E E E E E204 Parameter f E E E E E E E E E E203 Abundance E E E E E E E E E E203 Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Comparing the accuracy of stock-size estimates with the accuracy and precision of parameter estimates, it seems that the accuracy of stock-size estimation is determined by the accuracy and precision of parameter b estimation, rather than by the accuracy and precision of other parameters. For example, for a small sample size, although Design II had the lowest accuracy and precision of the other two parameters, it still yielded the greatest accuracy in terms of stock-size estimation, with only its best estimate of parameter b. For the medium sample size, Design IV yielded the highest accuracy and precision of parameter b and the highest accuracy of stock size, although it had the poorest estimate for another parameter. Muller (2001) pointed out that those ideal designs for estimating covariance parameters of the stochastic process were not the same as for the best estimate of the value of the stochastic process in a given location or in estimating overall abundance. Our findings partially agree with Muller s conclusion, in that the accuracy of abundance estimation does not depend on all covariance parameters, but is mainly determined by the key covariance parameter b. This result is not difficult to understand, because parameter b is the mean component of the Gaussian random field model (see Material and methods section above), or the spatial trend which is directly related to abundance estimates. An interesting question is whether we can set parameter b as the objective in the estimation instead of stock abundance. We can then just focus on the covariance parameter estimation. This, however, needs further study. Accuracy and precision of parameter estimation and accuracy of abundance estimation both increased with sample size, which is consistent with the findings of many other studies (Simbahan and Dobermann, 2006; Jardim and Ribeiro, 2007). Jardim and Ribeiro (2007) found that in the presence of spatial correlation, an increase in sample size may not provide a proportional increase in the quantity of information because of the partial redundancy of information resulting from the spatial correlation. We calculated an accuracy changing rate (ACR), which is the change in the performance measure statistics (i.e. RSE and RB) for the parameters or abundance estimation divided by the corresponding change in sample size: ACR ¼ðM size1 M size2 Þ=ðSize2 Size1Þ; ð10þ where Size1 and Size2 are two different sample sizes and M size1 and M size2 are corresponding statistics. The results are listed in Table 5. Most of the changes in the RSE and RB when sample sizes increased from 82 to 160 were smaller than when sample size increased from 40 to 82. This is consistent with the results of Jardim and Ribeiro (2007). Some interesting questions can be asked, such as whether there is a sample size at which an additional increase in the number of sampling stations cannot significantly improve the estimation accuracy, and how one can identify a cost-effective sample size that balances the costs of sampling and the accuracy of estimations. A simulation study similar to this can be used to address such questions. This study took a pelagic fish species as an example. Whether the results would be different for a fish species with different annual variations and/or life history can be addressed readily using the simulation framework developed here. The study was based on 8 years of survey data. In practice, however, one may not need that many years of data, and a few recent years of data may be sufficient because information from the most recent years is more relevant to the current characteristics of the targeted fish populations. However, if there is no historical information, such a study is impossible. Based on historical survey data, such a study allows one to improve current survey design, balancing the quality of the data collected and the costs of the sampling programme, leading to the development of a sustainable and fishery-independent monitoring programme. Acknowledgements Thanks are due to all scientific staff and crew for their assistance in collecting data during the surveys. The work was conducted under the auspices of the National Science and Technology Support Plan (2007BAD43B01), the Chinese Ministry of Agriculture Assessment of Marine Fisheries Resources Programme, and the Ministry of Science and Technology Public Project (2006/2007). Support from the East China Sea Fisheries Research Institute and the University of Maine is appreciated greatly. References Abt, M., Welch, W. J., and Sacks, J Design and analysis for modeling and predicting spatial contamination. Mathematical Geology, 31: Andrew, N., and Chen, Y Optimal sampling for estimating the size structure and mean size of individuals caught in an abalone fishery. Fishery Bulletin US, 95:

10 1882 Y. Liu et al. Bogaert, P., and Russo, D Optimal spatial sampling design for the estimation of the variogram based on a least squares approach. Water Resources Research, 35: Box, G. E., and Cox, D. R An analysis of transformations. Journal of the Royal Statistical Society, Series B (Methodological), 26: Brus, D. J., and de Gruijter, J Random sampling or geostatistical modelling? Choosing between design-based and model-based sampling strategies for soil (with discussion). Geoderma, 80: Chen, Y A Monte Carlo study on impacts of the size of subsample catch on estimation of fish stock parameters. Fisheries Research, 26: Chen, Y., Sherman, S., Wilson, C., Sowles, J., and Kanaiwa, M A comparison of two fishery-independent survey programs used to define the population structure of American lobster (Homarus americanus) in the Gulf of Maine. Fishery Bulletin US, 104: Cochran, W. G Sampling Techniques, 3rd edn. John Wiley and Sons, New York. 900 pp. Diggle, P. J., and Lophaven, S. O Bayesian geostatistical design. Scandinavian Journal of Statistics, 33: Diggle, P. J., and Ribeiro, P. J Model-based Geostatistics. Springer, New York. 230 pp. Domburg, P., de Gruijter, J. J., and van Beek, P Designing efficient soil survey schemes with a knowledge-based system using dynamic programming. Geoderma, 75: Grabowski, R., Windholz, T., and Chen, T Estimating exploitable stock biomass for the Maine green sea urchin (Strongylocentrotus droebachiensis) fishery using a spatial statistics approach. Fishery Bulletin US, 103: Hansen, M. H., Madow, W. G., and Tepping, B. J An evaluation of model-dependent and probability-sampling inferences in sample surveys. Journal of the American Statistical Association, 78: Harbitz, A., and Aschan, M A two-dimensional geostatistic method to simulate the precision of abundance estimates. Canadian Journal of Fisheries and Aquatic Sciences, 60: Harbitz, A., Aschan, M., and Sunnanå, K Optimal effort allocation in stratified, large area trawl surveys, with application to shrimp surveys in the Barents Sea. Fisheries Research, 37: Isaaks, E. H., and Srivastava, R. M An Introduction to Applied Geostatistics. Oxford University Press, New York. 561 pp. Jardim, E., and Ribeiro, J. P Geostatistical assessment of sampling designs for Portuguese bottom trawl surveys. Fisheries Research, 85: Kirkpatrick, S., Gelatt, C. D., and Vecchi, M. P Optimization by simulated annealing. Science, 220: Lark, R. M Optimized spatial sampling of soil for estimation of the variogram by maximum likelihood. Geoderma, 105: Liu, Y., Cheng, J., and Chen, X Studies on the seasonal distribution of Setipinna taty in the East China Sea. Marine Fisheries Research, 27: 1 6. Liu, Y., Cheng, J., and Li, S A study on the distribution of Setipinna taty in the East China Sea. Marine Fisheries, 26: Manly, B. F., Akroyd, J. A., and Walshe, K. A Two-phase stratified random surveys on multiple populations at multiple locations. New Zealand Journal of Marine and Freshwater Research, 36: Marchant, B., and Lark, R Optimized sample schemes for geostatistical surveys. Mathematical Geology, 39: McBratney, A. B., and Webster, R The design of optimal sampling schemes for local estimation and mapping of regionalized variables. 2. Program and examples. Computers and Geosciences, 7: Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21: Miller, T. J., Skalski, J. R., and Ianelli, J. N Optimizing a stratified sampling design when faced with multiple objectives. ICES Journal of Marine Science, 64: Muller, W. G Collecting Spatial Data: Optimum Design of Experiments for Random Fields, 2nd edn. Contributions to Statistics. Physica, Heidelberg. 196 pp. Muller, W. G., and Zimmerman, D. L Optimal designs for variogram estimation. Environmetrics, 10: Paloheimo, J. E., and Chen, Y Estimating fishing mortality and cohort sizes. Canadian Journal of Fisheries and Aquatic Sciences, 53: Petitgas, P Geostatistics for fish stock assessments: a review and an acoustic application. ICES Journal of Marine Science, 50: Petitgas, P Geostatistics in fisheries survey design and stock assessment: models, variances and applications. Fish and Fisheries, 2: R Development Core Team R: a Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN , Ribeiro, P. J., and Diggle, P. J geor: a package for geostatistical analysis. R News, 1: Ripley, B. D Spatial Statistics. John Wiley and Sons, New York. 252 pp. Rivoirard, J., Simmonds, E. J., Foote, K. G., Fernandes, P., and Bez, N Geostatistics for Estimating Fish Abundance. Blackwell Science, Oxford. 206 pp. Royle, J. A., and Nychka, D An algorithm for the construction of spatial coverage designs with implementation in SPLUS. Computers and Geosciences, 24: Russo, D Design of an optimal sampling network for estimating the variogram. Soil Science Society of America Journal, 48: Schlather, M Simulation and analysis of random fields. R News, 1: Simbahan, G. C., and Dobermann, A Sampling optimization based on secondary information and its utilization in soil carbon mapping. Geoderma, 133: Simmonds, E. J., and Fryer, R. J Which are better, random or systematic acoustic surveys? A simulation using North Sea herring as an example. ICES Journal of Marine Science, 53: Stein, A., and Ettema, C An overview of spatial sampling procedures and experimental design of spatial studies for ecosystem comparisons. Agriculture, Ecosystems and Environment, 94: Taylor, J. R An Introduction to Error Analysis: the Study of Uncertainties in Physical Measurements, 2nd edn. University Science Books, Sausalito, CA, USA. 327 pp. Thompson, S. K Sampling. John Wiley and Sons, New York. 343 pp. van Groenigen, J., Siderius, W., and Stein, A Constrained optimisation of soil sampling for minimisation of the kriging variance. Geoderma, 87: van Groenigen, J. W., and Stein, A Constrained optimization of spatial sampling using continuous simulated annealing. Journal of Environmental Quality, 27: Warrick, A. W., and Myers, D. E Optimization of sampling locations for variogram calculations. Water Resources Research, 23: Wood, A. T., and Chan, G Simulation of stationary Gaussian processes in [0,1] d. Journal of Computational and Graphical Statistics, 3: doi: /icesjms/fsp157

Bayesian Geostatistical Design

Bayesian Geostatistical Design Johns Hopkins University, Dept. of Biostatistics Working Papers 6-5-24 Bayesian Geostatistical Design Peter J. Diggle Medical Statistics Unit, Lancaster University, UK & Department of Biostatistics, Johns

More information

Spatial Sampling. Dr. Eric M. Delmelle. Initial submission: February , revised version May for Handbook of Regional Science

Spatial Sampling. Dr. Eric M. Delmelle. Initial submission: February , revised version May for Handbook of Regional Science Spatial Sampling Dr. Eric M. Delmelle Initial submission: February 18 2012, revised version May 7 2012 for Handbook of Regional Science Department of Geography and Earth Sciences, University of North Carolina

More information

A MultiGaussian Approach to Assess Block Grade Uncertainty

A MultiGaussian Approach to Assess Block Grade Uncertainty A MultiGaussian Approach to Assess Block Grade Uncertainty Julián M. Ortiz 1, Oy Leuangthong 2, and Clayton V. Deutsch 2 1 Department of Mining Engineering, University of Chile 2 Department of Civil &

More information

Spatial Data Mining. Regression and Classification Techniques

Spatial Data Mining. Regression and Classification Techniques Spatial Data Mining Regression and Classification Techniques 1 Spatial Regression and Classisfication Discrete class labels (left) vs. continues quantities (right) measured at locations (2D for geographic

More information

arxiv: v1 [stat.me] 24 May 2010

arxiv: v1 [stat.me] 24 May 2010 The role of the nugget term in the Gaussian process method Andrey Pepelyshev arxiv:1005.4385v1 [stat.me] 24 May 2010 Abstract The maximum likelihood estimate of the correlation parameter of a Gaussian

More information

Building Blocks for Direct Sequential Simulation on Unstructured Grids

Building Blocks for Direct Sequential Simulation on Unstructured Grids Building Blocks for Direct Sequential Simulation on Unstructured Grids Abstract M. J. Pyrcz (mpyrcz@ualberta.ca) and C. V. Deutsch (cdeutsch@ualberta.ca) University of Alberta, Edmonton, Alberta, CANADA

More information

Transiogram: A spatial relationship measure for categorical data

Transiogram: A spatial relationship measure for categorical data International Journal of Geographical Information Science Vol. 20, No. 6, July 2006, 693 699 Technical Note Transiogram: A spatial relationship measure for categorical data WEIDONG LI* Department of Geography,

More information

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland EnviroInfo 2004 (Geneva) Sh@ring EnviroInfo 2004 Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland Mikhail Kanevski 1, Michel Maignan 1

More information

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram

More information

On dealing with spatially correlated residuals in remote sensing and GIS

On dealing with spatially correlated residuals in remote sensing and GIS On dealing with spatially correlated residuals in remote sensing and GIS Nicholas A. S. Hamm 1, Peter M. Atkinson and Edward J. Milton 3 School of Geography University of Southampton Southampton SO17 3AT

More information

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS Srinivasan R and Venkatesan P Dept. of Statistics, National Institute for Research Tuberculosis, (Indian Council of Medical Research),

More information

STATISTICAL GEOCOMPUTING: SPATIAL OUTLIER DETECTION IN PRECISION AGRICULTURE

STATISTICAL GEOCOMPUTING: SPATIAL OUTLIER DETECTION IN PRECISION AGRICULTURE STATISTICAL GEOCOMPUTING: SPATIAL OUTLIER DETECTION IN PRECISION AGRICULTURE PETER CHU SU Department of Geography and Environmental Management, University of Waterloo, Waterloo, Ontario, Canada ABSTRACT

More information

Monte Carlo Simulation. CWR 6536 Stochastic Subsurface Hydrology

Monte Carlo Simulation. CWR 6536 Stochastic Subsurface Hydrology Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology Steps in Monte Carlo Simulation Create input sample space with known distribution, e.g. ensemble of all possible combinations of v, D, q,

More information

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA D. Pokrajac Center for Information Science and Technology Temple University Philadelphia, Pennsylvania A. Lazarevic Computer

More information

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho Final Report: Forecasting Rangeland Condition with GIS in Southeastern Idaho Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho Jacob T. Tibbitts, Idaho State

More information

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian Nonparametric Regression for Diabetes Deaths Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,

More information

Beta-Binomial Kriging: An Improved Model for Spatial Rates

Beta-Binomial Kriging: An Improved Model for Spatial Rates Available online at www.sciencedirect.com ScienceDirect Procedia Environmental Sciences 27 (2015 ) 30 37 Spatial Statistics 2015: Emerging Patterns - Part 2 Beta-Binomial Kriging: An Improved Model for

More information

Synthesis and Integrated Modeling of Long-term Data Sets to Support Fisheries and Hypoxia Management in the Northern Gulf of Mexico

Synthesis and Integrated Modeling of Long-term Data Sets to Support Fisheries and Hypoxia Management in the Northern Gulf of Mexico Synthesis and Integrated Modeling of Long-term Data Sets to Support Fisheries and Hypoxia Management in the Northern Gulf of Mexico Dan Obenour (Scientific/Hypoxia PI) Kevin Craig (Applications/Fisheries

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate

More information

Estimation of direction of increase of gold mineralisation using pair-copulas

Estimation of direction of increase of gold mineralisation using pair-copulas 22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Estimation of direction of increase of gold mineralisation using pair-copulas

More information

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and

More information

Large Scale Modeling by Bayesian Updating Techniques

Large Scale Modeling by Bayesian Updating Techniques Large Scale Modeling by Bayesian Updating Techniques Weishan Ren Centre for Computational Geostatistics Department of Civil and Environmental Engineering University of Alberta Large scale models are useful

More information

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN: Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying

More information

4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations

4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations Reliability enhancement of groundwater estimations Zoltán Zsolt Fehér 1,2, János Rakonczai 1, 1 Institute of Geoscience, University of Szeged, H-6722 Szeged, Hungary, 2 e-mail: zzfeher@geo.u-szeged.hu

More information

Advances in Locally Varying Anisotropy With MDS

Advances in Locally Varying Anisotropy With MDS Paper 102, CCG Annual Report 11, 2009 ( 2009) Advances in Locally Varying Anisotropy With MDS J.B. Boisvert and C. V. Deutsch Often, geology displays non-linear features such as veins, channels or folds/faults

More information

Serial No. N4567 NAFO SCR Doc. 01/178. SCIENTIFIC COUNCIL MEETING November 2001

Serial No. N4567 NAFO SCR Doc. 01/178. SCIENTIFIC COUNCIL MEETING November 2001 NOT TO BE CITED WITHOUT PRIOR REFERENCE TO THE AUTHOR(S) Northwest Atlantic Fisheries Organization Serial No. N4567 NAFO SCR Doc. 01/178 SCIENTIFIC COUNCIL MEETING November 01 Geographical Distribution

More information

Bryan F.J. Manly and Andrew Merrill Western EcoSystems Technology Inc. Laramie and Cheyenne, Wyoming. Contents. 1. Introduction...

Bryan F.J. Manly and Andrew Merrill Western EcoSystems Technology Inc. Laramie and Cheyenne, Wyoming. Contents. 1. Introduction... Comments on Statistical Aspects of the U.S. Fish and Wildlife Service's Modeling Framework for the Proposed Revision of Critical Habitat for the Northern Spotted Owl. Bryan F.J. Manly and Andrew Merrill

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

INTEGRATING DIVERSE CALIBRATION PRODUCTS TO IMPROVE SEISMIC LOCATION

INTEGRATING DIVERSE CALIBRATION PRODUCTS TO IMPROVE SEISMIC LOCATION INTEGRATING DIVERSE CALIBRATION PRODUCTS TO IMPROVE SEISMIC LOCATION ABSTRACT Craig A. Schultz, Steven C. Myers, Jennifer L. Swenson, Megan P. Flanagan, Michael E. Pasyanos, and Joydeep Bhattacharyya Lawrence

More information

Inferring biological dynamics Iterated filtering (IF)

Inferring biological dynamics Iterated filtering (IF) Inferring biological dynamics 101 3. Iterated filtering (IF) IF originated in 2006 [6]. For plug-and-play likelihood-based inference on POMP models, there are not many alternatives. Directly estimating

More information

ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes

ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes Sara-Jane Moore www.marine.ie General Statistics - backed up by case studies General Introduction to sampling

More information

Relevance Vector Machines for Earthquake Response Spectra

Relevance Vector Machines for Earthquake Response Spectra 2012 2011 American American Transactions Transactions on on Engineering Engineering & Applied Applied Sciences Sciences. American Transactions on Engineering & Applied Sciences http://tuengr.com/ateas

More information

Geostatistics: Kriging

Geostatistics: Kriging Geostatistics: Kriging 8.10.2015 Konetekniikka 1, Otakaari 4, 150 10-12 Rangsima Sunila, D.Sc. Background What is Geostatitics Concepts Variogram: experimental, theoretical Anisotropy, Isotropy Lag, Sill,

More information

Optimizing Thresholds in Truncated Pluri-Gaussian Simulation

Optimizing Thresholds in Truncated Pluri-Gaussian Simulation Optimizing Thresholds in Truncated Pluri-Gaussian Simulation Samaneh Sadeghi and Jeff B. Boisvert Truncated pluri-gaussian simulation (TPGS) is an extension of truncated Gaussian simulation. This method

More information

Toward an automatic real-time mapping system for radiation hazards

Toward an automatic real-time mapping system for radiation hazards Toward an automatic real-time mapping system for radiation hazards Paul H. Hiemstra 1, Edzer J. Pebesma 2, Chris J.W. Twenhöfel 3, Gerard B.M. Heuvelink 4 1 Faculty of Geosciences / University of Utrecht

More information

Quantifying uncertainty of geological 3D layer models, constructed with a-priori

Quantifying uncertainty of geological 3D layer models, constructed with a-priori Quantifying uncertainty of geological 3D layer models, constructed with a-priori geological expertise Jan Gunnink, Denise Maljers 2 and Jan Hummelman 2, TNO Built Environment and Geosciences Geological

More information

Spatial Sampling. Eric Delmelle INTRODUCTION

Spatial Sampling. Eric Delmelle INTRODUCTION 10 Spatial Sampling Eric Delmelle 10.1. INTRODUCTION When trying to make inferences about a phenomenon, we are forced to collect a limited number of samples instead of trying to acquire information at

More information

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced

More information

Automatic Determination of Uncertainty versus Data Density

Automatic Determination of Uncertainty versus Data Density Automatic Determination of Uncertainty versus Data Density Brandon Wilde and Clayton V. Deutsch It is useful to know how various measures of uncertainty respond to changes in data density. Calculating

More information

A Covariance Conversion Approach of Gamma Random Field Simulation

A Covariance Conversion Approach of Gamma Random Field Simulation Proceedings of the 8th International Symposium on Spatial Accuracy Assessment in Natural Resources and Environmental Sciences Shanghai, P. R. China, June 5-7, 008, pp. 4-45 A Covariance Conversion Approach

More information

Applications of Randomized Methods for Decomposing and Simulating from Large Covariance Matrices

Applications of Randomized Methods for Decomposing and Simulating from Large Covariance Matrices Applications of Randomized Methods for Decomposing and Simulating from Large Covariance Matrices Vahid Dehdari and Clayton V. Deutsch Geostatistical modeling involves many variables and many locations.

More information

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,

More information

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio Applied Spatial Data Analysis with R 4:1 Springer Contents Preface VII 1 Hello World: Introducing Spatial Data 1 1.1 Applied Spatial Data Analysis

More information

The Proportional Effect of Spatial Variables

The Proportional Effect of Spatial Variables The Proportional Effect of Spatial Variables J. G. Manchuk, O. Leuangthong and C. V. Deutsch Centre for Computational Geostatistics, Department of Civil and Environmental Engineering University of Alberta

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION G. MARIETHOZ, PH. RENARD, R. FROIDEVAUX 2. CHYN, University of Neuchâtel, rue Emile Argand, CH - 2009 Neuchâtel, Switzerland 2 FSS Consultants, 9,

More information

Hybrid Optimal Design of the Eco-Hydrological Wireless Sensor Network in the Middle Reach of the Heihe River Basin, China

Hybrid Optimal Design of the Eco-Hydrological Wireless Sensor Network in the Middle Reach of the Heihe River Basin, China Sensors 2014, 14, 19095-19114; doi:10.3390/s141019095 Article OPEN ACCESS sensors ISSN 1424-8220 www.mdpi.com/journal/sensors Hybrid Optimal Design of the Eco-Hydrological Wireless Sensor Network in the

More information

High-dimensional regression modeling

High-dimensional regression modeling High-dimensional regression modeling David Causeur Department of Statistics and Computer Science Agrocampus Ouest IRMAR CNRS UMR 6625 http://www.agrocampus-ouest.fr/math/causeur/ Course objectives Making

More information

Influence of incorporating geometric anisotropy on the construction of thematic maps of simulated data and chemical attributes of soil

Influence of incorporating geometric anisotropy on the construction of thematic maps of simulated data and chemical attributes of soil RESEARCH Influence of incorporating geometric anisotropy on the construction of thematic maps of simulated data and chemical attributes of soil Luciana Pagliosa Carvalho Guedes 1*, Miguel Angel Uribe-Opazo

More information

Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation

Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation Sebastian F. Santibanez Urban4M - Humboldt University of Berlin / Department of Geography 135

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

Introduction. Chapter 1

Introduction. Chapter 1 Chapter 1 Introduction In this book we will be concerned with supervised learning, which is the problem of learning input-output mappings from empirical data (the training dataset). Depending on the characteristics

More information

Bayesian Transgaussian Kriging

Bayesian Transgaussian Kriging 1 Bayesian Transgaussian Kriging Hannes Müller Institut für Statistik University of Klagenfurt 9020 Austria Keywords: Kriging, Bayesian statistics AMS: 62H11,60G90 Abstract In geostatistics a widely used

More information

Pooling multiple imputations when the sample happens to be the population.

Pooling multiple imputations when the sample happens to be the population. Pooling multiple imputations when the sample happens to be the population. Gerko Vink 1,2, and Stef van Buuren 1,3 arxiv:1409.8542v1 [math.st] 30 Sep 2014 1 Department of Methodology and Statistics, Utrecht

More information

Correcting Variogram Reproduction of P-Field Simulation

Correcting Variogram Reproduction of P-Field Simulation Correcting Variogram Reproduction of P-Field Simulation Julián M. Ortiz (jmo1@ualberta.ca) Department of Civil & Environmental Engineering University of Alberta Abstract Probability field simulation is

More information

A Short Note on the Proportional Effect and Direct Sequential Simulation

A Short Note on the Proportional Effect and Direct Sequential Simulation A Short Note on the Proportional Effect and Direct Sequential Simulation Abstract B. Oz (boz@ualberta.ca) and C. V. Deutsch (cdeutsch@ualberta.ca) University of Alberta, Edmonton, Alberta, CANADA Direct

More information

Development of Stochastic Artificial Neural Networks for Hydrological Prediction

Development of Stochastic Artificial Neural Networks for Hydrological Prediction Development of Stochastic Artificial Neural Networks for Hydrological Prediction G. B. Kingston, M. F. Lambert and H. R. Maier Centre for Applied Modelling in Water Engineering, School of Civil and Environmental

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Geostatistics for Seismic Data Integration in Earth Models

Geostatistics for Seismic Data Integration in Earth Models 2003 Distinguished Instructor Short Course Distinguished Instructor Series, No. 6 sponsored by the Society of Exploration Geophysicists European Association of Geoscientists & Engineers SUB Gottingen 7

More information

Spatiotemporal Analysis of Environmental Radiation in Korea

Spatiotemporal Analysis of Environmental Radiation in Korea WM 0 Conference, February 25 - March, 200, Tucson, AZ Spatiotemporal Analysis of Environmental Radiation in Korea J.Y. Kim, B.C. Lee FNC Technology Co., Ltd. Main Bldg. 56, Seoul National University Research

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

Optimizing the design of acoustic surveys of Peruvian anchoveta

Optimizing the design of acoustic surveys of Peruvian anchoveta ICES Journal of Marine Science Advance Access published April 30, 2009 Page 1 of 8 Optimizing the design of acoustic surveys of Peruvian anchoveta E. John Simmonds, Mariano Gutiérrez, Andres Chipollini,

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Appendix: Modeling Approach

Appendix: Modeling Approach AFFECTIVE PRIMACY IN INTRAORGANIZATIONAL TASK NETWORKS Appendix: Modeling Approach There is now a significant and developing literature on Bayesian methods in social network analysis. See, for instance,

More information

Entropy of Gaussian Random Functions and Consequences in Geostatistics

Entropy of Gaussian Random Functions and Consequences in Geostatistics Entropy of Gaussian Random Functions and Consequences in Geostatistics Paula Larrondo (larrondo@ualberta.ca) Department of Civil & Environmental Engineering University of Alberta Abstract Sequential Gaussian

More information

Conditional Distribution Fitting of High Dimensional Stationary Data

Conditional Distribution Fitting of High Dimensional Stationary Data Conditional Distribution Fitting of High Dimensional Stationary Data Miguel Cuba and Oy Leuangthong The second order stationary assumption implies the spatial variability defined by the variogram is constant

More information

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Peter M. Aronow and Cyrus Samii Forthcoming at Survey Methodology Abstract We consider conservative variance

More information

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract Statistical Models for Monitoring and Regulating Ground-level Ozone Eric Gilleland 1 and Douglas Nychka 2 Abstract The application of statistical techniques to environmental problems often involves a tradeoff

More information

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd.

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd. .6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics Spline interpolation was originally developed or image processing. In GIS, it is mainly used in visualization o spatial

More information

An Alternative SSASPM Stock Assessment of Gulf of Mexico Vermilion Snapper that Incorporates the Recent Decline in Shrimp Effort.

An Alternative SSASPM Stock Assessment of Gulf of Mexico Vermilion Snapper that Incorporates the Recent Decline in Shrimp Effort. An Alternative SSASPM Stock Assessment of Gulf of Mexico Vermilion Snapper that Incorporates the Recent Decline in Shrimp Effort. Brian Linton, Shannon L. Cass-Calay and Clay E. Porch NOAA Fisheries Southeast

More information

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation

More information

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION G. MARIETHOZ, PH. RENARD, R. FROIDEVAUX 2. CHYN, University of Neuchâtel, rue Emile Argand, CH - 2009 Neuchâtel, Switzerland 2 FSS Consultants, 9,

More information

Advanced Introduction to Machine Learning

Advanced Introduction to Machine Learning 10-715 Advanced Introduction to Machine Learning Homework 3 Due Nov 12, 10.30 am Rules 1. Homework is due on the due date at 10.30 am. Please hand over your homework at the beginning of class. Please see

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

New Zealand Fisheries Assessment Report 2017/26. June J. Roberts A. Dunn. ISSN (online) ISBN (online)

New Zealand Fisheries Assessment Report 2017/26. June J. Roberts A. Dunn. ISSN (online) ISBN (online) Investigation of alternative model structures for the estimation of natural mortality in the Campbell Island Rise southern blue whiting (Micromesistius australis) stock assessment (SBW 6I) New Zealand

More information

A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling

A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling G. B. Kingston, H. R. Maier and M. F. Lambert Centre for Applied Modelling in Water Engineering, School

More information

POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE

POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE CO-282 POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE KYRIAKIDIS P. University of California Santa Barbara, MYTILENE, GREECE ABSTRACT Cartographic areal interpolation

More information

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time Space-time data Observations taken over space and over time Z(s, t): indexed by space, s, and time, t Here, consider geostatistical/time data Z(s, t) exists for all locations and all times May consider

More information

7 Geostatistics. Figure 7.1 Focus of geostatistics

7 Geostatistics. Figure 7.1 Focus of geostatistics 7 Geostatistics 7.1 Introduction Geostatistics is the part of statistics that is concerned with geo-referenced data, i.e. data that are linked to spatial coordinates. To describe the spatial variation

More information

5. Simulated Annealing 5.1 Basic Concepts. Fall 2010 Instructor: Dr. Masoud Yaghini

5. Simulated Annealing 5.1 Basic Concepts. Fall 2010 Instructor: Dr. Masoud Yaghini 5. Simulated Annealing 5.1 Basic Concepts Fall 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Real Annealing and Simulated Annealing Metropolis Algorithm Template of SA A Simple Example References

More information

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Improving Spatial Data Interoperability

Improving Spatial Data Interoperability Improving Spatial Data Interoperability A Framework for Geostatistical Support-To To-Support Interpolation Michael F. Goodchild, Phaedon C. Kyriakidis, Philipp Schneider, Matt Rice, Qingfeng Guan, Jordan

More information

Comparison of rainfall distribution method

Comparison of rainfall distribution method Team 6 Comparison of rainfall distribution method In this section different methods of rainfall distribution are compared. METEO-France is the French meteorological agency, a public administrative institution

More information

Geostatistics for Gaussian processes

Geostatistics for Gaussian processes Introduction Geostatistical Model Covariance structure Cokriging Conclusion Geostatistics for Gaussian processes Hans Wackernagel Geostatistics group MINES ParisTech http://hans.wackernagel.free.fr Kernels

More information

The ProbForecastGOP Package

The ProbForecastGOP Package The ProbForecastGOP Package April 24, 2006 Title Probabilistic Weather Field Forecast using the GOP method Version 1.3 Author Yulia Gel, Adrian E. Raftery, Tilmann Gneiting, Veronica J. Berrocal Description

More information

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:

More information

Coregionalization by Linear Combination of Nonorthogonal Components 1

Coregionalization by Linear Combination of Nonorthogonal Components 1 Mathematical Geology, Vol 34, No 4, May 2002 ( C 2002) Coregionalization by Linear Combination of Nonorthogonal Components 1 J A Vargas-Guzmán, 2,3 A W Warrick, 3 and D E Myers 4 This paper applies the

More information

Credibility of climate predictions revisited

Credibility of climate predictions revisited European Geosciences Union General Assembly 29 Vienna, Austria, 19 24 April 29 Session CL54/NP4.5 Climate time series analysis: Novel tools and their application Credibility of climate predictions revisited

More information

The importance of spatial sampling methods for their analysis results selected issues

The importance of spatial sampling methods for their analysis results selected issues The importance of spatial sampling methods for their analysis results selected issues Jadwiga Suchecka 1 Abstract A space or an area is characterised by the specific data and the uncertainties inherent

More information

Influence of parameter estimation uncertainty in Kriging: Part 2 Test and case study applications

Influence of parameter estimation uncertainty in Kriging: Part 2 Test and case study applications Hydrology and Earth System Influence Sciences, of 5(), parameter 5 3 estimation (1) uncertainty EGS in Kriging: Part Test and case study applications Influence of parameter estimation uncertainty in Kriging:

More information

Stepwise Conditional Transformation for Simulation of Multiple Variables 1

Stepwise Conditional Transformation for Simulation of Multiple Variables 1 Mathematical Geology, Vol. 35, No. 2, February 2003 ( C 2003) Stepwise Conditional Transformation for Simulation of Multiple Variables 1 Oy Leuangthong 2 and Clayton V. Deutsch 2 Most geostatistical studies

More information

Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions

Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions Oy Leuangthong, Jason McLennan and Clayton V. Deutsch Centre for Computational Geostatistics Department of Civil & Environmental Engineering

More information

USING GEOSTATISTICS TO DESCRIBE COMPLEX A PRIORI INFORMATION FOR INVERSE PROBLEMS THOMAS M. HANSEN 1,2, KLAUS MOSEGAARD 2 and KNUD S.

USING GEOSTATISTICS TO DESCRIBE COMPLEX A PRIORI INFORMATION FOR INVERSE PROBLEMS THOMAS M. HANSEN 1,2, KLAUS MOSEGAARD 2 and KNUD S. USING GEOSTATISTICS TO DESCRIBE COMPLEX A PRIORI INFORMATION FOR INVERSE PROBLEMS THOMAS M. HANSEN 1,2, KLAUS MOSEGAARD 2 and KNUD S. CORDUA 1 1 Institute of Geography & Geology, University of Copenhagen,

More information

Sampling in Space and Time. Natural experiment? Analytical Surveys

Sampling in Space and Time. Natural experiment? Analytical Surveys Sampling in Space and Time Overview of Sampling Approaches Sampling versus Experimental Design Experiments deliberately perturb a portion of population to determine effect objective is to compare the mean

More information

On Sampling Errors in Empirical Orthogonal Functions

On Sampling Errors in Empirical Orthogonal Functions 3704 J O U R N A L O F C L I M A T E VOLUME 18 On Sampling Errors in Empirical Orthogonal Functions ROBERTA QUADRELLI, CHRISTOPHER S. BRETHERTON, AND JOHN M. WALLACE University of Washington, Seattle,

More information

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA SPATIAL STATISTICS Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA Keywords: Geostatistics, Isotropy, Kriging, Lattice Data, Spatial point patterns, Stationarity

More information

Kernel adaptive Sequential Monte Carlo

Kernel adaptive Sequential Monte Carlo Kernel adaptive Sequential Monte Carlo Ingmar Schuster (Paris Dauphine) Heiko Strathmann (University College London) Brooks Paige (Oxford) Dino Sejdinovic (Oxford) December 7, 2015 1 / 36 Section 1 Outline

More information