A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys

Size: px

Start display at page:

Download "A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys"

Lizbeth Knight
5 years ago
Views:

1 A comparative study of optimization methods and conventional methods for sampling design in fishery-independent surveys Yong Liu, Yong Chen, and Jiahua Cheng 1873 Liu, Y., Chen, Y., and Cheng, J A comparative study of optimization methods and conventional methods for sampling design in fisheryindependent surveys. ICES Journal of Marine Science, 66: We have introduced and evaluated a procedure, the constrained spatial simulated annealing method, for developing an optimal sampling design for fishery-independent surveys. We used two criterion functions, minimization of the mean of the shortest distance (MMSD) and uniform distribution of point pairs for variogram estimation (WM), and three arrangements of the two criteria, all WM, all MMSD, and a combination of MMSD (2/3 of samples) and WM (1/3), to construct three optimized sampling designs (denoted as Designs I, II, and III, respectively). These three designs were compared in a simulation study with systematic sampling (Design IV) and stratified random sampling designs (Design V), commonly used in fishery-independent surveys. Three levels of sample size (small, medium, and large) were considered in the simulation study developed using a geostatistical approach. The results showed that for parameter estimation of the spatial covariance function, Design III was better than the other designs at relatively small sample size and Design II performed better than the other designs at relatively large sample size. For estimating fish stock abundance, the performance of the designs considered in this study can be ranked as follows: Design II. Design IV. Design III. Design V. Design I. It is clearly important to evaluate and improve sampling design based on historical survey data. Such a study allows us to identify an optimal sampling design to balance the quality of the data collected and the costs of the sampling programme, leading to the development and optimization of a sustainable and fishery-independent monitoring programme. Keywords: fishery-independent survey, geostatistics, optimization method, sampling design, simulation. Received 8 December 2008; accepted 15 April 2009; advance access publication 14 June Y. Liu and J. Cheng: Key and Open Laboratory of Marine and Estuarine Fisheries Certificated by the Ministry of Agriculture, East China Sea Fisheries Institute, Chinese Academy of Fishery Sciences, Shanghai , China. Y. Chen: School of Marine Sciences, University of Maine, Orono, ME 04469, USA. Correspondence to Y. Liu: tel: þ ; fax: þ ; liuyong7707@yahoo.com.cn. Introduction Fishery-independent surveys are critical to fish stock assessment and management, because they provide reliable information about fish stocks at a defined temporal and spatial scale (Jardim and Ribeiro, 2007). The cost of such a survey is often so high that it is important and necessary to optimize the sampling design and maximize the use of data collected for the stock or stocks being investigated (Simmonds and Fryer, 1996). Sampling design affects the methods of data collection and the quality, quantity, and spatial and temporal scales of the data collected. It also greatly influences the choice of methods that can be used for data analyses (Cochran, 1977; Rivoirard et al., 2000; Stein and Ettema, 2003). The sampling design used in fishery-independent surveys can be divided into two groups: (i) the design-based approach used in classical survey and sampling programmes, and (ii) the modelbased approach following primarily geostatistics (Petitgas, 2001; Jardim and Ribeiro, 2007). These two approaches differ greatly in their assumptions about the properties of target populations and sampling locations. The design-based approach assumes that the values of variables to be measured at sampling locations are fixed, and the random selection of sampling locations provides necessary randomization, whereas the model-based approach assumes that the randomization occurring in natural processes is sufficient, and that sampling locations can be fixed (Hansen et al., 1983; Thompson, 1992; Brus and de Gruijter, 1997). Random sampling is a classical survey method, but may have its limitation in fishery-independent surveys because of some unique characteristics of fish populations. For example, fish are likely to aggregate spatially, which does not satisfy the assumption of random distribution of individuals in a survey area for random sampling. Possible spatial correlations between samples often existing in the fishery-independent survey of fish populations may bias variance estimators (Harbitz and Aschan, 2003). Another fact is that the survey design may not be implemented exactly at sea for logistical reasons such as poor weather and untrawlable substrata (Petitgas, 2001). Therefore, a random sampling design (i.e. only spatially random, not temporally random) cannot always be followed strictly. If succeeding stations are too close, sample processing and handling may also be complicated (Harbitz et al., 1998). Recent studies have increasingly focused on spatial statistics for estimating fish stock abundance (Petitgas, 1993; Rivoirard et al., 2000; Grabowski et al., 2005). In theory, when there are spatial correlations between observations and spatial structure can be appropriately considered in modelling, abundance estimates obtained with a systematic sampling design tend to be more precise than # 2009 International Council for the Exploration of the Sea. Published by Oxford Journals. All rights reserved. For Permissions, please journals.permissions@oxfordjournals.org

2 1874 Y. Liu et al. others derived based on random sampling (Cochran, 1977; Rivoirard et al., 2000). The data from non-random survey designs can be analysed using spatial statistics, and the distribution of fisheries resources can be inferred (Petitgas, 2001). A geostatistical design approach involves selecting and optimizing (e.g. minimizing or maximizing) a suitable objective function (Diggle and Lophaven, 2006), developed based on the selected design criterion (e.g. the mean or maximum value of variance in the estimation; van Groenigen et al., 1999). Most approaches assume that all parameters defining the covariance structure of data are known, and the objective is to identify the best design that can minimize the variance in the estimation (McBratney and Webster, 1981; Domburg et al., 1997; van Groenigen et al., 1999; Marchant and Lark, 2007). Such approaches may not be applicable to fisheries because of limited knowledge of the covariance structure of fisheries data, which is essential for these approaches. However, there are other approaches that do not need the information about data covariance structure (Russo, 1984; Warrick and Myers, 1987; Abt et al., 1999; Bogaert and Russo, 1999; Muller and Zimmerman, 1999; Lark, 2002). One such approach is to focus on the goodness of semi-variogram estimation, critical for geostatistical analysis. A conventional procedure for the estimation of semi-variogram model parameters is to group all distances between every pair of points (if considering anisotropy, the direction should be consistent within a group), from which the average distance in each distance group can be derived and used for estimating the parameters. Design criteria focus on the identification of an approach to deal with pairwise samples within each group (e.g. minimizing the variance of the distances; Russo, 1984) to achieve a proper distribution of pairwise samples among all distance groups (Warrick and Myers, 1987). Another type of design is purely geometric, such as favouring a regular spatial distribution of sampling locations within the constraints imposed by a particular application (Royle and Nychka, 1998) or maximizing the spatial spread of sampling locations (van Groenigen and Stein, 1998). The sampling design criteria discussed above have been used in research areas as diverse as mining engineering, soil and crop science, hydrology, and ecology. As far as we know, they have not been used before in fisheries. One of the objectives of this study is to compare the performance of different sampling designs. Suitable measures need to be identified for the comparisons. Ideally, a performance measure should include an evaluation of accuracy, which is the degree of closeness of a measured or calculated quantity to its actual (true) value, and precision, which is the extent to which further measurements or calculations show the same or similar results (Taylor, 1997). In this study, a statistic, the relative standard error (RSE), was used to summarize and compare both the accuracy and precision of parameter estimation for the covariance function (Chen, 1996; Andrew and Chen, 1997). We also used an index, the relative bias (RB), to compare the accuracy of stocksize estimation (Paloheimo and Chen, 1996). Our study aimed at proposing geostatistics-based design criteria for sampling design, comparing them with the conventional sampling designs used in fishery-independent surveys, and identifying an optimal approach for fishery-independent survey design. We developed a constrained spatial simulated annealing (CSSA) method (van Groenigen et al., 1999) for optimizing sampling schemes. We used the following three criteria: (i) minimization of the mean of the shortest distance (MMSD; van Groenigen and Stein, 1998), (ii) uniform distribution of pairwise points for variogram estimation (WM; Warrick and Myers, 1987), and (iii) a combination of MMSD (2/3 of samples) and WM (1/3) to construct three optimized sampling schemes. The above three optimized sampling schemes were compared with the two conventional sampling designs, stratified random sampling and systematic sampling schemes, for estimating sampling parameters and fish abundance in a simulation study. The simulation study includes simulating the fish distribution using parameters estimated from a real fishery-independent survey, which were considered as true values in the simulation, then sampling the true distribution with the five sampling schemes listed above. Using the estimated parameters for the five sampling designs, we evaluated and compared the results in terms of estimated fish abundance and identified an optimal sampling design. The study provides new insights into survey designs for fishery-independent surveys that are critical to fish stock assessment and management. Material and methods Study area and data collection A fishery-independent bottom trawl survey in the East China Sea has been conducted for many years. The survey area covers the area between 27 and 348N and from 120 to 1278E. The survey design follows a systematic sampling design with sampling stations evenly distributed in the survey area with a 30 0 interval between each pair of stations. Based on previous studies (Liu et al., 2004, 2006), two Setipinna taty stocks were identified, a large one in the north and a small one in the south. The northern stock, which is distributed from 27 to 308N and from 120 to 1278E, was used for the simulation study. The original spatial autocorrelation parameters for the stock were calculated from the survey data collected during the period in the survey area. Sampling design Systematic sampling design Our design was consistent with that for the actual survey from which the simulation data were collected. Based on this design, sampling stations were evenly distributed with an equal interval between each pair throughout the survey area. We removed some opportunistic and irregular stations that appeared in the actual surveys and added some stations to fill the survey area defined in the simulation more evenly (Figure 1). The systematic sampling design is referred to as Design IV in this study. Stratified random sampling design This design is widely used in fishery-independent surveys and is supported by classic statistics theory (Cochran, 1977; Manly et al., 2002; Miller et al., 2007). In fishery-independent bottom trawl surveys, strata are often defined based on environmental variables shown to be important in influencing fish distribution, such as depth and geographic area. Sampling efforts were allocated among strata, generally proportional to the stratum area (Chen et al., 2006). The sampling stations in each stratum were assigned randomly (Cochran, 1977). In this simulation study, the survey area was divided into four strata according to depth:,40, 40 60, 60 80, and.80 m (Figure 1). The number of sampling stations in each stratum was determined by both the stratum area and the distance to the coast because an analysis of historical data suggested that both variables were important in determining the spatial distribution of S. taty (Liu et al., 2004, 2006). The stratified random survey design is referred to as Design V in this study.

3 Optimization methods and conventional sampling design in fishery-independent surveys 1875 Figure 1. Illustration of the spatial distribution of sampling stations from the five sampling designs (columns of panels) for the three different levels (rows of panels) of sample sizes (i.e. 40, 82, and 160). Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Optimized sampling design by constraint spatial simulated annealing In terms of the objective functions for optimization, an optimal design can normally be obtained through two steps: defining a criterion, then finding the set of sampling locations that can minimize the objective function defined in the criterion (Diggle and Lophaven, 2006). Here, we used two criteria, the criterion defined by Warrick and Myers (1987; referred to as the WM criterion) and the MMSD criterion (van Groenigen and Stein, 1998). The WM criterion (Warrick and Myers, 1987) aims to obtain an even distribution of the number of coupled points among all classes/groups of distances to achieve the precise variogram estimation. It does not need any assumption about spatial autocorrelation structure, but depends on the distances between sampling points: f WM ðsþ ¼ Xnc i¼1 ½aw i ð f i f i Þ 2 þ bm i Š; ð1þ where S is a set of sampling points, n c the number of distance classes, f the number of coupled stations in a given distance class, f * the expected number of coupled stations in a given distance class, a, b, and w the user-defined weighting coefficients, and M the other possible accountable objections in distance classes. In this study, for simplicity, we considered just a simple situation for which b was 0 and a and w were both set equal to 1. For a particular number of samples (n) and number of distance classes (n c ), the expected number of coupled stations for a uniform distribution is f i nðn 1Þ ¼ : ð2þ 2n c The MMSD criterion (van Groenigen and Stein, 1998) aims for an even distribution of sampling points over the entire survey area. The even distribution can be obtained when the expectation of the distances between a point chosen arbitrarily within the survey area and its nearest sampling point is minimal. With N points of a finely meshed grid as reference points, the criterion is to minimize the average of the distance of all grid points to their nearest sampling points: f MMSD ðsþ ¼ 1 N X N i dðx i ; SÞ; where x is the grid point and d(x, S) is the distance between grid point x and the closest sampling point in sampling set S. The number of grid points should be much larger than the number of sample points to achieve a reliable value of f MMSD (Simbahan and Dobermann, 2006). ð3þ

4 1876 Y. Liu et al. Next, it is necessary to find an effective way to optimize the criterion function. Simulated annealing (SA) is a generic probabilistic approach for finding an approximation to the global optimum of a given objective function f(s) (Kirkpatrick et al., 1983; Simbahan and Dobermann, 2006). The key aspect of this method is to determine whether the new solution f(s iþ1 ), obtained from a random perturbation of one of the variables in S i, can take the place of the previous f(s i ). The core of SA is the Metropolis selection criterion (Metropolis et al., 1953), written as P c ðs i! S iþ1 Þ¼1; if fðs iþ1 ÞfðS i Þ P c ðs i! S iþ1 Þ¼exp fðsiþ1þ fðsiþ c ; if fðs iþ1 Þ. fðs i Þ ; ð4þ where c denotes a positive control parameter, decreased with the optimization progress. If S iþ1 is accepted, it is taken as the starting point for the next solution S iþ2, and the process continues. The positive control parameter c is also referred to as the annealing temperature. There are many cooling schedules to control the value of parameter c. In this study, this parameter is calculated by the following equation: c kþ1 ¼ ac k ; k ¼ 1; 2;...; ð5þ where a is a user-determined parameter (,1 to keep c decreasing), typically having a value slightly,1 (e.g ), and k the number of optimization processes that have been performed. If the SA optimization method is adopted for spatial sampling, it is referred to as spatial SA (SSA; van Groenigen and Stein, 1998). The SSA method was further improved to become a CSSA method. Sampling points were treated as continuous variables rather than chosen from a discrete grid (van Groenigen and Stein, 1998). For the SSA method, the distance from one randomly chosen sampling point to another is between 0 and h max, which is the length of the sampling region. In the modified CSSA method, h max is not always the same and can decrease with progress. In this way, the efficiency of finding a better sampling scheme can be improved greatly. Three sampling designs were then developed based on CSSA. The first was optimized based on the WM criterion (Figure 1; referred to as Design I). The second sampling design was optimized based on the MMSD criterion (Figure 1; Design II). The third sampling design was optimized based on both the WM and MMSD criteria, with a sequential MMSD þ WM optimization strategy (Figure 1; Design III). Two-thirds of the sampling points were optimized according to the MMSD criterion. After the convergence of the MMSD algorithm, the remaining one-third of the points were optimized according to the WM criterion, with the MMSD-optimized sampling points treated as prior information (Simbahan and Dobermann, 2006). For the convenience of comparison among the five designs, the WM and MMSD criterion values were also calculated for Designs IV and V without the procedure of the optimizations. Geostatistical model framework and simulation study A Gaussian stochastic process is widely used to quantify the distribution of geostatistical data. However, the original observations usually cannot satisfy the Gaussian assumption. Therefore, we used a log-gaussian geostatistical model, requiring log-transformation of the original data, a typical Box Cox Gaussian transformation (Box and Cox, 1964). A Gaussian spatial process, {S(x): x[r 2 }, is a stochastic process with the property that for any collection of locations x 1,..., x n with each x i [R 2, the joint distribution of S ¼ fs(x 1 ),..., S(x n )g is a multivariate Gaussian distribution. With mean vector m S, calculated as m(x) ¼ E[S(x)], and covariance matrix G as G ij ¼ g(x i, x j ) ¼ CovfS(x i ), S(x j )g, we have SMVN(m S, G) (Diggle and Ribeiro, 2007). A stationary Gaussian process was applied in this study, and it is defined by two statistics: the mean of every location x, which is a constant m(x) ¼ m, and the covariance depending only on the distance between x and x 0, g(x, x 0 ) ¼ g(h), where h is the Euclidean distance between locations x and x 0. The variance in the stationary process is a constant s 2 ¼ g(0), and the correlation function is defined as r(h) ¼ g(h)/s 2, which we describe with an exponential function. The exponential function can be written as r(h) ¼ exp(2h/f), where f is the correlation range parameter with the property r(h) 0.05 when h ¼ 3f. In practice, there is always another type of independent variance in observed data, which is usually referred to as the nugget effect t 2. In geostatistical terminology, t 2 þ s 2 is the total sill, s 2 the partial sill, t 2 the nugget effect, and 3f the practical range (Isaaks and Srivastava, 1989). Geostatistical parameter estimation The geostatistical multivariate Gaussian model can be written (Diggle and Ribeiro, 2007) as Y NðDb; s 2 RðfÞþt 2 IÞ; where D is an n p matrix of covariates, b the corresponding vector of regression parameters, and R the correlation function, which depends on correlation range parameter f. The product of the two parameters D and b describes a linear specification for the spatial trend m(x). The log-likelihood function is Lðb; t 2 ; s 2 ; fþ ¼ 0:5fn logð2pþþlogfjðs 2 RðfÞþt 2 IÞjg þðy DbÞ T ðs 2 RðfÞþt 2 IÞ 1 ðy DbÞg : Model parameters can be estimated through maximization of the likelihood function. Unconditional simulation A key aspect of simulation is to simulate a realization. Y can be represented by Y ¼ m þ S þ tt, where m ¼ E[Y], T ¼ (T 1,..., T n )is a set of mutually independent random variables following a standard Gaussian distribution N(0,1), and S ¼ (S 1,..., S n ) follow a zero-mean multivariate Gaussian distribution, SMVN(0, P ). The realization of S can be obtained through linear transformation: S ¼ AZ, where A is such a matrix that AA 0 ¼ P, and Z ¼ (Z 1,..., Z n ) follow a standard normal distribution N(0,1). Matrix A can be derived using two methods: Cholesky factorization and singular value decomposition. However, when n becomes large, these two methods become less efficient at simulation. It is necessary to use other methods, such as the ingenious algorithm described by Wood and Chan (1994), circulated embedding in conjunction with fast Fourier transform methods, or a Markov Chain Monte Carlo method (Diggle and Ribeiro, 2007). ð6þ ð7þ

5 Optimization methods and conventional sampling design in fishery-independent surveys 1877 Figure 2. The flowchart of the simulation study. Conditional simulation Conditional simulation is the simulation of a spatial process S(x) at location x i * : i ¼ 1,..., N, conditional on observed values S(x i )at location x i : i ¼ 1,..., n. The conditional distribution of the values of the process S(x) at any set of locations, e.g. S* ¼ fs(x i * ),..., S(x N * )g, given the data Y, follows a multivariate Gaussian distribution with a variance matrix independent of Y (Diggle and Ribeiro, 2007). Simulation procedure The simulation started with the sampling design. As described above, five sampling designs were considered, each with three different numbers of sampling stations: 40, 82, and 160. They are denoted by G N D : D ¼ 1,..., 5, corresponding to Design I,..., Design V, respectively, and N ¼ 1, 2, and 3, corresponding to sample sizes of 40, 82, and 160, respectively. These three sample sizes were used to test the impacts of sample size on the simulation and are referred to as the small, medium, and large sample sizes in this study. The parameters of spatial distribution characteristics of each year were estimated from the survey data of S. taty and denoted as u y : y ¼ 2000,..., In all, 200 unconditional simulation runs were conducted for each set of parameters, with each simulation run having 3685 regulated grid-point realizations within the study area, and were denoted as Y y S : S ¼ 1,..., 200. The next step was to draw a sample from the above simulation realizations according to the sampling designs, and the parameters were estimated for each sample (denoted as eu y SD ). The last step was to simulate 100 runs for the estimated spatial structure parameters with 952 regulated grid-point realizations within the study area and to average the results of 100 simulations to yield the final simulation result, denoted as ~Y y SD. The simulation process is illustrated in Figure 2. Analysis of results The simulation results derived for the five designs were compared using two measures. The first measure, RSE, evaluates the accuracy and precision of estimated parameters for covariance functions by comparing the true values estimated from 8 years of original data with values estimated from the data simulated by sampling from unconditionally simulated data, with different sampling schemes (200 unconditional simulations for each year). The RSE is calculated as qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi P n estimated i¼1ðvi V true Þ 2 =n RSE ¼ V true ; ð8þ where V i estimated is the value of parameters estimated from the ith unconditional simulation (for a given sampling design in a given year), V true the parameter value derived from the original data (for a given year), and n the number of simulation runs. The RSEs of 8 years were pooled to calculate the mean and s.d. for comparing the accuracy and precision of parameter estimates among the five designs for each of the three sample sizes considered. The second measure, RB, evaluates the accuracy of stock abundance estimates by comparing the assumed true value (stock sizes estimated from the unconditional simulation data) with the stock sizes estimated by averaging the 100 estimates of stock size derived from 100 sets of data, i.e. the conditional simulations based on the sampled data from unconditional simulation. We used the RB to compare the accuracy of estimated stock size because the unconditional simulation resulted in different total stock sizes, so the absolute bias between true and estimated values could not be

6 1878 Y. Liu et al. Table 1. Exponential covariance function parameters estimated for Setipinna taty for the survey years Year b s 2 f t The parameters are defined in Equations (6) and (7). compared. The RB is calculated as V estimated V true RB ¼ V true ; ð9þ where V estimated is the average of 100 conditional estimates of stock size for a given sampling design corresponding to an assumed true distribution and V true the assumed true value of stock size derived from the unconditional simulation for a given year. For each year, each unconditional simulation yielded a value of RB, and 200 values were derived in total. Their means were calculated for each year, and the mean and s.d. of the values of 8 years were calculated to compare the difference in accuracy of stockabundance estimates among the five sampling designs under the three different sample sizes considered. R software (R Development Core Team, 2008) was used in this study, with add-on packages geor (Ribeiro and Diggle, 2001) and RandomFields (Schlather, 2001). Results Exponential covariance function parameters for the survey Using 8 years of survey data collected by the East China Sea Fisheries Research Institute, we estimated parameters quantifying the characteristics of spatial distribution for S. taty in the north part of the East China Sea, using Equations (6) and (7) (Table 1). These parameters were used to simulate the fish distribution in the simulation study. Five sampling designs For Design I (WN design), the coefficient of variation for the number of paired sampling stations in different distance classes used in variogram estimation was small, ranging from 0.02 to 0.35%, suggesting a fairly uniform distribution of paired sampling stations in the survey area. However, the WM design was not optimal for sampling because sample stations were clustered (Figure 1), and the evenness of the distribution of sampling stations was the poorest of all the designs, as demonstrated by the largest MMSD criterion values for Design I (Table 2). Design II (MMSD design) led to good and even coverage of the survey area (Figure 1) and had the least average distance of all grid points to their nearest sampling point (f MMSD ) among all the sampling designs for each sample size (Table 2). However, the number of paired sampling stations was small at short-lag intervals. For example, the number of paired sampling stations in the shortest lag class (0 1) identified based on the MMSD sampling design was the least of all the sampling designs for each sample Table 2. Values of the objective function and frequency distribution of paired points in different classes for the five sampling designs. Parameter Design I Design II Design III Design IV Design V Size f MMSD f WM f* Lag Distance Number of paired points in each lag distance class Mean s.d CV (%) Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design.

7 Optimization methods and conventional sampling design in fishery-independent surveys 1879 Table 3. Mean and s.d. of the RSE for the three parameters for covariance function (b, s 2, f) estimated in the simulation for the eight survey years. Design Sample size Estimated parameter I II III IV V 40 b Mean s.d s 2 Mean s.d f Mean s.d b Mean s.d s 2 Mean s.d f Mean s.d b Mean s.d s 2 Mean s.d f Mean s.d The smallest average RSE for each parameter among the five designs is emboldened, and the largest is italicized and underlined. Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3)þWM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. size (Table 2). If the sample size was small, the number of paired sampling stations at short-lag intervals would be too few to estimate the parameters of the variogram precisely. Values of f MMSD for Design III (MMSDþWM design) were higher than those for Design II and were the least of all other designs except for Design IV (Table 2). Moreover, the f WM value was also larger than that for Design I and was the least of all other designs (Table 2). Both criterion values in the split dataset were inferior to their whole dataset counterparts, an expected trade-off because both spatial coverage and modelling of the variogram were included. The f MMSD value of the systematic design (Design IV) was the second smallest, just slightly higher than that for the MMSD design (Design II). However, the stratified random design (Design V) had the second largest value of f MMSD, just smaller than that for the WM design (Design I; Table 2). The f MMSD values for the five designs for all sample sizes had the following order of magnitude, i.e. f MMSD (Design II), f MMSD (Design IV), f MMSD (Design III), f MMSD (Design V), f MMSD (Design I). Comparing the RSEs of estimated covariance function parameters The smallest values of RSE (implying the highest accuracy and precision) were derived from Designs II, III, and IV. The frequency of the smallest values appearing in Designs II, III, and IV was 5 (out of 9), 3 (out of 9), and 1 (out of 9), respectively (Table 3). The largest value appeared in Designs I, II, and IV, and the corresponding frequency was 6 (out of 9), 2 (out of 9), and 1 (out of 9), respectively. Design I seemed to perform poorly at estimating the parameters, with the highest frequency of the largest RSE value (i.e. lowest accuracy and precision in 6 out of 9 times), and never the smallest RSE value (the highest accuracy and precision). Design II seemed to be good at estimating parameters for the large sample size, with three estimates having the highest accuracy and precision, but poor for small sample size, with two estimates having the lowest accuracy and precision. Design III seemed to be good for relatively small sample sizes, with two estimates having the highest accuracy and precision for the small sample size and one for the medium sample size. This design yielded no estimate with the lowest accuracy and precision. Design IV had the highest and lowest accuracy and precision for one estimate each in the medium sample size, and Design V had the estimates with neither the highest nor the lowest accuracy and precision in the simulation. For the small sample size of 40, Design III was good for parameter estimation: two highest accuracy and precision for parameters s 2 and f, respectively, and the second highest accuracy and precision for parameter b just below the highest of Design II. For the medium sample size of 82, Designs II, III, and IV had the highest accuracy and precision, respectively, for s 2, f, and b, but Design IV had low accuracy and precision for f. The estimation accuracy and precision for Designs II and III tended to be similar and better than the other designs. For the large sample size of 160, the best estimation of all parameters was by Design II, and the poorest by Design I. The mean RSE of parameter b tended to decrease with sample size for all designs, except for Design IV. The mean RSE values of parameters s 2 and f also decreased with sample size for all designs except Design I. Comparing the RB of estimated stock sizes Design I yielded the estimates with the largest RB value under all three sample sizes considered, implying that it had the lowest accuracy (Table 4). The estimates for Design II were best when sample sizes were 40 and 160, and second best when sample size was 82. The estimation for Design IV was best when sample size was 82. For the accuracy of stock size estimates, Design II was best, followed by Designs IV, III, V, and I.

8 1880 Y. Liu et al. Table 4. Mean and s.d. of RB of the estimated stock size in the simulation. Sampling design Sample size I II III IV V 40 Mean s.d Mean s.d Mean s.d The smallest average RB for each parameter among five designs is emboldened, and the largest is italicized and underlined. Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3)þWM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Similar to the RSE, the mean RB value of stock size estimates tended to decrease with sample size for each design. Discussion The classical survey design for estimating population abundance is simple random sampling with a good statistical property. For fishery-independent sampling, because of the many unique characteristics of fish stocks (e.g. fish aggregation and migration, and logistic constraints of fisheries surveys such as unsuitable trawling substrata and poor weather), simple random sampling may not be an optimal sampling design for fisheries. Stratified random sampling is often used in fishery-independent surveys. Its design may, however, include some subjectivity such as the choices of variables used to determine strata, the number of strata, and the criteria used to allocate sampling effort among strata. Arbitrary selections also occur in the systematic sampling design. For example, the location of the survey starting point, the interval span of two points, and the location of the endpoint may all be determined arbitrarily. The optimized sampling design (i.e. Designs I, II, and III) proposed in this study tends to be more objective, and the results are more stable. As the original parameters were estimated from the data collected in the survey based on systematic sampling design, one would ask whether this might influence the comparisons, resulting in biases favouring the systematic sampling design. The answer is definitely negative. Here, we only used the parameters to produce the true distribution within the study area for simulation, and it had no relationship with the data derived from systematic sampling because the process of producing the distribution was based on the unconditional simulation method, which is a randomized procedure. Therefore, the parameters estimated from the data collected in systematic sampling should have had no impact on our results. Design III, the combined MMSDþWM-optimized sampling design, estimated the parameters for the covariance function better than other designs when sample size was small. However, Design II, the MMSD-optimized sampling design, performed better than other designs when sample size was large, but poorest when sample size was small. According to Warrick and Myers (1987), the use of the WM criterion in sampling design could enhance the reliability of estimation for the variogram. However, when the WM-optimized sampling design was used, the sampling locations chosen tended to aggregate at two sites, which was similar to the results found in other studies (Warrick and Myers, 1987; Simbahan and Dobermann, 2006). Sampling stations defined based on this design tend to be limited in small areas, and the sampling results may not be a good representation of the fish population in the survey area. Hence, the accuracy and precision of parameter estimation for the WM-optimized sampling scheme is low and became worse with increasing sample size, compared with other designs, because of the absence of an apparent increase in area coverage with increased sample size. This is different from other sampling designs for which an increase in sampling density tends to lead to improved accuracy and precision. The combined MMSDþWM-optimized sampling scheme (i.e. Design III), however, performs well because it combines the advantages of both spatial coverage and variogram estimation. Design III only performed better than other designs when sample size was small; when sample size was large, the performance of Design II was the best. On the other hand, Design II gave a poor performance in estimating parameters for a small sample size. For a small sample size, the short paired distances were too few to yield an accurate estimate for the short distance classes of the variogram, resulting in poor estimates. With an increase in sample size, the number of short paired distances increases, improving the parameter estimation precision. In addition, the sampling site distribution of Design II is more evenly distributed than that of other designs (see f MMSD values in Table 2). This may be the reason for Design II performing better than other designs when sample size was large. With a set of well-estimated parameters, we can derive a good prediction of the spatial distribution and population abundance of a fish stock, which is often the central task of a fishery-independent survey programme and provides crucial information for fisheries management. Our study shows that the performance of five sampling designs in estimating stock size has the following ranking: Design II. Design IV. Design III. Design V. Design I, coinciding with the order of MMSD criterion values for sampling designs. This implies a tendency for increasing accuracy with evenness of sampling station spatial distribution. This tendency is in general agreement with that found in other studies. Cochran (1977) reported that, based on a few examples, systematic surveys were more accurate than stratified random surveys. Ripley (1981) considered the error variance of the sample mean for a stationary isotropic random process and showed that systematic sampling performed better than stratified random sampling and that both stratified random and systematic surveys performed better than a simple random survey. Simmonds and Fryer (1996) compared eight sampling strategies from a series of simulations and found that more-precise estimates of the surface mean could be obtained using stratified random or systematic sampling than simple random sampling. Jardim and Ribeiro (2007) compared five designs constructed by an informal method with the actually used stratified random design and found that, except for the high density design used as a benchmark, the design combining regular (i.e. systematic sampling design) and stratified random design performed best, which resulted in more evenly distributed sampling stations in the survey area than the other designs. One exception to the above tendency was at a medium sample size. For a medium sample size, Design IV was better than Design II, with the other designs following the same order in ranking of their performance.

9 Optimization methods and conventional sampling design in fishery-independent surveys 1881 Table 5. Rates of changes in accuracy [described by ACR calculated in Equation (10)] for the estimation of parameters of covariance function and stock size with increased sample sizes for the five sampling designs. Sampling design Estimated object Changes in sample size I II III IV V Parameter b E E E E E E E E E E204 Parameter s E E E E E E E E E E204 Parameter f E E E E E E E E E E203 Abundance E E E E E E E E E E203 Design I, WM-optimized sampling design; Design II, MMSD-optimized sampling design; Design III, sequential MMSD (2/3) þ WM (1/3)-optimized sampling design; Design IV, systematic sampling design; and Design V, stratified random sampling design. Comparing the accuracy of stock-size estimates with the accuracy and precision of parameter estimates, it seems that the accuracy of stock-size estimation is determined by the accuracy and precision of parameter b estimation, rather than by the accuracy and precision of other parameters. For example, for a small sample size, although Design II had the lowest accuracy and precision of the other two parameters, it still yielded the greatest accuracy in terms of stock-size estimation, with only its best estimate of parameter b. For the medium sample size, Design IV yielded the highest accuracy and precision of parameter b and the highest accuracy of stock size, although it had the poorest estimate for another parameter. Muller (2001) pointed out that those ideal designs for estimating covariance parameters of the stochastic process were not the same as for the best estimate of the value of the stochastic process in a given location or in estimating overall abundance. Our findings partially agree with Muller s conclusion, in that the accuracy of abundance estimation does not depend on all covariance parameters, but is mainly determined by the key covariance parameter b. This result is not difficult to understand, because parameter b is the mean component of the Gaussian random field model (see Material and methods section above), or the spatial trend which is directly related to abundance estimates. An interesting question is whether we can set parameter b as the objective in the estimation instead of stock abundance. We can then just focus on the covariance parameter estimation. This, however, needs further study. Accuracy and precision of parameter estimation and accuracy of abundance estimation both increased with sample size, which is consistent with the findings of many other studies (Simbahan and Dobermann, 2006; Jardim and Ribeiro, 2007). Jardim and Ribeiro (2007) found that in the presence of spatial correlation, an increase in sample size may not provide a proportional increase in the quantity of information because of the partial redundancy of information resulting from the spatial correlation. We calculated an accuracy changing rate (ACR), which is the change in the performance measure statistics (i.e. RSE and RB) for the parameters or abundance estimation divided by the corresponding change in sample size: ACR ¼ðM size1 M size2 Þ=ðSize2 Size1Þ; ð10þ where Size1 and Size2 are two different sample sizes and M size1 and M size2 are corresponding statistics. The results are listed in Table 5. Most of the changes in the RSE and RB when sample sizes increased from 82 to 160 were smaller than when sample size increased from 40 to 82. This is consistent with the results of Jardim and Ribeiro (2007). Some interesting questions can be asked, such as whether there is a sample size at which an additional increase in the number of sampling stations cannot significantly improve the estimation accuracy, and how one can identify a cost-effective sample size that balances the costs of sampling and the accuracy of estimations. A simulation study similar to this can be used to address such questions. This study took a pelagic fish species as an example. Whether the results would be different for a fish species with different annual variations and/or life history can be addressed readily using the simulation framework developed here. The study was based on 8 years of survey data. In practice, however, one may not need that many years of data, and a few recent years of data may be sufficient because information from the most recent years is more relevant to the current characteristics of the targeted fish populations. However, if there is no historical information, such a study is impossible. Based on historical survey data, such a study allows one to improve current survey design, balancing the quality of the data collected and the costs of the sampling programme, leading to the development of a sustainable and fishery-independent monitoring programme. Acknowledgements Thanks are due to all scientific staff and crew for their assistance in collecting data during the surveys. The work was conducted under the auspices of the National Science and Technology Support Plan (2007BAD43B01), the Chinese Ministry of Agriculture Assessment of Marine Fisheries Resources Programme, and the Ministry of Science and Technology Public Project (2006/2007). Support from the East China Sea Fisheries Research Institute and the University of Maine is appreciated greatly. References Abt, M., Welch, W. J., and Sacks, J Design and analysis for modeling and predicting spatial contamination. Mathematical Geology, 31: Andrew, N., and Chen, Y Optimal sampling for estimating the size structure and mean size of individuals caught in an abalone fishery. Fishery Bulletin US, 95:

10 1882 Y. Liu et al. Bogaert, P., and Russo, D Optimal spatial sampling design for the estimation of the variogram based on a least squares approach. Water Resources Research, 35: Box, G. E., and Cox, D. R An analysis of transformations. Journal of the Royal Statistical Society, Series B (Methodological), 26: Brus, D. J., and de Gruijter, J Random sampling or geostatistical modelling? Choosing between design-based and model-based sampling strategies for soil (with discussion). Geoderma, 80: Chen, Y A Monte Carlo study on impacts of the size of subsample catch on estimation of fish stock parameters. Fisheries Research, 26: Chen, Y., Sherman, S., Wilson, C., Sowles, J., and Kanaiwa, M A comparison of two fishery-independent survey programs used to define the population structure of American lobster (Homarus americanus) in the Gulf of Maine. Fishery Bulletin US, 104: Cochran, W. G Sampling Techniques, 3rd edn. John Wiley and Sons, New York. 900 pp. Diggle, P. J., and Lophaven, S. O Bayesian geostatistical design. Scandinavian Journal of Statistics, 33: Diggle, P. J., and Ribeiro, P. J Model-based Geostatistics. Springer, New York. 230 pp. Domburg, P., de Gruijter, J. J., and van Beek, P Designing efficient soil survey schemes with a knowledge-based system using dynamic programming. Geoderma, 75: Grabowski, R., Windholz, T., and Chen, T Estimating exploitable stock biomass for the Maine green sea urchin (Strongylocentrotus droebachiensis) fishery using a spatial statistics approach. Fishery Bulletin US, 103: Hansen, M. H., Madow, W. G., and Tepping, B. J An evaluation of model-dependent and probability-sampling inferences in sample surveys. Journal of the American Statistical Association, 78: Harbitz, A., and Aschan, M A two-dimensional geostatistic method to simulate the precision of abundance estimates. Canadian Journal of Fisheries and Aquatic Sciences, 60: Harbitz, A., Aschan, M., and Sunnanå, K Optimal effort allocation in stratified, large area trawl surveys, with application to shrimp surveys in the Barents Sea. Fisheries Research, 37: Isaaks, E. H., and Srivastava, R. M An Introduction to Applied Geostatistics. Oxford University Press, New York. 561 pp. Jardim, E., and Ribeiro, J. P Geostatistical assessment of sampling designs for Portuguese bottom trawl surveys. Fisheries Research, 85: Kirkpatrick, S., Gelatt, C. D., and Vecchi, M. P Optimization by simulated annealing. Science, 220: Lark, R. M Optimized spatial sampling of soil for estimation of the variogram by maximum likelihood. Geoderma, 105: Liu, Y., Cheng, J., and Chen, X Studies on the seasonal distribution of Setipinna taty in the East China Sea. Marine Fisheries Research, 27: 1 6. Liu, Y., Cheng, J., and Li, S A study on the distribution of Setipinna taty in the East China Sea. Marine Fisheries, 26: Manly, B. F., Akroyd, J. A., and Walshe, K. A Two-phase stratified random surveys on multiple populations at multiple locations. New Zealand Journal of Marine and Freshwater Research, 36: Marchant, B., and Lark, R Optimized sample schemes for geostatistical surveys. Mathematical Geology, 39: McBratney, A. B., and Webster, R The design of optimal sampling schemes for local estimation and mapping of regionalized variables. 2. Program and examples. Computers and Geosciences, 7: Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21: Miller, T. J., Skalski, J. R., and Ianelli, J. N Optimizing a stratified sampling design when faced with multiple objectives. ICES Journal of Marine Science, 64: Muller, W. G Collecting Spatial Data: Optimum Design of Experiments for Random Fields, 2nd edn. Contributions to Statistics. Physica, Heidelberg. 196 pp. Muller, W. G., and Zimmerman, D. L Optimal designs for variogram estimation. Environmetrics, 10: Paloheimo, J. E., and Chen, Y Estimating fishing mortality and cohort sizes. Canadian Journal of Fisheries and Aquatic Sciences, 53: Petitgas, P Geostatistics for fish stock assessments: a review and an acoustic application. ICES Journal of Marine Science, 50: Petitgas, P Geostatistics in fisheries survey design and stock assessment: models, variances and applications. Fish and Fisheries, 2: R Development Core Team R: a Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN , Ribeiro, P. J., and Diggle, P. J geor: a package for geostatistical analysis. R News, 1: Ripley, B. D Spatial Statistics. John Wiley and Sons, New York. 252 pp. Rivoirard, J., Simmonds, E. J., Foote, K. G., Fernandes, P., and Bez, N Geostatistics for Estimating Fish Abundance. Blackwell Science, Oxford. 206 pp. Royle, J. A., and Nychka, D An algorithm for the construction of spatial coverage designs with implementation in SPLUS. Computers and Geosciences, 24: Russo, D Design of an optimal sampling network for estimating the variogram. Soil Science Society of America Journal, 48: Schlather, M Simulation and analysis of random fields. R News, 1: Simbahan, G. C., and Dobermann, A Sampling optimization based on secondary information and its utilization in soil carbon mapping. Geoderma, 133: Simmonds, E. J., and Fryer, R. J Which are better, random or systematic acoustic surveys? A simulation using North Sea herring as an example. ICES Journal of Marine Science, 53: Stein, A., and Ettema, C An overview of spatial sampling procedures and experimental design of spatial studies for ecosystem comparisons. Agriculture, Ecosystems and Environment, 94: Taylor, J. R An Introduction to Error Analysis: the Study of Uncertainties in Physical Measurements, 2nd edn. University Science Books, Sausalito, CA, USA. 327 pp. Thompson, S. K Sampling. John Wiley and Sons, New York. 343 pp. van Groenigen, J., Siderius, W., and Stein, A Constrained optimisation of soil sampling for minimisation of the kriging variance. Geoderma, 87: van Groenigen, J. W., and Stein, A Constrained optimization of spatial sampling using continuous simulated annealing. Journal of Environmental Quality, 27: Warrick, A. W., and Myers, D. E Optimization of sampling locations for variogram calculations. Water Resources Research, 23: Wood, A. T., and Chan, G Simulation of stationary Gaussian processes in [0,1] d. Journal of Computational and Graphical Statistics, 3: doi: /icesjms/fsp157

Bayesian Geostatistical Design

Johns Hopkins University, Dept. of Biostatistics Working Papers 6-5-24 Bayesian Geostatistical Design Peter J. Diggle Medical Statistics Unit, Lancaster University, UK & Department of Biostatistics, Johns