Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are:

Similar documents
Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

Introduction to capture-markrecapture

Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1

Capture-Recapture Analyses of the Frog Leiopelma pakeka on Motuara Island

Alex Zerbini. National Marine Mammal Laboratory Alaska Fisheries Science Center, NOAA Fisheries

Assessing Fishery Condition: Population Estimation

Akaike Information Criterion to Select the Parametric Detection Function for Kernel Estimator Using Line Transect Data

Separating Components of the Detection Process With Combined Methods: An Example With Northern Bobwhite

Cormack-Jolly-Seber Models

BIOL 217 ESTIMATING ABUNDANCE Page 1 of 10

LINE TRANSECT SAMPLING FROM A CURVING PATH

To hear the seminar, dial (605) , access code

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Incorporating Boosted Regression Trees into Ecological Latent Variable Models

Large scale wildlife monitoring studies: statistical methods for design and analysis

ATLANTIC-WIDE RESEARCH PROGRAMME ON BLUEFIN TUNA (ICCAT GBYP PHASE ) ELABORATION OF DATA FROM THE AERIAL SURVEYS ON SPAWNING AGGREGATIONS

Mark-Recapture. Mark-Recapture. Useful in estimating: Lincoln-Petersen Estimate. Lincoln-Petersen Estimate. Lincoln-Petersen Estimate

Non-uniform coverage estimators for distance sampling

Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit

Letters to the Editor

Darryl I. MacKenzie 1

Estimating rates of local extinction and colonization in colonial species and an extension to the metapopulation and community levels

Design and Analysis of Line Transect Surveys for Primates

A Comparison of Methods for Estimating Northern Bobwhite Covey Detection Probabilities

Design and Analysis of Line Transect Surveys for Primates

Improving Estimates of Abundance by Aggregating Sparse Capture-Recapture Data

ANALYSIS OF WILDLIFE ABUNDANCE ESTIMATION METHODS USING REAL AND SIMULATED DATA

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Double Kernel Method Using Line Transect Sampling

Multiple regression and inference in ecology and conservation biology: further comments on identifying important predictor variables

7.2 EXPLORING ECOLOGICAL RELATIONSHIPS IN SURVIVAL AND ESTIMATING RATES OF POPULATION CHANGE USING PROGRAM MARK

Possible Questions from Students Test #1

A Wildlife Simulation Package (WiSP)

Biometrics Unit and Surveys. North Metro Area Office C West Broadway Forest Lake, Minnesota (651)

Stopover Models. Rachel McCrea. BES/DICE Workshop, Canterbury Collaborative work with

EFFICIENT ESTIMATION OF ABUNDANCE FOR PATCHILY DISTRIBUTED POPULATIONS VIA TWO-PHASE, ADAPTIVE SAMPLING

ESTIMATING DETECTION PROBABILITY PARAMETERS FOR PLETHODON SALAMANDERS USING THE ROBUST CAPTURE RECAPTURE DESIGN

Sampling. Where we re heading: Last time. What is the sample? Next week: Lecture Monday. **Lab Tuesday leaving at 11:00 instead of 1:00** Tomorrow:

THE JOLLY-SEBER MODEL AND MAXIMUM LIKELIHOOD ESTIMATION REVISITED. Walter K. Kremers. Biometrics Unit, Cornell, Ithaca, New York ABSTRACT

Introduction to Occupancy Models. Jan 8, 2016 AEC 501 Nathan J. Hostetter

Sampling Strategies for the Assessment of Ecological Diversity

A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING

Why do we Care About Forest Sampling?

Models for Estimating Abundance from Repeated Counts of an Open Metapopulation

The Scenario. From today s lecture (and readings!) you should be able to : Steps in Conducting an Assessment using Inventory and Monitoring

Modeling Fish Assemblages in Stream Networks Representation of Stream Network Introduction habitat attributes Criteria for Success

CHAPTER 20. Density estimation... Jake Ivan, Colorado Parks and Wildlife

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

RESERVE DESIGN INTRODUCTION. Objectives. In collaboration with Wendy K. Gram. Set up a spreadsheet model of a nature reserve with two different

Lecture 5: Sampling Methods

Accuracy or Precision: Implications of Sample Design and Methodology on Abundance Estimation

Integrating mark-resight, count, and photograph data to more effectively monitor non-breeding American oystercatcher populations

John Erb, Minnesota Department of Natural Resources, Forest Wildlife Research Group

Detection Probability and Sources of Variation in White- Tailed Deer Spotlight Surveys

Predictive Thresholds for Plague in Kazakhstan

Disentangling spatial structure in ecological communities. Dan McGlinn & Allen Hurlbert.

LECTURE 1: Introduction and Brief History of Population Ecology

Metapopulation modeling: Stochastic Patch Occupancy Model (SPOM) by Atte Moilanen

Results of the 2012 Range-wide Survey of Lesser Prairie-chickens (Tympanuchus pallidicinctus)

The Ecology and Acoustic Behavior of Minke Whales in the Hawaiian and Pacific Islands

Finding patterns in nocturnal seabird flight-call behaviour

Bridging the two cultures: Latent variable statistical modeling with boosted regression trees

Technical Report HCSU-035

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Sampling Populations limited in the scope enumerate

FW Laboratory Exercise. Program MARK: Joint Live Recapture and Dead Recovery Data and Pradel Model

Multiple covariate distance sampling (MCDS)

Curriculum Vitae: DARRYL I. MACKENZIE. Proteus Wildlife Research Consultants PO Box 5193 Dunedin, New Zealand

IMPROVING INFERENCES IN POPULATION STUDIES OF RARE SPECIES THAT ARE DETECTED IMPERFECTLY

Estimating the unobservable moose - converting index to population size using a Bayesian Hierarchical state space model

Palila Abundance estimates and trend

Mark-recapture with identification errors

Site Occupancy Models with Heterogeneous Detection Probabilities

Lecture 09 - Patch Occupancy and Patch Dynamics

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

Abstract. time-of-detection methods. We provide a general model of detection where the total

MULTISITE MARK-RECAPTURE FOR CETACEANS: POPULATION ESTIMATES WITH BAYESIAN MODEL AVERAGING

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Analysis of 2005 and 2006 Wolverine DNA Mark-Recapture Sampling at Daring Lake, Ekati, Diavik, and Kennady Lake, Northwest Territories

REVISION: POPULATION ECOLOGY 01 OCTOBER 2014

Managing Uncertainty in Habitat Suitability Models. Jim Graham and Jake Nelson Oregon State University

Closed population capture-recapture models

Unmodeled observation error induces bias when inferring patterns and dynamics of species occurrence via aural detections

Generalized Linear Models (GLZ)

CHAPTER 15. The robust design Decomposing the probability of subsequent encounter

Modeling co-occurrence of northern spotted and barred owls: Accounting for detection probability differences

Transitivity a FORTRAN program for the analysis of bivariate competitive interactions Version 1.1

The effect of line-transect placement in a coastal distance sampling survey

AP Statistics Cumulative AP Exam Study Guide

Animal pests: distance sampling for Bennett s wallaby

Estimating abundance of unmarked animal populations: accounting for imperfect detection and other sources of zero inflation

Marine Mammal and Turtle Division Southwest Fisheries Science Center

Bryan F.J. Manly and Andrew Merrill Western EcoSystems Technology Inc. Laramie and Cheyenne, Wyoming. Contents. 1. Introduction...

Population Ecology. Study of populations in relation to the environment. Increase population size= endangered species

Day 8: Sampling. Daniel J. Mallinson. School of Public Affairs Penn State Harrisburg PADM-HADM 503

Ecological applications of hidden Markov models and related doubly stochastic processes

KULLBACK-LEIBLER INFORMATION THEORY A BASIS FOR MODEL SELECTION AND INFERENCE

Ecology Regulation, Fluctuations and Metapopulations

Double-Observer Line Transect Surveys with Markov-Modulated Poisson Process Models for Animal Availability

Transcription:

Why Sample? Often researchers are interested in answering questions about a particular population. They might be interested in the density, species richness, or specific life history parameters such as survival or fecundity. They often want to know how the population is responding to particular management actions. It is usually not feasible to make a complete census of a population because of time and budget constraints. Therefore, a sample of the population is used to make inferences about the whole population. The goal of this type of sampling is to collect data that are representative of the entire population of interest. Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are: 1) Well-defined, specific objectives that allow you to answer your question(s) of interest. 2) A probabilistic sampling framework to account for spatial variability. 3) A sampling method that allows you to estimate detection probabilities (i.e. the proportion of individuals present, but not detected during your survey). 4) Sufficient sample sizes to make statistically valid inferences about your population. Well-defined Objectives: It is important that your sampling design address the questions of interest. The first step is to explicitly state your objectives and hypotheses, and then tailor your sampling framework around those specific questions. For instance: it would not make sense to ask questions about broad habitat associations of birds but only sample a small number of locations. A Probabilistic Sampling Framework: It is virtually impossible to collect information on every individual in a population. With this in mind, you need to design a sampling framework that is representative of the entire population. Historically, many researchers simply used their experience to select representative sampling sites, or they simply selected sites that were logistically convenient, for example along roads or trails. We now understand that this approach results in biased estimates and inappropriate inferences about populations. A large branch of biostatistics has developed to address these types of problems using probabilistic sampling designs. Several representative approaches include: Simple random sampling all sampling units are independent of one another and have equal probability of being selected. Systematic sampling sampling units allocated uniformly over area of interest or according to some other systematic method.

Stratified random sampling sampling units are first stratified by homogenous features (e.g., habitat) and then sample units are randomly selected within each strata. A probabilistic sampling design will reduce sampling bias caused by spatial variation. For example, if our parameter of interest is population size (N), a probabilistic sampling design will allow us to adjust our estimate of N ( Nˆ ) by the amount of area sampled. If M = total area of interest, and m = the area actually sampled in area of interest Then, m/m = the proportion of the total area sampled Thus: If C = our count of individuals in the area sampled Then our population estimate Nˆ = C/(m/M) Detectability: As we saw in lab last week, it is very important when you are sampling populations to account for individuals present, but not detected. We do this by estimating detection probabilities for the parameters of interest. We will illustrate this with three common methods of estimating detection probabilities with avian count data: 1) Distance Sampling estimates detection probability as a function of distance from the observer. The method is applicable to both transect and point sampling. 2) Independent Observer Method uses mark-recapture theory to estimate detection probability for observer conducting point counts. 3) Patch Occupancy Models estimate the probability that a patch is occupied by a given species. The method is applicable to a variety of sampling methods.. Sample Size Requirements: All of the methods that we use are statistical in nature, so it is important to ensure that your sample sizes are sufficient to make statistically valid inferences about your population. Sample size requirements will vary depending on your objectives, hypotheses, and sampling methods (see references), but generally speaking, a sample of at least 30 is required for testing most hypotheses. This requirement is often difficult to meet in ecological studies, thus it is very important to consider sample size requirements when you are formulating the objectives of your study. When it is not feasible to obtain sufficient independent samples there are several options to consider: 1) Use representative areas or habitats that are larger, but have the same characteristics as the habitat or area of interest. 2) Relax statistical assumptions (e.g. independence amongst sampling units), but acknowledge how this compromise might have weakened your conclusions.

3) Change your study plan to address a more feasible question. Methods Of Estimating Detection Probability: Distance Sampling: Distance sampling is a unified approach that incorporates sampling design, data collection and statistical theory to estimate density and detection probability as a function of the distance between observers and the animals they are counting (Buckland et al. 2001). Distance sampling is used for line transects, point transects (point counts), cluster counts or cue counts (e.g. marine mammals breaching the water). The probability of detection is modeled by the function g(r) where r is the radial distance of the subject of interest from the observer, such that as distance from the observer increases the detection probability decreases (Fig. 1). 1 g(r) Detection Probability 0 Distance (r) Figure 1. General detection function, g(r), where detection probability decreases as distance increases. Statistical inference based on distance sampling relies on three basic assumptions: 1. Objects directly on the line or point are always detected (they are detected with probability 1, or g(0) = 1). 2. Objects are detected at their initial location, prior to any movement in response to the observer. 3. Distances are measured accurately. In addition to these three explicit assumptions, there are several quasi-assumptions that relate study design to the effectiveness of distance sampling: 1) the survey should follow a probability based sampling plan such as random sampling or systematic random sampling,

2) the point locations should represent the area of interest and therefore be located in homogenous, uniform habitat away from edges and other sampling points, 3) the objects of interest are spatially distributed in the study area according to a natural process. Program Distance (Thomas et al. 1997) was developed to analyze data collected from distance sampling. Program Distance estimates density as: Dˆ = n/kπ r 2 x pˆ, where Dˆ is the density estimate, n is the number of objects counted, π r 2 is the area of the point, k is the number of points and pˆ is the estimated detection probability. The detection probability is modeled by using one of four possible functions, called "key functions." These functions (uniform, half-normal, negative exponential, and hazardrate) are simply different models of how detection probability changes with distance from an observer (different shapes of the curve in Figure 1). The uniform model represents a constant detection probability of 1 from the center of the point to the effective detection radius. The half-normal is simply half of the bell-shaped curve from the standard normal distribution (Fig. 1). The negative exponential and hazard-rate models are similar to the half-normal except that each has a different slope and shoulder length. The shoulder refers to the portion of the curve where the detection probability is constant with increasing distance from the point. Beyond the shoulder, detection probability decreases according to the characteristics of the detection function. Key function selection is based on a visual examination of a histogram of the data. After a key function is selected, a series expansion is used to add flexibility to the model by increasing the number of estimated parameters. The number of parameters needed for the series expansion varies for each dataset. Three series expansions are available; the cosine, simple polynomial, and the hermite polynomial. All three expansions have different properties, and each modify the key function in different ways to obtain a better fit of the model to the data. A large amount of literature exists describing the properties of the key functions in combination with different series expansions; see Buckland et al. (2001) for further information. Model selection in Program Distance is based on an information-theoretic approach (Burnham and Anderson 2002) using Akaike s Information Criterion, AIC. A model is selected from an a priori set of candidate models to optimize the balance between precision and bias (see Burnham and Anderson 2002). Double Independent Observer Method: Mark-recapture methods provide another tool for estimating detection probabilities (Williams et al. 2001). Traditionally, animals are caught, marked with a unique mark, and recaptured at a later date. Resampling the population one or more times generates a capture history for each animal that can be used to estimates overall detection (i.e. capture) probabilities. General theory of mark-recapture: If n1 = Total # of animals observed and marked in the first sample

n2 = Total # animals observed in the second sample m2 = Total # of marked animals in the second sample (new captures) N = population size Then m2/n2 is approximately equal to n1/n (sample) = (population) And the estimate of the population size (N) is (n1*n2)/m2 This is called a Lincoln-Peterson closed population estimate and it relies on three basic assumptions: 1) the population is closed (no births, deaths, immigration, emigration) 2) all the animals in the population have an equal probability of capture 3) no animals lose their marks Mark-recapture concepts can be applied to avian point counts to provide estimates of population size and detection probability (p). Birds are not physically marked as they are in traditional mark-recapture studies. They are considered "marked" when they are mapped by one observer, and "recaptured" when they are also mapped by another observer. One approach is called the independent double-observer method. It requires two observers to identify and map the locations of all the birds they hear within a fixed radius (usually 100 200 meters) during a 3 5 minute count. It is important that each observer is independent and not influenced by the other observer. At the end of the count the two observers compare their mapped locations of birds detected during the count, and they match up individual birds that both observers identified (equivalent to m2 in the previous example). The method assumes that observers correctly match individual birds that they both detected. For each species: n1 = The number of individuals seen by the first observer n2 = The number of individuals seen by the second observer m2 = The number of individuals seen by both observers The estimated population size (Chapman s modification) within the fixed radius is; Nˆ = [(n1+1)*(n2+1)] -1 m2+1 and the probability of detection for each observer is pˆ 1 = m2/n2 and pˆ 2 = m2/n1

Notice that the detection probability is already incorporated into the estimate of N so it is not necessary to adjust the estimate by the detection probability. It is important to also present a measure of variation represented by the Variance of N. V ˆ( N ˆ ) = (n1+1)*(n2+1)*(n1-m2)*(n2-m2) (m2+1) 2 * (m2+2) We can use this information to compare detection probabilities for different species among observers and to estimate population size. If multiple points are sampled, the population estimates (N) for each point are added together to estimate the total population size for the study area. The variance estimates are also added together to obtain the total variance in the estimate of N. Remember to multiply the total estimate of N by the sampling proportion to obtain an estimate of the total population in your study area.. To obtain a density estimate divide by the total area, D = N/A. For example: if four 200 meter fixed radius point counts are conducted producing population estimates of 20, 19, 21, and 22, the total population estimate is 82 and the estimated density is 82 divided by the total area of the study area (4 pi *200 2 = 502654.82 meters ). In this case density would be 82/502654.82 meters = 1.631 x 10-4 birds per square meter (1.631 x 10-8 birds per hectare).. The assumptions of this method are: 1) The population is closed. 2) There are no matching errors. 3) The observers are independent. It is also possible to use this method for species richness where: n1= The number of species identified by observer 1 n2= The number of species identified by observer 2 m2= The number of species identified by both observers You can then use the same estimates for N and detection probability except that N will represent a measure of species richness and not population size. Patch Occupancy: Patch occupancy models are useful when you are interested in the proportion of the sample area that is occupied by a particular species or the temporal dynamics of species occurrence (Mackenzie et al. 2006). Occupancy modeling relies on markrecapture theory by accounting for a species present in a patch, but not detected (detection probability) Patch occupancy modeling is simpler than abundance modeling because it is based on simple presence/absence data. Because the data contain less information, inferences from patch occupancy modeling are usually more limited than those from models based on abundance estimates. We will not go into the details of

occupancy modeling. See Mackenzie 2006 and Program Presence @ http://www.mbrpwrc.usgs.gov/software.html for more information. Important References: Buckland, S. T., D. R. Anderson, K. P. Burnham, J. L. Laake, D. L. Borchers and L. Thomas. 2001. Distance Sampling: Estimating Abundance of Biological Populations. Oxford New York. Burnham, K. P., and D. R. Anderson. 2002. Model Selection and Inference: A Practical Information-Theoretic Approach. Springer-Verlag. New York. MacKenzie, D.L., Nichols, J.D., Royle, J.A., Pollock, K.H., Bailey L.L., and J.E. Hines. 2006. Occupancy Estimation and Modeling. Academic Press. San Diego, CA. Morrison, M.L., Block, W.M., Strickland, M.D., and W.L. Kendall. 2001. Wildlife Study Design. Springer-Verlag. New York. Pollock, K. H., J. D. Nichols, T. R. Simons, G. L. Farnsworth, L. L. Bailey, and J. R. Sauer. 2002. Large scale wildlife monitoring studies: statistical methods for design and analysis. Environmetrics 13: 105-119. Williams, B. K., Nichols, J. D., and M. J. Conroy. 2002. Analysis and Management of Animal Populations. Academic Press. San Diego, CA. Yoccoz, N. G., J. D. Nichols, and T. Boulinier. 2001. Monitoring of biological diversity in space and time. Trends in Ecology and Evolution. 16: 446-453. Important websites with software: There has been a considerable amount of recent work on software to perform population analysis, particularly in terms of estimation of abundance, and both survival and recruitment rates (using both capture-recapture and recovery models). The following is a list of websites that contain available programs and related information. http://www.mbr-pwrc.usgs.gov/software.html http://www.phidot.org/software/ http://www.warnercnr.colostate.edu/~gwhite/software.html