Cormack-Jolly-Seber Models

Similar documents
Introduction to capture-markrecapture

FW Laboratory Exercise. Program MARK with Mark-Recapture Data

Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit

Mark-Recapture. Mark-Recapture. Useful in estimating: Lincoln-Petersen Estimate. Lincoln-Petersen Estimate. Lincoln-Petersen Estimate

Capture-Recapture Analyses of the Frog Leiopelma pakeka on Motuara Island

FW Laboratory Exercise. Program MARK: Joint Live Recapture and Dead Recovery Data and Pradel Model

Stopover Models. Rachel McCrea. BES/DICE Workshop, Canterbury Collaborative work with

Jolly-Seber models in MARK

CHAPTER 12. Jolly-Seber models in MARK Protocol. Carl James Schwarz, Simon Fraser University A. Neil Arnason, University of Manitoba

Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Joint live encounter & dead recovery data

Closed population capture-recapture models

THE JOLLY-SEBER MODEL AND MAXIMUM LIKELIHOOD ESTIMATION REVISITED. Walter K. Kremers. Biometrics Unit, Cornell, Ithaca, New York ABSTRACT

7.2 EXPLORING ECOLOGICAL RELATIONSHIPS IN SURVIVAL AND ESTIMATING RATES OF POPULATION CHANGE USING PROGRAM MARK

Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are:

Improving the Computational Efficiency in Bayesian Fitting of Cormack-Jolly-Seber Models with Individual, Continuous, Time-Varying Covariates

Barn swallow post-fledging survival: Using Stan to fit a hierarchical ecological model. Fränzi Korner-Nievergelt Beat Naef-Daenzer Martin U.

GOODNESS OF FIT TESTS FOR OPEN CAPTURE-RECAPTURE MODELS

CHAPTER 15. The robust design Decomposing the probability of subsequent encounter

Input from capture mark recapture methods to the understanding of population biology

Estimating rates of local extinction and colonization in colonial species and an extension to the metapopulation and community levels

CHAPTER 5. Goodness of fit testing...

Continuous, Individual, Time-Dependent Covariates in the Cormack-Jolly-Seber Model

Continuous Covariates in Mark-Recapture-Recovery Analysis: A Comparison. of Methods

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

A class of latent marginal models for capture-recapture data with continuous covariates

EMBEDDING POPULATION DYNAMICS IN MARK-RECAPTURE MODELS. Jonathan R. B. Bishop

Estimating demographic parameters for capture recapture data in the presence of multiple mark types

BASIC RANDOM EFFECTS MODELS FOR CAPTURE-RECAPTURE DATA

State-space modelling of data on marked individuals

Using POPAN-5 to Analyse Banding Data.

An Extension of the Cormack Jolly Seber Model for Continuous Covariates with Application to Microtus pennsylvanicus

BIOL 217 ESTIMATING ABUNDANCE Page 1 of 10

Advances since Program MARK Parameter Estimation from Marked Animals. PLive captures PLive resightings PDead recoveries

Chapter 12: Inference about One Population

ESTIMATING POPULATION SIZE FROM DNA-BASED CLOSED CAPTURE RECAPTURE DATA INCORPORATING GENOTYPING ERROR

The Scenario. From today s lecture (and readings!) you should be able to : Steps in Conducting an Assessment using Inventory and Monitoring

Review Article Putting Markov Chains Back into Markov Chain Monte Carlo

Continuous, Individual, Time-Dependent Covariates in the Cormack-Jolly-Seber Model

Introduction to Maximum Likelihood Estimation

A population is a group of individuals of the same species, living in a shared space at a specific point in time.

A new method for analysing discrete life history data with missing covariate values

FOR amphibians with complex life histories,

Journal of Avian Biology

Time-Invariant Predictors in Longitudinal Models

EXERCISE 14: SINGLE-SEASON, SPECIES-INTERACTIONS OCCUPANCY MODELS. In collaboration with Rebecca J. Pfeiffer and Jeremy M. Clark

JULIEN MARTIN, JAMES D. NICHOLS*, WILEY M. KITCHENS and JAMES E. HINES*

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf

Integrating mark-resight, count, and photograph data to more effectively monitor non-breeding American oystercatcher populations

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

Chapter 11. Regression with a Binary Dependent Variable

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Lior Wolf

that bias in traditional population size estimators can be substantial because misidentification is ignored, and our new estimators perform well in

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Journal of Statistical Software

ECOLOGICAL STATISTICS: Analysis of Capture-Recapture Data

A tip of the iceberg? The probability of catching cartels

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Requirements for Prospective Teachers General Science. 4.1a Explain energy flow and nutrient cycling through ecosystems (e.g., food chain, food web)

My data doesn t look like that..

Stat 534 Philip Dixon Fall References: Estimating Population Parameters: Closed and Open populations

SPRING 2007 EXAM C SOLUTIONS

12 Generalized linear models

Lecture 4 - Survival Models

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

A Primer on Statistical Inference using Maximum Likelihood

arxiv: v3 [stat.me] 29 Nov 2013

Finite mixture models in secr 3.0 Murray Efford

Maximum likelihood estimation of mark-recapture-recovery models in the presence of continuous covariates

Method of Moments. Topic Introduction. Method of moments estimation is based solely on the law of large numbers, which we repeat here:

KULLBACK-LEIBLER INFORMATION THEORY A BASIS FOR MODEL SELECTION AND INFERENCE

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

CHAPTER 20. Density estimation... Jake Ivan, Colorado Parks and Wildlife

PHP 2510 Probability Part 3. Calculate probability by counting (cont ) Conditional probability. Law of total probability. Bayes Theorem.

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2018 University of California, Berkeley

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Time-Invariant Predictors in Longitudinal Models

β j = coefficient of x j in the model; β = ( β1, β2,

Knowledge Discovery and Data Mining

A CONDITIONALLY-UNBIASED ESTIMATOR OF POPULATION SIZE BASED ON PLANT-CAPTURE IN CONTINUOUS TIME. I.B.J. Goudie and J. Ashbridge

Model Estimation Example

Large scale wildlife monitoring studies: statistical methods for design and analysis

Simon J. Bonner Department of Statistics University of Kentucky

SAS Syntax and Output for Data Manipulation:

Comparing estimates of population change from occupancy and mark recapture models for a territorial species

Mark-recapture with identification errors

Parameter Estimation and Fitting to Data

Exploring causal pathways in demographic parameter variation: path analysis of mark recapture data

Parameter Redundancy with Applications in Statistical Ecology

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Multinomial Logistic Regression Models

Estimating and Testing the US Model 8.1 Introduction

MAS3301 / MAS8311 Biostatistics Part II: Survival

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

Using POPAN-5 to analyse banding data

A nonparametric test for path dependence in discrete panel data

Lecture 7 Time-dependent Covariates in Cox Regression

Making sense of Econometrics: Basics

Transcription:

Cormack-Jolly-Seber Models Estimating Apparent Survival from Mark-Resight Data & Open-Population Models Ch. 17 of WNC, especially sections 17.1 & 17.2 For these models, animals are captured on k occasions (say k years) and given a unique mark during a relatively short tagging period (say 1 week) each year. Time periods could also be weeks, months, or multiple year intervals. After occasion 1, both marked and unmarked animals are caught; tag numbers of the marked animals are recorded and unmarked animals are marked. Animals are released back into the population; accidental deaths (losses on capture) are allowed. Mark Mark & Mark & Mark & Mark & Mark & Resight Resight Resight Resight Resight Resight Page 1 of 7

The Jolly-Seber model (after Jolly 1965 and Seber 1965) is fairly general and serves as a starting point for open C-R modeling (see White et al (1982, chapter 8). This model allows year-specific estimates of apparent survival (φ), capture probability (p), population size (N), and the number of new individuals entering the population (B). While population size can be estimated, it is often very difficult to avoid substantial bias in the estimation of this parameter set because of individual heterogeneity and other issues. We will discuss the actual Jolly-Seber model later in the course but begin our work with discussion of the Cormack-Jolly-Seber model, which is a restricted model that allows only year-specific estimates of φ and p. Although less general, it has proven to be the more useful model for several reasons. One important biological issue is that only apparent survival can be estimated with CJS modeling; that is 1 φ represents both animals that died and animals that merely left the population (emigration). In general, φ < S. This can be a significant matter and often misunderstood. In open C-R studies, all sampling is done by researchers on a relatively small area. That is, animals are marked on a study area each year for at least 3 occasions, e.g., years. Recaptures are only made for those animals that come back to the study area where capturing is being conducted. Thus, an animal that comes back to the same general area may not be recaptured if it is a mile or two away from the capture site. This issue can be very problematic with species that move considerable distances during the course of a year. Still, there are often cases where there is biological interest in φ and the fact that some animals merely left" is not problematic to the interpretation of the data. Note: data can be collected by recapturing or resighting animals. The approach we ll use conditions on the initial capture of an animal and models its subsequent capture history as functions of parameters associated with sampling (p) and real population change (φ). To get started and in keeping with section 17.1 of WNC, we ll first consider single-age models and the standard Cormack-Jolly-Seber model (CJS) in which φ and p are time-specific. Data Structure for study with 3 occasions (LLL format = 1 if seen alive on occasion, 0 otherwise): Capture History Number of animals with history 100 89 110 41 101 16 111 19 010 75 011 37 001 82 Note: for 3 occasions, there are 7 observable histories (000 is not seen). Model Structure: Known Constants: Page 2 of 7

R i - The number of animals released in year i. These releases are typically made up of releases of newly captured and marked animals AND re-releases of re-captured animals that were marked on earlier occasions. Parameters: φ i is the probability that a marked animal in the study population at sampling period i survives until period i+1 and remains in the population (does not permanently emigrate). Thus, apparent survival in year i relates to the interval between resighting (or capture) periods i and i+1. p i is the probability that a marked animal in the study population at sampling period i is captured or observed during period i. χ i is the probability that an animal alive and in the study population at sampling period i is not caught or observed again at any sampling period after period i. For a study with T sampling periods, χ T = 1, and values for periods with i<t can be obtained recursively as: i (1 i ) i (1 pi 1) i 1 Assumptions (we will later learn how to change some of these): 1. Every marked animal present in the population at sampling period i has the same probability p i of being captured or resighted. 2. Every marked animal present in the population at sampling period i has the same probability φ i of survival until sampling period i+1. 3. Marks are neither lost nor overlooked and are recorded correctly. 4. Sampling periods are instantaneous (in reality they are very short periods) and recaptured animals are released immediately. 5. All emigration from the sampled area is permanent. 6. The fate of each animal with respect to capture and survival probability is independent of the fate of any other animal. Model Structure Illustrations: Pr(111 release at period 1) = 1 p2 2 p3 Pr(110 release at period 1) = 1 p2 x2, where x2 (1 2) 2 (1 p3) Pr(101 release at period 1) = 1 (1 p2) 2 p3 Pr(100 release at period 1) = χ i, where 1 (1 1) 1 (1 p2)(1 2 p3) Pr(011 release at period 2) = 2 p3 Page 3 of 7

Competing models: When individual covariates are not considered, the models that can be considered for 1 group of animals within an identifiable class of animals are: 1. ( t), p( t) 2. ( t), p(.) 3. (.), pt ( ) 4. (.), p(.) Intuition on Estimation: Capture History Number of animals with history 100 176 110 11 101 239 111 74 Model ( t), p( t) : pˆ 2 0.236421 and 1ˆ 0.719054 How do the histories and their frequencies provide information on these 2 parameters? First, consider the histories 101 and 111. Animals with these histories are known to have survived from year 1 to year 2 (and also to have survived to year 3). There are 313 such animals (239 with 101 history plus 74 with 111 history) and 74 of those 313 were detected in year 2. And, 74/313=0.236421. Thus, because we have information from the third occasion, we can separately estimate the survival and recapture rates φ 1 and p 2 respectively. We can see how the probability statements show this: N111 1 p22 p3 p2 N (1 p ) p 1 p 101 1 2 2 3 2 = 74 74 ; 74 74p = 239p; 74 = 313p; p = = 0.236421 239 313 What about 1ˆ? Well, 1p2is the probability that an animal survives from year 1 to year 2 AND is detected in year 2 (it s referred to as a return rate in some literature and isn t all that useful by itself). But, given an estimate of p 2, we could then obtain an estimate of 1. Of the 500 individuals released on occasion 1, how many returned on occasion 2 (survived and captured)? To get this number add 11 and 74 (11 110 s and 74 111 s) to obtain 85. Thus, 1p2= (74+11)/500=0.1700. And, 0.1700/0.236421 = 0.719054. Page 4 of 7

Quoting from Chapter 4 of C&W (Addendum counting parameters): But, it is important to note that we can t separately estimate all the parameters. Consider for instance φ 2 and p 3. Can we separate them? No! In fact, the product of these two parameters is completely analogous to a return rate between occasions 2 and 3. If we wanted to separate these 2 parameters, we d need a fourth occasion, and so on. Thus, in such a model where both survival and recapture rate are time-dependent, the terminal parameters are not individually identifiable - all we can do is estimate the product of the 2. Lebreton et al. (1992) refer to this product term as β 3. Thus, we can re-write our table, and the probability statements, as: It turns out that we can estimate φ 1, p 2, and (φ 2 p 3 ) in the ( t), p( t) model. If we constrain parameters (as we do in the other 3 models), then we can avoid that terminal product. For example, the ( t), p(.) model allows us to estimate φ 1 and φ 2 because we now assume that p is constant across occasions. Our actual modeling is done using maximum likelihood estimation and a multinomial distribution where each of the encounter histories is a possible outcome. Multinomial Distribution: N! x1 x2 x P( X { x, x,..., x }) p p... p c 1 2 c 1 2 ( x1!)( x2!)...( xc!) c Example from page 35 of WNC:4 5! f ( x1, x2 N 5, p1.6, p2.38, p3 (1.6.38)) (0.6) (0.38) (0.02) ( x!)( x!)( x!) 1 2 3 x1 x2 5x1 x2 dmultinom(x=c(5,0,0),prob=c(.6,.38,.02)) # = 0.07776 = probability of 5 completed passes, # 0 incomplete passes, and 0 intercepted passes Page 5 of 7

dmultinom(x=c(4,1,0),prob=c(.6,.38,.02)) # = 0.24624 Maximum likelihood for the CJS model using the encounter histories From page 421 of WNC: R! 1 x111 x110 Pr({ x } R ) ( p p ) [ p (1 p )] 1 1 2 2 3 1 2 2 3 x! [ (1 p ) p ] ( x ) x 1 2 2 3 1 x 101 100 As for known-fate modeling, maximum likelihood finds that combination of the parameters that maximizes the likelihood of observing the set of encounter histories according to the frequencies at which they were observed in the study. MLE s of the parameters, their variances, and their covariances are available of the parameters, their variances, and their covariances are available. Different models can be evaluated that constrain the parameters in various ways. And, those models can include sub-models that allow the various parameters to be functions of covariates. AICc can be used to evaluate the competing models of interest for the problem at hand. Animals that aren t captured on the 1 st occasion are useful and the likelihood can be adjusted to handle them just fine. And, animals can be removed if desired without problem as discussed in Ch. 17 of WNC. Page 6 of 7

Input format for MARK (live recaptures or LLLL format): For Program MARK, the EH for each animal can be entered on 1 row, e.g., 1101 1; 1110 1; 1001 1; etc. OR, you can use summary formatting to indicate EH s for 112 animals. This is obviously much more efficient than writing out 112 lines; 12 repeats of the 1st line, 20 repeats of the 2nd line, etc.. 1111 12; 1011 20; 1000 80; You can also indicate multiple groups this way. 1111 12 15; 1011 20 32; 1000 80 101; And, you can include individual covariates. So, this type of modeling is quite useful IF the biology of the animal makes φ a biologically meaningful parameter for the study in question. Background reading: Ch. 17 WNC Ch. 3 & 4 CW Lebreton, J. D., K. P. Burnham, J. Clobert, and D. R. Anderson. 1992. Modeling Survival and Testing Biological Hypotheses Using Marked Animals - A Unified Approach with Case- Studies. Ecological Monographs 62:67-118. - not required for class but excellent background for those using these models in their work. Page 7 of 7