THE JOLLY-SEBER MODEL AND MAXIMUM LIKELIHOOD ESTIMATION REVISITED. Walter K. Kremers. Biometrics Unit, Cornell, Ithaca, New York ABSTRACT

Similar documents
Mark-Recapture. Mark-Recapture. Useful in estimating: Lincoln-Petersen Estimate. Lincoln-Petersen Estimate. Lincoln-Petersen Estimate

Cormack-Jolly-Seber Models

Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1

GOODNESS OF FIT TESTS FOR OPEN CAPTURE-RECAPTURE MODELS

Introduction to capture-markrecapture

Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit

Stopover Models. Rachel McCrea. BES/DICE Workshop, Canterbury Collaborative work with

A CONDITIONALLY-UNBIASED ESTIMATOR OF POPULATION SIZE BASED ON PLANT-CAPTURE IN CONTINUOUS TIME. I.B.J. Goudie and J. Ashbridge

Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are:

Capture-Recapture Analyses of the Frog Leiopelma pakeka on Motuara Island

UNIVERSITY OF NOTTINGHAM. Discussion Papers in Economics CONSISTENT FIRM CHOICE AND THE THEORY OF SUPPLY

7.2 EXPLORING ECOLOGICAL RELATIONSHIPS IN SURVIVAL AND ESTIMATING RATES OF POPULATION CHANGE USING PROGRAM MARK

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Improving the Computational Efficiency in Bayesian Fitting of Cormack-Jolly-Seber Models with Individual, Continuous, Time-Varying Covariates

Package SPECIES. R topics documented: April 23, Type Package. Title Statistical package for species richness estimation. Version 1.

Jolly-Seber models in MARK

FW Laboratory Exercise. Program MARK: Joint Live Recapture and Dead Recovery Data and Pradel Model

CHAPTER 15. The robust design Decomposing the probability of subsequent encounter

Estimating demographic parameters for capture recapture data in the presence of multiple mark types

Review Article Putting Markov Chains Back into Markov Chain Monte Carlo

An Extension of the Cormack Jolly Seber Model for Continuous Covariates with Application to Microtus pennsylvanicus

REGRESSION WITH CORRELATED ERRORS C.A. GLASBEY

CHAPTER 12. Jolly-Seber models in MARK Protocol. Carl James Schwarz, Simon Fraser University A. Neil Arnason, University of Manitoba

BIOL 217 ESTIMATING ABUNDANCE Page 1 of 10

A Simulation Experiment on the Application of the Jackknife with Jolly's Method for the Analysis of Capture-Recapture Data

Modern Methods of Data Analysis - WS 07/08

Log-linear multidimensional Rasch model for capture-recapture

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

A simulation study for comparing testing statistics in response-adaptive randomization

Cluster Validity. Oct. 28, Cluster Validity 10/14/ Erin Wirch & Wenbo Wang. Outline. Hypothesis Testing. Relative Criteria.

COUNTING NUMERICAL SEMIGROUPS BY GENUS AND SOME CASES OF A QUESTION OF WILF

Censoring mechanisms

MAXIMUM LIKELIHOOD ESTIMATION IN A MULTINOMIAL MIXTURE MODEL. Charles E. McCulloch Cornell University, Ithaca, N. Y.

Definitions and Proofs

Continuous, Individual, Time-Dependent Covariates in the Cormack-Jolly-Seber Model

FW Laboratory Exercise. Program MARK with Mark-Recapture Data

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

The Petersen Lincoln Estimator and its Extension to Estimate the Size of a Shared Population

Mark-recapture with identification errors

EMBEDDING POPULATION DYNAMICS IN MARK-RECAPTURE MODELS. Jonathan R. B. Bishop

Quantitative Empirical Methods Exam

Statistics: Learning models from data

Sequences of Real Numbers

Estimation of the Exponential Distribution based on Multiply Progressive Type II Censored Sample

Barn swallow post-fledging survival: Using Stan to fit a hierarchical ecological model. Fränzi Korner-Nievergelt Beat Naef-Daenzer Martin U.

Without Loss of Generality

Existence of Optimal Strategies in Markov Games with Incomplete Information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Continuous Covariates in Mark-Recapture-Recovery Analysis: A Comparison. of Methods

1. Introduction to commutative rings and fields

Using POPAN-5 to Analyse Banding Data.

A General Overview of Parametric Estimation and Inference Techniques.

07-PS4-2. Develop and use a model to describe that waves are reflected, absorbed, or transmitted through various materials.

Continuous, Individual, Time-Dependent Covariates in the Cormack-Jolly-Seber Model

Closed population capture-recapture models

Capture-Recapture and Removal Methods for Sampling Closed Populations

In Chapters 3 and 4 we introduced linear programming

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

Statistical Inference with Monotone Incomplete Multivariate Normal Data

More on Estimation. Maximum Likelihood Estimation.

A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models

Miscellanea Kernel density estimation and marginalization consistency

Sb1) = (4, Az,.-., An),

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Maximum-Likelihood Estimation: Basic Ideas

We want to show P (n) is true for all integers

ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation

2017 VCE Specialist Mathematics 2 examination report

Large scale wildlife monitoring studies: statistical methods for design and analysis

Using POPAN-5 to analyse banding data

DS-GA 1002 Lecture notes 11 Fall Bayesian statistics

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That

Ahlswede Khachatrian Theorems: Weighted, Infinite, and Hamming

Capture-Recapture for Human Populations

CRISP: Capture-Recapture Interactive Simulation Package

ECONOMET RICS P RELIM EXAM August 19, 2014 Department of Economics, Michigan State University

Math Lab- Algebra II Pacing Guide Quarter 3. Unit 1: Rational and Irrational Numbers, Exponents and Roots

IMO 2007, Problem 1. Tom Verhoeff. 4 August 2007

Approach to Field Research Data Generation and Field Logistics Part 1. Road Map 8/26/2016

STAT331. Cox s Proportional Hazards Model

1 Hypothesis Testing and Model Selection

NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or

COMPETING RISKS WEIBULL MODEL: PARAMETER ESTIMATES AND THEIR ACCURACY

Quantum Density Matrix and Entropic Uncertainty*

On Likelihoodism and Intelligent Design

A NICE PROOF OF FARKAS LEMMA

Method of Moments. Topic Introduction. Method of moments estimation is based solely on the law of large numbers, which we repeat here:

SMT 2013 Power Round Solutions February 2, 2013

Lecture Notes on Heyting Arithmetic

DEPARTMENT OF STATISTICS AND OPERATIONS RESEARCH OPERATIONS RESEARCH DETERMINISTIC QUALIFYING EXAMINATION. Part I: Short Questions

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

DERIVATION OF THE PROBABILITY MASS FUNCTION OF A PURE BIRTH PROCESS FROM A DEATH PROCESS.

N = total number of individuals = 8 r = number who had internships ( Class 1 ) = 5 N r = number who had student teaching ( Class 2 ) = 3.

Explicit evaluation of the transmission factor T 1. Part I: For small dead-time ratios. by Jorg W. MUller

A class of latent marginal models for capture-recapture data with continuous covariates

Eco517 Fall 2014 C. Sims FINAL EXAM

Assessing Fishery Condition: Population Estimation

AN APPLICATION OF GOODNESS OF FIT TO A MULTINOMIAL MODEL WITH A. Douglas S. Robson Biometrics Unit, Cornell University, Ithaca, NY 14853

Transcription:

THE JOLLY-SEBER MODEL AND MAXIMUM LIKELIHOOD ESTIMATION REVISITED BU-847-M by Walter K. Kremers August 1984 Biometrics Unit, Cornell, Ithaca, New York ABSTRACT Seber (1965, B~ometr~ka 52: 249-259) describes a model for a tagrecapture study allowing birth, death, and migration, and derives the likelihood for the model. He then derives supposed maximum likelihood estimates (MLE's) for the parameters of the model. Here we show that in general his estimates are not MLE's. Conditions are given for his estimates to be MLE's and we argue for the use of his estimates over the true MLE's.

1. INTRODUCTION The ability to estimate population parameters has advanced greatly since the classic papers of Jolly (1965) and Seber (1965). Not only did these papers propose a useful stochastic model for biologists in the estimation of population parameters but they also introduced a general methodology for model building for estimation of population parameters. The estimates given by Seber and by many works to follow, such as those of Buckland (1980) and Pollock (1981), are based upon the well-founded theory of maximum likelihood estimates (MLE's). One difficulty of the estimates in the above works though is that they are not MLE's. We show this explicitly for Seber's estimates and similar arguements apply to the estimates of Buckland and Pollock. The discussion of Seber's model and estimation applies in general to tag-recapture statistics when the objective is to estimate population size, with capture and survival probabilities unknown. We also discuss why we might not want to use MLE's for population size when capture and survival probabilities are unknown. We offer justification for the estimates proposed by Seber, and describe a modification with which Seber's estimates of survival and capture probabilities can be derived through the likelihood approach. 2. THE JOLLY-SEBER MODEL AND ESTIMATION The Jolly-Saber model allows for a total of s samples or capture periods. For sample i, i l,,s, each animal of the sample population is assumed to be captured independently of all other animals with a probability of Pi after which the animal is immediately identified and returned to the population. Each animal alive immediately after sample i is assumed to survive until the next sample with the probability 'i independently of all other animals. With notation the same as Seber's let,

-2- ai = number of animals captured in sample i, a(i = number of different animals captured before sample i, a>i = number of different animals captured after sample i, a<i i = a<i + ai - a>i i a>i + ai - a<i+1 a)i-1 m 1 population size at the time of sample 1, mi - mi-l net increase in the number of unmarked animals between samples i-1 and i, for i 2,.,s, qi 1-pi ai q,iqi+l ~i... q,ipi+l from which we find the likelihood, L L[{mi},p 1,{ai,ai}], is proportional to ns {[(mi-a<i)!/(mi-a<i+1)!] pia<i+1-a<i qimi-a<i+1} ns-1{ ai-a)f i D a(i+1 i+l bi+l} X Xi Pi <Xi In general let 9 denote the MLE of the parameter e. Seber equates the difference equations of the form L[mi]- L[m 1-1] with zero and concludes that, However all that may be safely concluded is that m 1 is less than the quantity to the right of the equals sign and m.+l is greater than the l. quantity to the right. Using the difference equation improperly Seber claims that the partial derivatives with respect to ai and ai respectively are bi+l + ~i {(aj-a)j j)&j &i(xi+l-1)/xj} O for i=1,.,s-1 and, (1) a<i+1 i+l- r 1 {(aj-a>j j)aj.. ai-lbi/xj} O i=l,.,s-1. (2)

-3- However most properly 61/&~i z {mi- a<i+1 - <a<i+1 -a<i>ai-1/bi-1}/{(&i-l+bi-l)&i-1/ai-1} + bi+l + Li {(aj-a)j j)&j'''&i(xi+l-1)/xj} 61/oBi {-mi +a<i+l + (a<i+l -a<i)&i-1/bi-1}/(&i-l+bi-1) + a<i+1 i+l- ri {(aj-a>j j)&j'''&i-lii/xj} Since the first term in 6L/6~i is nonpositive, if we use Seber's solutions to the maximum-likelihood equations, we find that 61/6~i ~ 0. Similarly since the first term of 61/oBi is nonnegative, 61/&Bi ~ 0 when evaluated with his solutions to the maximum-likelihood equations. From this we conclude that the estimates given by Seber for the ~i and a 1 are not M1E's and hence neither are his estimates of the mi. Not only are Seber's estimates not MLE's in the technical sense, but neither are his estimates of them. assured of being within one of the true M1E's. 1 claim of Seber is that m 1 and p 1 are not separately estimable. The likelihood is however maximized in a trivial way if m 1 =a 1 and p 1 =1. Another In a similar way closed form M1E's for m 8, ~ 8 _ 1, and a 9 _ 1 are (a<s + as), 0 and a>s-1 s-1/a 8 _1 respectively. 3. SEBER'S ESTIMATES AS M1E's We now consider models for which Seber's estimates are M1E's. If in sample i, each nontagged animal is captured with the probability pi' and each tagged animal is captured with probability pi' with pi and pi unrelated, the likelihood is proportional to

-4- ns {[(m.-a(i)!/(m.-a<i+l)l] pia<i+l-a<i (1-p.)mi-a<i+l~ 1 1 1 Us-1{ ai-a)i i o a(i+l i+l bi+lt x xi pi "i r For this model 6L/6«i is given by (1), but for i 1,,s-2, and 8L/6Pi is given by (2) and Seber's estimates of ai and Pi become MLE's. Note that for this model, in a trivial way, mima<i+l and pi l, and hence this model is of little value when estimating the mi. Seber's estimates are also conditional MLE's when the likelihood is conditioned on the number of animals released into the population as tagged animals. That is, Seber's estimates can be derived from a likelihood approach if we consider the estimation of survival rates estimated solely from the capture of tagged animals 4. MAXIMUM LIKELIHOOD ESTIMATION OF POPULATION SIZE Seber's estimates of mi are often found to have a negative bias (Gilbert,1973). The partial derivatives of the likelihood of Seber's model, evaluated at his estimates, satisfy the inequalities 6L/6ai S 0 and 6L/6Pi ~ 0. Hence the MLE's of the mi are less than his estimates and may incur an even greater bias. The cause of this negative bias of the MLE's when compared to Seber's estimates is the same as that which forces m 1 a 1 and pi l. The estimation of the pi and +i' or of the ai and pi' is not independent on the mi or the mi' and the first product of the likelihood cannot be ignored when finding MLE's for the ai and pi. The first product of the likelihood is maximized, with respect to mi and pi' by mi a<i+l and pi l. Further, given any starting value for pi and the maximizing mi' the value of this product can be increased by taking a smaller value for mi and a larger value for pi (unless a<i+l-a<i 0). Hence the influence of the first product in the estimation of pi is to pull pi toward one and to

-sdecrease mi. This is observed in the most extreme way for m 1,p 1 and m 8,ps as described above and continues to be undesirable in the estimation of m, 1 for i 2,,s-2. For the purpose of discussion we will identify "N" as the population size. In Seber's model an estimate of the population size may be derived from mi along with the oi and ai. MLE's of N are often justified by ~n asymptotic argument. However when estimating N, with p and + unknown, the usual asymptotics do not apply. Should we consider the estimation of N as the number of animals sampled becomes stochastically large, (the number of animals caught is a random variable,) if N is not allowed to vary then p must be allowed to go to one. However in practice p may be far from one. Hence to bound p from above by a constant less than one, and to allow number of animals sampled to stochastically increase, is to vary N. Again the parameters being estimated are required to vary in the asymptotic argument, and the usual theory does not apply when we wish to estimate 5. PROPOSED METHODOLOGY Even though Seber's estimates are not MLE's they have a strong intuitive and rational appeal as discussed by Cormack (1972). Also the estimates of the ai and ai are the MLE's when estimation is based on the recapture of previously tagged animals, the proper subpopulation from which we should make inferences concerning survival and catchability. If the total number of animals which enter the population of concern is interpreted as the sample size, then we find that Seber's estimates of the ai and ai possess desirable asymptotic properties of MLE's, as the mi-mi_ 1 increase without bound. Let ffii denote Seber's estimates of mi.

-6- Asymptotics such as those found in Carothers (1973) imply that the relative biases of the ffii approach zero as the mi-mi-l are allowed to increase without bound (when the assumptions of the model are met). This, along with the asymptotic variances found in Seber, implies the [ E(mi-m.) 2 11121 m. 1 1 approach zero as the mi-mi-l are allowed to increase without bound, suggesting a type of weak consistency. 6. CONCLUSION The failure in Seber's derivation of the MLE's stems from the use of a difference equation as a differential equation, in particular the equating of rni - a<i+l - (a<i+l - a<i)&i_1/ai-l with zero in the evaluation of 6L/6ai and 6L/6ai' rather than considering the proper inequalities. This problem does not arise when population size is the only unknown parameter. The population size is then estimated by the integer part of the solution to the difference equation, thus estimating population size by a true member of the parameter space. However little concern is usually voiced if population size is estimated by the solution to the difference equation itself as this estimate is within one of the MLE. When the likelihood involves unknown parameters in addition to those determining population size, however, one can no longer expect to be within one of the MLE's when the difference equation is set to zero. Even though the estimates proposed by Seber are not MLE's they do appear to be the proper estimates to use. His estimates of survival and capture probabilities are those we obtain if we consider MLE's for the subpopulation of tagged animals. Further the MLE's of the mi are biased negatively to a larger extent than Seber's estimates are often found to be through asymptotic expansion and simulation.

-7-7. REFERENCES Buckland, S.T. (1980). A modified analysis of the Jolly-Seber capturerecapture model. BLome~rLcs 36: 419-435. Carothers, A.D. (1973). The effects of unequal catchability on Jolly-Seber estimates. BLome~rLcs 29: 79-100. Cormack, R.M. (1972). The logic of capture-recapture estimates. BLomecrlcs 28: 337-343. Gilbert,R.O. (1973). Approximations of the bias in the Jolly-Seber capturerecapture model. Blome~rLcs 29: 501-526. Jolly, G.M. (1965). Explicit estimates from capture-recapture data with both death and immigration--stochastic model. BLome~rika 52: 225-247. Pollock, K.H. (1981). Capture-recapture models allowing for age-dependent survival and capture rates. Biomecrics 37: 521-529. Seber, G.A.F. (1965). A note on the multiple recapture census. BLomecrika 52: 249-259.