Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Size: px
Start display at page:

Download "Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology"

Transcription

1 Occupancy models Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology Advances in Species distribution modelling in ecological studies and conservation Pavia and Gran Paradiso, Italy Sept. 2011

2 Outline Session 1: Introduction to occupancy modelling S1.1 Introduction S1.2 Statistical background S1.3 Single-season occupancy models Session 2: Occupancy modelling in practice S2.1 Practical: single-season S2.2 Study design Session 3: Occupancy modelling developments S3.1 Multiple-season occupancy model S3.2 Practical: multi-season S3.3 Further models 2

3 SESSION 1 Introduction to occupancy modelling S1.1 - Introduction

4 The study of wildlife populations Why study wildlife populations? Science (ecology): how our world works Conservation: threatened species Management: harvesting sustainably Hypothesis testing & discrimination are my observations consistent with a hypothesis? which of this competing hypothesis is more likely? Prediction e.g. species distribution 4

5 The study of wildlife populations Different state variables: Abundance Occupancy Species richness Which state variable to use will depend on things like... hypotheses to test management objectives biology of the species under consideration resources available 5

6 Occupancy as a state variable Occupancy: Probability of a site being occupied by a species Depends on scale (definition of site ) Of interest in various areas of ecology: Species geographic range Habitat relationships Metapopulation dynamics Large-scale monitoring Species interactions ψ 1 =2/4=0.5 ψ 2 =2/16=

7 Occupancy as a state variable Often well suited for surveying large areas Abundance typically more difficult to estimate Few cases where direct counts are possible Some statistical techniques available (e.g. closedpopulation Mark-Recapture; Distance Sampling) In general: time-consuming / resource intensive / expensive / expert-skills-based (etc); and strong statistical assumptions 7

8 Occupancy as a state variable Occupancy may be used as a valid state variable in itself... although sometimes used as surrogate of abundance Warning example: occupancy as a surrogate for abundance? Ideal species Annoying species: 8

9 Occupancy as habitat preference If we can relate occupancy ( probability of a site being occupied by the species ) to covariates that describe the habitat model of habitat preferences habitat suitability model, species distribution model, etc Can construct maps of probability of occupancy or distribution 9

10 Sampling issues Spatial variation In general, you cannot sample the whole area of interest Sampling should be done according to project objectives and so that can make inference about the locations not surveyed Imperfect detection Animals, or even the whole species, can go undetected at sites where they are present 10

11 Sampling: imperfect detection Alaotran gentle lemur Hapalemur alaotrensis Critically Endangered Only lives in the Alaotra marsh, Madagascar 11

12 Sampling: imperfect detection 12

13 Sampling: imperfect detection 13

14 Sampling: imperfect detection 14

15 Sampling: imperfect detection Sumatran tiger Panthera tigris sumatrae 15

16 Sampling: imperfect detection 16

17 Sampling: imperfect detection Effects of imperfect detection: underestimate occupancy (estimated occupancy) = (real occupancy) x (detectability) occupied sites out of 36 survey sites proportion occupied sites = 17/36 = 0.47 Seen in 6 out of 17 occupied sites naive estimate= 6/36 =

18 probability Sampling: imperfect detection Effects of imperfect detection: obscure trends in occupancy true occupancy detectability observed occupancy year 18

19 apparent logit (occupancy) Sampling: imperfect detection Effects of imperfect detection: obscure relationships of occupancy with covariates habitat true relationship p<1 p ~ +habitat p ~ -habitat (MacKenzie et al. 2006) 19

20 Sampling: imperfect detection Note that the main problem is the variation in detectability (from site to site, from survey to survey) If detection probability is constant across geographical space and time, it may have less impact in our inference Sometimes we may be able to mitigate these variations with careful survey design But, in any case, we ll improve our estimates and inference if we model imperfect detection explicitly 20

21 Sampling: imperfect detection The issue of detectability is not a new thing Taken into account in old frameworks like Mark-Recapture and Distance Sampling ID

22 Sampling: imperfect detection Various methods for accounting for it when estimating occupancy MacKenzie et al (2002) provide the most general approach for dealing with this problem 22

23 SESSION 1 Introduction to occupancy modelling S1.2 - Statistical background

24 Basic concepts REALITY MODEL d h L w Parameters: d, h, L, w 24

25 Basic concepts "all models are wrong, but some are useful" (Box, 1976) 25

26 Basic concepts Parameter: characteristic of the system under study Estimator: a rule for calculating an estimate of the parameter from observed data Estimate: the output of the estimator for our data set DATA ,0,0,1,1,0, ESTIMATOR of θ ESTIMATE θ = ,

27 Basic concepts Bias precision - accuracy Precise Biased Imprecise Biased Imprecise Unbiased Precise Unbiased accurate! 27

28 Model description (e.g. seed germination) Suppose we plant 6 seeds, record whether they germinate successfully. How probable is to obtain this outcome? Assuming independence and given θ=probability of success (Bernoulli trials) Pr data θ = θ 1 θ 1 θ θθθ = θ 4 1 θ 2 e.g. if θ=0.7 Pr(data θ)=

29 Methods of inference Estimates can be obtained from the probabilistic description of the model using different methods of inference: Maximum likelihood ( classic or frequentist framework) Bayesian (Marc s modules) Note: the inference, NOT the model, is frequentist/bayesian Central to these two method is the likelihood function. 29

30 likelihood Likelihood function L(θ data)=pr(data θ) (for discrete data) A function of the model parameters (i.e. θ is the variable) How likely different parameter values are, given the data Simple change in point of view, no mathematical change! L(θ data) = θ 4 1 θ θ 30

31 likelihood Likelihood function The actual likelihood value is not important It s the relative values that matter! L(θ data) = θ 4 1 θ θ 31

32 likelihood Maximum likelihood estimation Find parameter value that maximize L MLE The widest the peak the more uncertainty The curvature of the function reflects the estimator variance Can derive variances inverting the information matrix (matrix of second partial derivatives of the log-likelihood function) L(θ data) = θ 4 1 θ θ 32

33 likelihood likelihood Maximum likelihood estimation n t = 6 n s = n t = 25 n s = E θ L θ data = θ n s 1 θ n t n s 3.E-07 2.E-07 1.E-07 n t = #trials n s =#successes 0.E θ 33

34 loglikelihood Likelihood function Usually log-likelihood: L θ data = θ 4 1 θ 2 log L θ data = 4 log θ + 2log 1 θ θ 34

35 Maximum likelihood estimation Sometimes an explicit expression exists to find the maximum deriving the function and solving e.g. dl dθ = n s θn t dl dθ = 0 θ = n s n t But usually we need to find the maximum numerically Theoretically we could evaluate the function in all points but normally not feasible as our function may have many parameters We use optimization algorithms (different methods exist) 35

36 Adding covariates Often interest in allowing parameters to be a function of covariates water oxygen temperature light When dealing with parameters (like probabilities) that are restricted to a range (e.g. 0-1) we require a link function to transform linear relationships 36

37 θ The logit link function The logit link function constrains values to 0-1 Not the only option but a widely used one (logistic regression) logit θ i = ln θ i 1 θ i = β 0 + β 1 c i1 + β 2 c i logit(θ) 37

38 The logit link function The relationship of the variable with the covariates is: logit θ i = ln θ i 1 θ i = β 0 + β 1 c i1 + β 2 c i2 + θ i = eβ 0+β 1 c i1 +β 2 c i e β 0+β 1 c i1 +β 2 c i2 + = e (β 0+β 1 c i1 +β 2 c i2 + ) 38

39 response θ The logit link function logit θ i = ln θ i 1 θ i = β 0 + β 1 x i The intercept β 0 controls the position of the transition b0=3, b1=1 b0=0, b1=1 b0=-3, b1= covariate x 39

40 response θ The logit link function logit θ i = ln θ i 1 θ i = β 0 + β 1 x i The slope coefficient β 1 control the slope ( rapidness ) of the transition b0=0, b1=0.4 b0=0, b1=1 b0=0, b1= covariate x 40

41 response θ The logit link function logit θ i = ln θ i 1 θ i = β 0 + β 1 x i The sign of β 1 tells if we go from high to low or vice versa (i.e. if the covariate has a positive or negative effect on θ) b0=1, b1=-1 b0=1, b1= covariate x 41

42 response θ The logit link function logit θ i = ln θ i 1 θ i = β 0 + β 1 x i Note that a large β 1 creates an abrupt transition......while β 1 =0 means there is no relationship with the covariate (no transition) b0=0, b1=1 b0=0, b1=20 b0=0, b1= covariate x 42

43 The logit link function Now the optimization is carried out on the β parameters Coming back to our seed example: Let s assume 1 covariate with value x i for pot i Pr data θ = L θ data = θ 1 1 θ 2 1 θ 3 θ 4 θ 5 θ 6 = e β 0 β 1 x e β 0 β 1 x e β 0 β 1 x e β 0 β 1 x e β 0 β 1 x e β 0 β 1 x 6 = L β 0, β 1 data 43

44 Delta method Can be used to derive SE s for transformed parameters e.g. SE for θ from variance-covariance of β s (previous slide) It s based on approximations and large-sample assumptions Involves derivatives and matrix multiplications Y = f(β) var Y Y β Σ β Y β T For a nice explanation, see the appendix 2 in the MARK book 44

45 Model comparison d h L w MODEL 1 Params.: d, h, L, w REALITY? Z z h h MODEL 2 Params.: z, h z z x MODEL 3 Params.: z, x 45

46 Model selection What is the support for model X, relative to others in the set, given the data? Model selection indicates what inferences the data supports, not what the full reality might be Conditional on sample size With more data further effects could probably be found 46

47 Model selection Hypothesis-testing approaches Assess if there is evidence to reject the null hypothesis e.g. likelihood-ratio test Information-theoretic approaches (Anderson, 2008) Rank models based on parsimony Trade-off between underfitting and overfitting e.g. Akaike information criterion (AIC) 47

48 Model selection: likelihood-ratio tests Test to compare the fit of two nested models i.e. null model (M 0 ) is a special case of the alternative model (M A ) e.g. θ(.) vs θ(temp) Test statistic D, based on the likelihood ratio D = 2log likelihood_m 0 likelihood_m A = 2 loglik_m 0 + 2loglik_M A Under null hypothesis, asymptotically, d : difference in # of parameters D ~ χ d 2 Can compute a p-value, or compare to a critical value, to 2 decide whether to reject M 0 (can reject if ) D > χ d:α 48

49 Model selection: likelihood-ratio tests E.g. θ(.) vs θ(temp) We fit θ(.) and obtain a maximum log-likelihood of We fit θ(temp) and obtain a maximum log-likelihood of D = = = 2.2 (test-statistic) d = 2 1 = 1 (difference in # of parameters) Compare D with a χ 2 with d = 1 degree of freedom 2 2 χ 1:0.05 = 3.84 D < χ d:α Do not have evidence to reject the null model θ(.) 49

50 Model selection: AIC Akaike Information Criterion AIC = 2L + 2K L is the maximum log-likelihood K is the number of parameters in the model (penalty against overfitting) The lower the AIC within the set, the better the model e.g. θ(.): L = -78.4, K=1 AIC = -2 (-78.4)+2 1 =158.8 θ(temp): L = -77.3, K=2 AIC = -2 (-77.3)+2 2 =158.6 (Burnham & Anderson 2002) 50

51 Model selection: AIC It is the relative AIC value that matters Usually talk about ΔAIC = AIC AIC bestmodel Rule-of-thumb: ΔAIC 0-2 units substantial support 4-7 considerably less support >10 no support (Burnham & Anderson 2002) 51

52 Model selection: AIC Be careful with pretending variables e.g occupancy study of a bird species in Italy Model A: ψ(elev + habitat) Model B adds an irrelevant covariate: ψ(elev + habitat + rainfall in China) The new covariate does not improve the fit (i.e. same likelihood) 1 extra covariate (i.e. 2 AIC units of penalty) Model B is 2 AIC units from Model A AIC = 2L + 2K Rule-of-thumb model B is a good model But the new variable is only pretending to be important This should not be taken as evidence that the new parameter is relevant! 52

53 Model selection: AIC Small sample adjustments: AIC may perform poorly if too many parameters for the sample size 2K K + 1 AICc = AIC + n K 1 n over is the effective sample size Debate over what n is in occupancy models Adjustment for overdispersion: QAIC = 2L c + 2K 53

54 Model averaging and model selection uncertainty Sometimes no single model is clearly best Can do multiple-model inference using all models in the set Model weights: ad-hoc measures of model support w j = exp ΔAIC j /2 m exp ΔAIC i /2 Averaged estimates and their standard errors θ A = m i=1 i=1 w i θ i m SE θ A = w i Var θ i M i + θ i θ A 2 i=1 Model uncertainty (Burnham & Anderson 2002) Between-model uncertainty 54

55 Model averaging and model selection uncertainty Considerations: Usually safer to model average the real parameters (e.g. probabilities) rather than the betas (i.e. regression coefficients) Need to make sure that the parameters averaged have the same interpretation Regression coefficients can be sensitive to other covariates in the model if they are correlated Typically, all models in the set used in averaging. If one or more models are removed, then the model weights must be renormalized such that they sum to 1. 55

56 Goodness-of-fit Model comparison identifies the best model in the set... but is it a good model at all? GoF tests assess how well a model fits a set of observations Look for evidence of lack of fit For small samples, tests may have low power to detect lack of fit Ideally an assessment of GoF should always be carried out Area that needs more development 56

57 SESSION 1 Introduction to occupancy modelling S1.3 Single-season occupancy model

58 Key ideas Aim: estimate species occupancy Issue: species imperfect detection Protocol: collect data so that we can model the detection process and therefore obtain unbiased occupancy estimates MacKenzie et al Tyre et al

59 Sampling protocol s sampling units surveyed (out of S; for now assume s<<s) Data collected: detection/non-detection ( presence/absence ) Replicate surveys are carried out in each sampling unit System closed to occupancy changes during sampling season

60 Replication Types of replication Repeated visits at different points in time Simultaneous independent observers (or detection methods) Spatial replication within the site 60

61 Detection history e.g. resulting data set h k Site Site Site Site Site Site s

62 True or false absence? Reality Field observations 62

63 True or false absence? ? ?? ?? 1000 Reality Field observations 63

64 Probabilistic description (likelihood) ψ = occupancy: probability a site is occupied 1 ψ) = probability a site is empty p = detectability: probability species is detected at a site in a survey, given presence 1 p) = probability of not detecting the species, given presence Model: based on closure assumption and independence of replicate surveys 64

65 Probabilistic description (likelihood) ψ = occupancy, Pr(site occupied) p = detectability, Pr(species detected present) e.g. h i =1001 the species is present and was detected in two of the surveys (and missed in the other two) Pr h i = 1001 ψ, p = ψp 1 p 1 p p 65

66 Probabilistic description (likelihood) ψ = occupancy, Pr(site occupied) p = detectability, Pr(species detected present) e.g. h i =0000 species is present and was not detected in any survey OR species is not present at the site Pr h i = 0000 ψ, p = ψ 1 p 1 p 1 p 1 p + (1 ψ) 66

67 Probabilistic description (likelihood) Likelihood is the product of the probabilistic statements for all sites L ψ, p h = Pr h ψ, p = Pr i ψ, p Maximized to obtain maximum-likelihood estimates (MLEs) S i=1 67

68 Probabilistic description (likelihood) E.g. system with ψ = 0.5, p = 0.3 Sampling s=200, k=3 Data: detected at 79 sites 147 detections Naive-ψ = 0.39 Estimates (SE): ψ-hat = 0.46 (0.044) p-hat = 0.32 (0.028) 68

69 Survey-specific detection probability Modelled using a parameter for each survey occasion e.g. h i =1001 the species is present and was detected in the 1 st and 4 th survey (and was missed in the 2 nd and 3 rd ) Pr h i = 1001 ψ, p 1, p 2, p 3, p 4 = ψp 1 1 p 2 1 p 3 p 4 69

70 Missing observations Some survey visits may be missed e.g. due to weather conditions or other logistical difficulties The model can readily cope with missing data e.g. h i =10 0 species is present and was detected in the 1 st survey and not detected in the 2 nd and 4 th surveys (we cannot say anything about the 3 rd survey, since it did not take place in this site) Pr h i = 10 0 ψ, p = ψp 1 1 p 2 1 p 4 70

71 Introducing covariates: site-specific Occupancy and detection probability can be a function of site characteristics e.g. Habitat type, patch size, human disturbance,... E.g. logit link function: logit ψ i = α 0 + α 1 A i + α 2 B i + logit p ij = β 0 + β 1 Q i + β 2 R i + Extension of logistic regression to account for imperfect detection (species distribution model) 71

72 Introducing covariates: survey-specific Detection probability can also be a function of survey-specific characteristics e.g. Observer, weather... logit p ij = β 0 + β 1 Q i + β 2 R i + β 3 S ij + β 4 T ij + Remember: in this model we assume no changes in occupancy during the survey season occupancy cannot be a function of survey-specific covariates 72

73 Introducing covariates Covariates can be: Continuous, e.g. elevation (m) Categorical, e.g. habitat type Standardizing continuous covariates into a meaningful scale can be useful and may avoid numerical problems e.g. z-transform: x x SD x 73

74 Introducing covariates Categorical represented with dummy variables (binary) If m categories need m-1 dummy variables (e.g. 4 habitat types: 3 dummy variables indicating habitats 1, 2 & 3; habitat 4 as reference) Can also use a variable per category Habitat A B C Habitat A B C D logit ψ i = α 0 + α 1 A i + α 2 B i + α 3 C i logit ψ i = α 1 A i + α 2 B i + α 3 C i + α 4 D i Habitat 4 is taken as the reference here 74

75 apparent logit (occupancy) Introducing covariates Remember, not accounting for imperfect detection may obscure relationships of occupancy with covariates: habitat true relationship p<1 p ~ +habitat p ~ -habitat (MacKenzie et al. 2006) 75

76 Covariates and imperfect detection: an example MacKenzie (2006) explores the effect of disregarding detectability in the analysis of resource use by pronghorns (Antilocapra americana) 256 locations in Wyoming surveyed during 2 consecutive winters ( and ) 4 covariates: Sg : sagebrush density (bushes/ha) Sl : slope DW : distance to water (km) A : aspect J. Leupold 76

77 Covariates and imperfect detection: an example Analysis 1: simple logistic regression Implicit assumptions: Negligible probability of false detection......or detectability constant ( results are then relative) Summed AIC model weights: Distance to water: 86% Slope: 52% Sagebrush density: 35% Aspect: 16% (MacKenzie 2006) 77

78 Covariates and imperfect detection: an example Analysis 2: single-season occupancy model Each winter used as a replicate k=2 Use rather than occupancy (pronghorns may not be available for detection at the site at the time of the survey) General model for detectability: p(sg+sl+dw+a) Summed AIC model weights: Slope: 55% Sagebrush density: 41% Distance to water: 29% Aspect: 6% (MacKenzie 2006) 78

79 Covariates and imperfect detection: an example Analysis 2b: single-season occupancy model exploration of detectability Model selection on detectability while keeping general model for occupancy: ψ(sg+sl+dw+a) Summed AIC model weights (p): Distance to water: 86% Aspect: 37% Slope: 29% Sagebrush density: 27% Model with constant detectability p(.): low support (ΔAIC =2.8) (MacKenzie 2006) 79

80 Conditional occupancy Probability that a site is occupied, given there were no detections Pr occupied & nondetected Pr(occupied nondetected)= Pr nondetected = k ψ i (1 p ij ) k j =1 j =1 ψ i (1 p ij ) + 1 ψ i E.g. ψ=0.3, p=0.7 and k=3: =

81 Goodness of fit MacKenzie & Bailey (2004) propose a GOF test based on the observed vs expected number of sites with each of the possible detection histories Observed (O h ): from the data set Expected (E h ): based on the parameter estimates obtained in the analysis Test statistic (Pearson s chi-square: Χ 2 ) Χ 2 = O E 2 E Parametric bootstrapping (i.e. simulating histories) to determine whether the observed test statistic is unusually large 81

82 Goodness of fit e.g. simple case (no covariates, no missing data) s=40, k=3, ψ-hat=0.6, p-hat=0.3 E 000 = s ψ 1 p ψ History E h O h (O h -E h ) 2 /E h E 001 = s ψ 1 p 2 p Distribution bootstrap χ 2 s χ

83 Goodness of fit Null hypothesis: there is no lack of fit p-val < 0.05 evidence of lack of fit p-val > 0.05 no evidence of lack of fit ( evidence of fit!) Overdispersion parameter c-hat 2 2 c = Χ obs /Χ B In general recommended to assess the global model first and get the c-hat. 83

84 Goodness of fit Test performance: Power to detect lack-of-fit caused by an incorrect structure for detection probability Failure to detect poor model fit caused by occupancy probabilities In general low power for the sample sizes expected in many applications! 84

85 Key model assumptions 1. System closed to changes in occupancy 2. Independent detections 3. No false positives 4. No unmodelled heterogeneity Pr 1001 ψ, p = ψ i p i1 1 p i2 1 p i3 p i4 85

86 Assumptions: closure If changes at random: occupancy estimator remains unbiased ψ interpreted as use p is smaller as it involves two components Pr species detected at site i during survey j = Pr(species uses site i) = 1 if closure Pr(species present at site i during survey j uses site i) Pr(species detected during survey j present at site i during survey j) 86

87 Assumptions: closure If changes at random: occupancy estimator remains unbiased ψ interpreted as use p is smaller as it involves two components Pr species detected at site i during survey j = Pr(species uses site i) ψ Pr(species present at site i during survey j uses site i) = 1 if closure p Pr(species detected during survey j present at site i during survey j) 87

88 Assumptions: closure If emigration/immigration ψ relates to the probability species present at the site at the start/end of the season Need to allow p to change along the season Other non-random changes can cause bias (harder to interpret what parameter estimates mean) An issue to be considered during study design is the season defined appropriately? is the time between surveys suitable? 88

89 Assumptions: independence If outcome of one survey depends on the outcome of other survey lack of independence Can be induced by different mechanisms: e.g. Species easier to detect at a site where it has already been detected ( trap response ) e.g. Surveys carried out close in time so that the species is more likely to be detected if it was detected in the previous survey To tackle it: good design, modelling 89

90 Assumptions: independence Survey-specific covariate to account for trap response indicates surveys that happen after 1 st detection at the site Site History s

91 Assumptions: independence Model to account for lack of closure and dependence between consecutive replicates (Hines et al. 2010) Motivating example: sign surveys along transects (replicate = transect segment)

92 Assumptions: independence Model to account for lack of closure and dependence between consecutive replicates (Hines et al. 2010) Motivating example: sign surveys along transects (replicate = transect segment) Two new parameters into the model: θ = probability that the species is present at a replicate visit given it was not present in the previous replicate θ = probability that the species is present at a replicate visit given it was present in the previous replicate 92

93 Assumptions: independence Hidden Markov model Detection process at occupied sites is 1- θ θ 1-θ Present at a replicate Absent at a replicate Detected at a replicate p θ If θ = θ independence 93

94 Assumptions: independence e.g. Pr( i = 0101) = π 1 1 p θ pθ 1 p θ p + π 1 1 p θ p 1 θ θp + 1 π 1 θpθ 1 p θ p + 1 π 1 θp 1 θ θp ψ Hidden states: PPPP PPAP APPP APAP π 1 = probability of starting in present state (function of the other parameters) 1- θ θ 1-θ P A 1 p θ 94

95 Assumptions: no false positives If no false positives unambiguous state that allows the estimation of other parameters (like false negatives) If false positives, occupancy could be severely overestimated In practice, usually less of an issue that false absences Can be accounted for in the modelling: If no data on known false detections, could model it with finite mixture (Royle & Link, 2006) but this approach has problems If auxiliary data on false detection rate (e.g. expert prior information or genetic tests), can model the misidentification process with a separate likelihood component (McClintock et al. 2010) Explicit model of false positives if data from multiple detection methods, when one method has no uncertainty (Miller et al. 2011) 95

96 Assumptions: no unmodelled heterogeneity Model assumes that ψ and p are constant or a function of covariates If heterogeneity remains in occupancy probability: Parameter values still valid as average values across the sites surveyed If heterogeneity remains in detection probability: May induces negative bias in occupancy estimator (underestimation) 96

97 Assumptions: incorporating heterogeneity in p Models that incorporate heterogeneity in detectability Finite mixtures Continuous mixtures (random effects) Abundance-induced heterogeneity (Royle-Nichols) See Royle (2006) in Biometrics 97

98 Assumption: incorporating heterogeneity in p Finite mixtures Assume that each site belongs to one of G groups, each with a different detection probability group 1 p 1 group 2 p 2... group G p G Group membership is not known e.g. 2 groups with π 1 = probability of belonging to group 1 Pr( i = 101) = ψ π 1 p 1 1 p 1 p π 1 p 2 1 p 2 p 2 98

99 Assumption: incorporating heterogeneity in p Continuous mixtures Assume that p i is a random value drawn from a continuous distribution (e.g. beta distribution, logit-normal...) Estimate the parameters of the distribution In some cases a closed expression exists (e.g. beta-binomial) In general, easier to implement in the Bayesian framework 99

100 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Differences in site abundance can induce heterogeneity in detection probability Site 1 Site 2 Lower detectability of the species Higher detectability of the species 100

101 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Differences in site abundance can induce heterogeneity in detection probability Royle & Nichols (2003) propose a way to model this Link heterogeneity in detection probability and heterogeneity in abundance by where p ij = species detection probability, r j p ij = 1 1 r j N i = individuals detection probability N i = number of individuals in the site 101

102 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) The number of individuals at the site N i is unknown Modelled as a random variable with a given distribution Occupancy can be derived as Pr(N i >0 =1 Pr N i =0) e.g. Poisson distribution Pr N = x = λx e λ x! ψ = 1 e λ lambda=2 lambda=

103 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) The number of individuals at the site N i is unknown Modelled as a random variable with a given distribution Occupancy can be derived as Pr(N i >0 =1 Pr N i =0) Discrete mixture over site abundance Pr i = 101 λ, r = p i1 1 p i2 p i3 Pr(N i ) N i = p i1 1 p i2 p i3 e λ λ N i N i N i! with p ij = 1 1 r j Ni p ij = 1 1 r j N i 103

104 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Model assumptions each individual detected independently of others in the site all individuals equally detectable number of individuals in each site does not change during the survey season ( closure ) abundance main source of heterogeneity in p 104

105 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Need to take care with the interpretation of abundance and the model assumptions Heterogeneity in p from other sources can be picked up as abundance Perhaps more useful as a means to account for heterogeneity that as a method to estimate abundance 105

106 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Example (Royle & Nichols 2003) North American Breeding Bird Survey route S=50 stops K=11 (over 30 days) Poisson assumption on abundance at each stop (no covariates constant λ) Possible changes in breeding activity (individual) detectability may vary over time: logit r = β 0 + β 1 day + β 2 day 2 106

107 Assumption: incorporating heterogeneity in p Abundance-induced heterogeneity (Royle-Nichols model) Example (Royle & Nichols 2003) Hermit thrush logit r = β 0 + β 1 day + β 2 day 2 Model M1: only intercept (constant detectability) Model M2: β 0 and β 1 (linear change in detectability) Model M3: all three parameters (quadratic change in detectability) Model M0: MacKenzie et al. (2002) (no abundance-induced heterogeneity) Wood thrush D. Gordon Robertson Steve Maslowski 107

108 Finite population So far we have assumed s<<s ( infinite population ) We estimate the probability of occupancy, an underlying characteristic of the population The proportion of occupied sites is a realisation of this process very large population: S our survey area: s << S 108

109 Finite population The distinction between these two concepts is important when dealing with finite population (s S) we survey all available habitat (our survey area s covers practically all S) 109

110 Finite population The distinction between these two concepts is important when dealing with finite population (s S) SE s will be too large if not accounted for By default we are including the uncertainty derived from sampling from a infinite population ( binomial experiment ) The proportion of occupied sites can be calculated as s S d + ψ i c i=s d +1 S + ψ i i=s+1 S SE can be derived using the delta method (MacKenzie et al. 2006) 110

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology Occupancy models Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology Advances in Species distribution modelling in ecological studies and conservation Pavia and Gran

More information

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology

Occupancy models. Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology Occupancy models Gurutzeta Guillera-Arroita University of Kent, UK National Centre for Statistical Ecology Advances in Species distribution modelling in ecological studies and conservation Pavia and Gran

More information

Represent processes and observations that span multiple levels (aka multi level models) R 2

Represent processes and observations that span multiple levels (aka multi level models) R 2 Hierarchical models Hierarchical models Represent processes and observations that span multiple levels (aka multi level models) R 1 R 2 R 3 N 1 N 2 N 3 N 4 N 5 N 6 N 7 N 8 N 9 N i = true abundance on a

More information

Lecture 09 - Patch Occupancy and Patch Dynamics

Lecture 09 - Patch Occupancy and Patch Dynamics WILD 7970 - Analysis of Wildlife Populations 1 of 11 Lecture 09 - Patch Occupancy and Patch Dynamics Resources Site Occupancy D. I. MacKenzie, J. D. Nichols, G. D. Lachman, S. Droege, J. A. Royle, and

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are:

Four aspects of a sampling strategy necessary to make accurate and precise inferences about populations are: Why Sample? Often researchers are interested in answering questions about a particular population. They might be interested in the density, species richness, or specific life history parameters such as

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Darryl I. MacKenzie 1

Darryl I. MacKenzie 1 Aust. N. Z. J. Stat. 47(1), 2005, 65 74 WAS IT THERE? DEALING WITH IMPERFECT DETECTION FOR SPECIES PRESENCE/ABSENCE DATA Darryl I. MacKenzie 1 Proteus Wildlife Research Consultants Summary Species presence/absence

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit

Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit WILD 7250 - Analysis of Wildlife Populations 1 of 16 Lecture 7 Models for open populations: Tag recovery and CJS models, Goodness-of-fit Resources Chapter 5 in Goodness of fit in E. Cooch and G.C. White

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

ECE521 week 3: 23/26 January 2017

ECE521 week 3: 23/26 January 2017 ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear

More information

Brett Skelly, Katharine Lewis, Reina Tyl, Gordon Dimmig & Christopher Rota West Virginia University

Brett Skelly, Katharine Lewis, Reina Tyl, Gordon Dimmig & Christopher Rota West Virginia University CHAPTER 22 Occupancy models multi-species Brett Skelly, Katharine Lewis, Reina Tyl, Gordon Dimmig & Christopher Rota West Virginia University Ecological communities are composed of multiple interacting

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Statistical inference

Statistical inference Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

EXERCISE 8: REPEATED COUNT MODEL (ROYLE) In collaboration with Heather McKenney

EXERCISE 8: REPEATED COUNT MODEL (ROYLE) In collaboration with Heather McKenney EXERCISE 8: REPEATED COUNT MODEL (ROYLE) In collaboration with Heather McKenney University of Vermont, Rubenstein School of Environment and Natural Resources Please cite this work as: Donovan, T. M. and

More information

CS-E3210 Machine Learning: Basic Principles

CS-E3210 Machine Learning: Basic Principles CS-E3210 Machine Learning: Basic Principles Lecture 4: Regression II slides by Markus Heinonen Department of Computer Science Aalto University, School of Science Autumn (Period I) 2017 1 / 61 Today s introduction

More information

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches Sta 216, Lecture 4 Last Time: Logistic regression example, existence/uniqueness of MLEs Today s Class: 1. Hypothesis testing through analysis of deviance 2. Standard errors & confidence intervals 3. Model

More information

Open Problems in Mixed Models

Open Problems in Mixed Models xxiii Determining how to deal with a not positive definite covariance matrix of random effects, D during maximum likelihood estimation algorithms. Several strategies are discussed in Section 2.15. For

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Probability quantifies randomness and uncertainty How do I estimate the normalization and logarithmic slope of a X ray continuum, assuming

More information

Introduction to capture-markrecapture

Introduction to capture-markrecapture E-iNET Workshop, University of Kent, December 2014 Introduction to capture-markrecapture models Rachel McCrea Overview Introduction Lincoln-Petersen estimate Maximum-likelihood theory* Capture-mark-recapture

More information

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1) HW 1 due today Parameter Estimation Biometrics CSE 190 Lecture 7 Today s lecture was on the blackboard. These slides are an alternative presentation of the material. CSE190, Winter10 CSE190, Winter10 Chapter

More information

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 11 January 7, 2013 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline How to communicate the statistical uncertainty

More information

Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1

Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1 Webinar Session 1. Introduction to Modern Methods for Analyzing Capture- Recapture Data: Closed Populations 1 b y Bryan F.J. Manly Western Ecosystems Technology Inc. Cheyenne, Wyoming bmanly@west-inc.com

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

Incorporating Boosted Regression Trees into Ecological Latent Variable Models

Incorporating Boosted Regression Trees into Ecological Latent Variable Models Incorporating Boosted Regression Trees into Ecological Latent Variable Models Rebecca A. Hutchinson, Li-Ping Liu, Thomas G. Dietterich School of EECS, Oregon State University Motivation Species Distribution

More information

To hear the seminar, dial (605) , access code

To hear the seminar, dial (605) , access code Welcome to the Seminar Resource Selection Functions and Patch Occupancy Models: Similarities and Differences Lyman McDonald Senior Biometrician WEST, Inc. Cheyenne, Wyoming and Laramie, Wyoming lmcdonald@west-inc.com

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Matrix Data: Prediction Instructor: Yizhou Sun yzsun@ccs.neu.edu September 14, 2014 Today s Schedule Course Project Introduction Linear Regression Model Decision Tree 2 Methods

More information

Finding patterns in nocturnal seabird flight-call behaviour

Finding patterns in nocturnal seabird flight-call behaviour Information Theoretic Approach: AIC!! Finding patterns in nocturnal seabird flight-call behaviour Rachel Buxton Biology 7932 9 November 29 Nocturnality in seabirds Active around colonies only at night

More information

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)

More information

Cormack-Jolly-Seber Models

Cormack-Jolly-Seber Models Cormack-Jolly-Seber Models Estimating Apparent Survival from Mark-Resight Data & Open-Population Models Ch. 17 of WNC, especially sections 17.1 & 17.2 For these models, animals are captured on k occasions

More information

Model comparison and selection

Model comparison and selection BS2 Statistical Inference, Lectures 9 and 10, Hilary Term 2008 March 2, 2008 Hypothesis testing Consider two alternative models M 1 = {f (x; θ), θ Θ 1 } and M 2 = {f (x; θ), θ Θ 2 } for a sample (X = x)

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

simple if it completely specifies the density of x

simple if it completely specifies the density of x 3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

Simple logistic regression

Simple logistic regression Simple logistic regression Biometry 755 Spring 2009 Simple logistic regression p. 1/47 Model assumptions 1. The observed data are independent realizations of a binary response variable Y that follows a

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Density Estimation. Seungjin Choi

Density Estimation. Seungjin Choi Density Estimation Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr http://mlg.postech.ac.kr/

More information

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Introduction: MLE, MAP, Bayesian reasoning (28/8/13) STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification, Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Lecture 3. Hypothesis testing. Goodness of Fit. Model diagnostics GLM (Spring, 2018) Lecture 3 1 / 34 Models Let M(X r ) be a model with design matrix X r (with r columns) r n

More information

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3 STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae

More information

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

HYPOTHESIS TESTING: FREQUENTIST APPROACH. HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous

More information

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test. Economics 52 Econometrics Professor N.M. Kiefer LECTURE 1: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING NEYMAN-PEARSON LEMMA: Lesson: Good tests are based on the likelihood ratio. The proof is easy in the

More information

Topic 12 Overview of Estimation

Topic 12 Overview of Estimation Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

EXERCISE 14: SINGLE-SEASON, SPECIES-INTERACTIONS OCCUPANCY MODELS. In collaboration with Rebecca J. Pfeiffer and Jeremy M. Clark

EXERCISE 14: SINGLE-SEASON, SPECIES-INTERACTIONS OCCUPANCY MODELS. In collaboration with Rebecca J. Pfeiffer and Jeremy M. Clark EXERCISE 14: SINGLE-SEASON, SPECIES-INTERACTIONS OCCUPANCY MODELS In collaboration with Rebecca J. Pfeiffer and Jeremy M. Clark University of Vermont, Rubenstein School of Environment and Natural Resources

More information

Site Occupancy Models with Heterogeneous Detection Probabilities

Site Occupancy Models with Heterogeneous Detection Probabilities Biometrics 62, 97 102 March 2006 DOI: 10.1111/j.1541-0420.2005.00439.x Site Occupancy Models with Heterogeneous Detection Probabilities J. Andrew Royle USGS Patuxent Wildlife Research Center, 12100 Beech

More information

Chapter 5: Logistic Regression-I

Chapter 5: Logistic Regression-I : Logistic Regression-I Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu] D. Bandyopadhyay

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

COS513 LECTURE 8 STATISTICAL CONCEPTS

COS513 LECTURE 8 STATISTICAL CONCEPTS COS513 LECTURE 8 STATISTICAL CONCEPTS NIKOLAI SLAVOV AND ANKUR PARIKH 1. MAKING MEANINGFUL STATEMENTS FROM JOINT PROBABILITY DISTRIBUTIONS. A graphical model (GM) represents a family of probability distributions

More information

6.867 Machine Learning

6.867 Machine Learning 6.867 Machine Learning Problem set 1 Solutions Thursday, September 19 What and how to turn in? Turn in short written answers to the questions explicitly stated, and when requested to explain or prove.

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Generalised linear models. Response variable can take a number of different formats

Generalised linear models. Response variable can take a number of different formats Generalised linear models Response variable can take a number of different formats Structure Limitations of linear models and GLM theory GLM for count data GLM for presence \ absence data GLM for proportion

More information

Count data page 1. Count data. 1. Estimating, testing proportions

Count data page 1. Count data. 1. Estimating, testing proportions Count data page 1 Count data 1. Estimating, testing proportions 100 seeds, 45 germinate. We estimate probability p that a plant will germinate to be 0.45 for this population. Is a 50% germination rate

More information

Parameter Estimation and Fitting to Data

Parameter Estimation and Fitting to Data Parameter Estimation and Fitting to Data Parameter estimation Maximum likelihood Least squares Goodness-of-fit Examples Elton S. Smith, Jefferson Lab 1 Parameter estimation Properties of estimators 3 An

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models

More information

Bayesian Regression (1/31/13)

Bayesian Regression (1/31/13) STA613/CBB540: Statistical methods in computational biology Bayesian Regression (1/31/13) Lecturer: Barbara Engelhardt Scribe: Amanda Lea 1 Bayesian Paradigm Bayesian methods ask: given that I have observed

More information

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) = Until now we have always worked with likelihoods and prior distributions that were conjugate to each other, allowing the computation of the posterior distribution to be done in closed form. Unfortunately,

More information

Model Selection in GLMs. (should be able to implement frequentist GLM analyses!) Today: standard frequentist methods for model selection

Model Selection in GLMs. (should be able to implement frequentist GLM analyses!) Today: standard frequentist methods for model selection Model Selection in GLMs Last class: estimability/identifiability, analysis of deviance, standard errors & confidence intervals (should be able to implement frequentist GLM analyses!) Today: standard frequentist

More information

Introduction to Occupancy Models. Jan 8, 2016 AEC 501 Nathan J. Hostetter

Introduction to Occupancy Models. Jan 8, 2016 AEC 501 Nathan J. Hostetter Introduction to Occupancy Models Jan 8, 2016 AEC 501 Nathan J. Hostetter njhostet@ncsu.edu 1 Occupancy Abundance often most interesting variable when analyzing a population Occupancy probability that a

More information

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio Estimation of reliability parameters from Experimental data (Parte 2) This lecture Life test (t 1,t 2,...,t n ) Estimate θ of f T t θ For example: λ of f T (t)= λe - λt Classical approach (frequentist

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Loglikelihood and Confidence Intervals

Loglikelihood and Confidence Intervals Stat 504, Lecture 2 1 Loglikelihood and Confidence Intervals The loglikelihood function is defined to be the natural logarithm of the likelihood function, l(θ ; x) = log L(θ ; x). For a variety of reasons,

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Matrix Data: Prediction Instructor: Yizhou Sun yzsun@ccs.neu.edu September 21, 2015 Announcements TA Monisha s office hour has changed to Thursdays 10-12pm, 462WVH (the same

More information

Finite mixture models in secr 3.0 Murray Efford

Finite mixture models in secr 3.0 Murray Efford Finite mixture models in secr 3.0 Murray Efford 2017-04-05 Contents Implementation in secr 1 Number of classes 3 Multimodality 3 Hybrid hcov model 5 Notes 5 References 5 Appendix. SECR finite mixture model

More information

CHAPTER 21. Occupancy models

CHAPTER 21. Occupancy models CHAPTER 21 Occupancy models Brian D. Gerber, Brittany Mosher, Daniel Martin, Larissa Bailey, Colorado State University Thierry Chambert, Penn State University & USGS As ecologists and conservation biologists,

More information

Hypothesis testing: theory and methods

Hypothesis testing: theory and methods Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

Basic Concepts of Inference

Basic Concepts of Inference Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).

More information

Stat 587: Key points and formulae Week 15

Stat 587: Key points and formulae Week 15 Odds ratios to compare two proportions: Difference, p 1 p 2, has issues when applied to many populations Vit. C: P[cold Placebo] = 0.82, P[cold Vit. C] = 0.74, Estimated diff. is 8% What if a year or place

More information

Experimental Design and Statistical Methods. Workshop LOGISTIC REGRESSION. Jesús Piedrafita Arilla.

Experimental Design and Statistical Methods. Workshop LOGISTIC REGRESSION. Jesús Piedrafita Arilla. Experimental Design and Statistical Methods Workshop LOGISTIC REGRESSION Jesús Piedrafita Arilla jesus.piedrafita@uab.cat Departament de Ciència Animal i dels Aliments Items Logistic regression model Logit

More information

Parametric Techniques Lecture 3

Parametric Techniques Lecture 3 Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Machine Learning for OR & FE

Machine Learning for OR & FE Machine Learning for OR & FE Regression II: Regularization and Shrinkage Methods Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com

More information

Bayesian Analysis for Natural Language Processing Lecture 2

Bayesian Analysis for Natural Language Processing Lecture 2 Bayesian Analysis for Natural Language Processing Lecture 2 Shay Cohen February 4, 2013 Administrativia The class has a mailing list: coms-e6998-11@cs.columbia.edu Need two volunteers for leading a discussion

More information

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,

More information

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago Mohammed Research in Pharmacoepidemiology (RIPE) @ National School of Pharmacy, University of Otago What is zero inflation? Suppose you want to study hippos and the effect of habitat variables on their

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 12: Frequentist properties of estimators (v4) Ramesh Johari ramesh.johari@stanford.edu 1 / 39 Frequentist inference 2 / 39 Thinking like a frequentist Suppose that for some

More information

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information