Range extensions in anemonefishes and host sea anemones in eastern Australia: potential constraints to tropicalisation

Similar documents
Introduction to the Generalized Linear Model: Logistic regression and Poisson regression

Generalized Linear Models. stat 557 Heike Hofmann

A Generalized Linear Model for Binomial Response Data. Copyright c 2017 Dan Nettleton (Iowa State University) Statistics / 46

Linear Regression Models P8111

Modeling Overdispersion

Generalised linear models. Response variable can take a number of different formats

Regression models. Generalized linear models in R. Normal regression models are not always appropriate. Generalized linear models. Examples.

Duration of Unemployment - Analysis of Deviance Table for Nested Models

Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/

Logistic Regressions. Stat 430

Logistic Regression - problem 6.14

Multiple Regression and Regression Model Adequacy

Exercise 5.4 Solution

Generalized linear models

Logistic Regression 21/05

Interactions in Logistic Regression

Overdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion

Spatio-temporal dynamics of Marbled Murrelet hotspots during nesting in nearshore waters along the Washington to California coast

R Output for Linear Models using functions lm(), gls() & glm()

Sample solutions. Stat 8051 Homework 8

Log-linear Models for Contingency Tables

12 Modelling Binomial Response Data

A strategy for modelling count data which may have extra zeros

STA102 Class Notes Chapter Logistic Regression

9 Generalized Linear Models

R Hints for Chapter 10

Explanatory variables are: weight, width of shell, color (medium light, medium, medium dark, dark), and condition of spine.

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Generalized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model

BMI 541/699 Lecture 22

Age 55 (x = 1) Age < 55 (x = 0)

How to work correctly statistically about sex ratio

A Handbook of Statistical Analyses Using R. Brian S. Everitt and Torsten Hothorn

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Generalized linear models

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

Poisson Regression. The Training Data

Matched Pair Data. Stat 557 Heike Hofmann

MSH3 Generalized linear model

Model Estimation Example

PAPER 206 APPLIED STATISTICS

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.

Generalized Linear Models

SCW6-Doc05. CPUE standardization for the offshore fleet fishing for Jack mackerel in the SPRFMO area

Linear Regression. Data Model. β, σ 2. Process Model. ,V β. ,s 2. s 1. Parameter Model

Week 7 Multiple factors. Ch , Some miscellaneous parts

Falkland Island Fisheries Department. Castelo (ZDLT1) Falkland Islands. Dates 9/02/ /02/ May

Checking the Poisson assumption in the Poisson generalized linear model

STAC51: Categorical data Analysis

Influence of feeding conditions on breeding of African penguins importance of adequate local food supplies

Statistical analysis of trends in climate indicators by means of counts

Logistic Regression for Circular Data

STA 450/4000 S: January

On the Inference of the Logistic Regression Model

Nonlinear Models. What do you do when you don t have a line? What do you do when you don t have a line? A Quadratic Adventure

How to deal with non-linear count data? Macro-invertebrates in wetlands

Various Issues in Fitting Contingency Tables

Non-Gaussian Response Variables

Classification. Chapter Introduction. 6.2 The Bayes classifier

Cherry.R. > cherry d h v <portion omitted> > # Step 1.

PAPER 218 STATISTICAL LEARNING IN PRACTICE

Poisson Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

MSH3 Generalized linear model

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches

STAT 526 Spring Midterm 1. Wednesday February 2, 2011

Supplementary Material

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

Regression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Hurricane Forecasting Using Regression Modeling

DIC, AIC, BIC, PPL, MSPE Residuals Predictive residuals

Survival Analysis I (CHL5209H)

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:

Niche Modeling. STAMPS - MBL Course Woods Hole, MA - August 9, 2016

Logistic & Tobit Regression

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

ssh tap sas913, sas

Q30b Moyale Observed counts. The FREQ Procedure. Table 1 of type by response. Controlling for site=moyale. Improved (1+2) Same (3) Group only

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Booklet of Code and Output for STAD29/STA 1007 Midterm Exam

Regression Methods for Survey Data

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )

Recap. HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis:

Stat 8053, Fall 2013: Multinomial Logistic Models

Exam Applied Statistical Regression. Good Luck!

Where have all the puffins gone 1. An Analysis of Atlantic Puffin (Fratercula arctica) Banding Returns in Newfoundland

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

MSH3 Generalized linear model Ch. 6 Count data models

The glarma Package for Observation Driven Time Series Regression of Counts

Stat 401B Final Exam Fall 2016

Logistic Regression. 0.1 Frogs Dataset

UNIVERSITY OF TORONTO Faculty of Arts and Science

Transcription:

Marine and Freshwater Research, 2017, 68, 1224 1232 CSIRO 2017 Supplementary material Range extensions in anemonefishes and host sea anemones in eastern Australia: potential constraints to tropicalisation Hamish Malcolm A,C and Anna Scott B A Fisheries Research, Marine Ecosystems, NSW Department of Primary Industries, Department of Primary Industries, Locked Bag 800, Nelson Bay, NSW 2315, Australia. B National Marine Science Centre and Marine Ecology Research Centre, Southern Cross University, PO Box 4321, Coffs Harbour, NSW 2450, Australia. C Corresponding author. Email: hamish.malcolm@dpi.nsw.gov.au Sea-temperature patterns There were distinct differences in minimum, maximum and average sea temperature across-shelf in the Solitaries Region for the 3 years before the present survey, with warmer sea temperature further offshore. South Solitary Island had the highest sea temperatures of the survey locations in the Solitaries (16.8 C minimum, 26.8 C maximum, 21.6 C average), whereas temperatures were lower at inshore sites (16.4 C minimum, 25.7 C maximum, 21.2 C average). Although Fish Rock had average and maximum sea temperatures very similar to those of South Solitary, the minimum temperature recorded was much lower at Fish Rock (14.5 v. 16.8 C). Further inshore at Smoky Cape (Black Rock, Green Island), average, maximum and minimum temperatures were ~0.5 1 C lower than at Fish Rock. The average temperature at Broughton Island over the study period was 2.8 C lower than that at Fish Rock. The proportion of time that sea-temperature was below 18 C was much higher at Broughton Island and the proportion of time it was above 24 C much lower. Page 1 of 11

28 26 24 22 20 18 16 06/2010 12/2010 06/2011 12/2011 06/2012 12/2012 06/2013 Fig. S1. Daily average sea temperatures ( C) from logging thermistor stations recording every 30 min at inshore Solitaries (green line) v. Fish Rock (blue line). 28 26 24 22 20 18 16 06/2010 12/2010 06/2011 12/2011 06/2012 12/2012 06/2013 Fig. S2. Daily average sea temperatures ( C) from logging thermistor stations recording every 30 min at South Solitary Island (orange line) v. Fish Rock (blue line). Page 2 of 11

28 26 24 22 20 18 16 06/2010 12/2010 06/2011 12/2011 06/2012 12/2012 06/2013 Fig. S3. Daily average sea temperatures ( C) from logging thermistor stations recording every 30 min at inshore Solitaries (green line) v. South Solitary Island (orange line). 28 26 24 22 20 18 16 14 03/2010 11/2010 07/2011 04/2012 12/2012 08/2013 Fig. S4. Daily average sea temperatures ( C) from logging thermistor stations recording every 60 min at Fish Rock (blue line) v. Broughton Island (purple line). Page 3 of 11

GLM results Table S1. Collinearity Collinear (Pearson and Spearman correlations) Pearson Latitude and minimum temperature (0.9) Pearson, Spearman Maximum temperature and average temperature (0.76, 0.86) Spearman Distance from shore and average temp (0.70) Summary Entacmaea quadricolor Where E. quadricolor was recorded, it was more abundant with a greater area of emergent rock and with higher average temperatures and a greater distance from shore. The final model for E. quadricolor was emergent rock + average temperature (AIC 45.8), with both variables being significant (Table 3 of the main paper), although a model with emergent rock + distance from shore (AIC 46.7) was similar. This supported observed patterns, with highest abundance offshore at islands where sea temperature is higher, on average, than it is closer to the mainland coast. Individually, emergent rock was the most important predictor (AIC 51.5) of the abundance of E quadricolor. Area of emergent rock + average temperature explained ~68% of the variation for the count model. Area of emergent rock individually explained ~31% of the variation. Heteractis crispa Where H. crispa was recorded, it was more abundant with a greater area of emergent rock and with higher average temperatures. The final model was emergent rock + average temperature (AIC 93.0), with both significantly contributing to abundance (Table 3 of the main paper). Models with emergent rock + maximum temperature (AIC 99.1), maximum temperature (AIC 103.1) and emergent rock (AIC 105) were less suitable. Area of emergent rock + average temperature explained ~31% of the variation for the count model. Area of emergent rock individually explained ~22% of the variation. Amphiprion akindynos Where A. akindynos was recorded, it was more abundant with a greater area of emergent rock and with a higher maximum temperature. The final model was emergent rock + maximum temperature (AIC 72.8), with both significantly contributing to abundance (Table 3 of the main paper). The model with emergent rock (AIC 73.0) was similar and maximum temperature (AIC 78.5) was less suitable. Area of emergent rock + maximum temperature explained ~24% of the variation for the count model. Area of emergent rock individually explained ~21% of the variation. Amphiprion latezonatus Where A. latezonatus was recorded, it was more abundant with a greater area of emergent rock and with higher average temperatures. The final model was emergent rock + average temperature (AIC 42.1), with emergent rock marginally (0.07) contributing to abundance (Table 3 of the main paper). Models with emergent rock + average temperature + distance from shore (AIC 49.0), emergent rock (AIC 49.6) or average temperature (AIC 51.8) were less Page 4 of 11

suitable. Area of emergent rock + average temperature explained ~68% of the variation for the count model and area of emergent rock individually explained ~31% of the variation. This was similar to that found for E. quadricolor. Dascyllus trimaculatus Where D. trimaculatus was recorded, it was more abundant with a greater area of emergent rock and less abundant with a decreasing minimum temperature. The final model was emergent rock + minimum temperature (AIC 112.3), with both significantly contributing to abundance (Table 3 of the main paper). Area of emergent rock + minimum temperature explained ~26% of the variation for the count model. Area of emergent rock individually explained ~21% of the variation. Entacmaea quadricolor Model: zero-inflated negative binomial Table S2. GLM output Bold variables show final models based on lowest AIC for each species Call: zeroinfl(formula = Eq ~emerge_rk + AvTemp, data = anemone, dist = negbin ) Pearson residuals 0.9688169 0.0824485 0.0011823 0.0001437 1.4700244 Count model coefficients (negbin with log link) (Intercept) 234.0401 83.0570 2.818 0.004835** emerge_rk 0.6385 0.1689 3.781 0.000156*** AvTemp 10.6490 3.8052 2.799 0.005133** Log(theta) 0.1311 0.7653 0.171 0.864014 Zero-inflation model coefficients (binomial with logit link) (Intercept) 549.089 1394.006 0.394 0.694 emerge_rk 1.918 5.256-0.365 0.715 AvTemp 25.231 63.918 0.395 0.693 Theta = 1.14 Number of iterations in Broyden Fletcher Goldfarb Shanno (BFGS) optimisation: 4951 Log-likelihood: 15.91 on 7 d.f. d.f. AIC 7 46.73931 emerg rk + dist shore 7 45.82391 emerg rk + av temp 5 51.50527 emerg rk Page 5 of 11

Diagnostic test on non-zero sites Emergent rock Emergent rock + Av temp Heteractis crispa Call: glm(formula = I(Eq > 0) ~emerge_rk, family = binomial(link = logit ), data = anemone) 1.4221 0.4576 0.3028 0.2914 2.3955 (Intercept) 3.1376 1.2710 2.469 0.0136 * emerge_rk 0.3891 0.1749 2.225 0.0261 * Null deviance: 23.053 on 20 d.f. Residual deviance: 16.013 on 19 d.f. Akaike information criterion: 20.013 glm(formula = I(Eq > 0) ~emerge_rk + AvTemp, family = binomial(link = logit ), data = anemone) 1.22566 0.12100 0.01047 0.00007 1.73737 Estimate s.e. z-value Pr(> z ) (Intercept) 409.373 411.131 0.996 0.319 emerge_rk 1.140 1.303 0.875 0.381 AvTemp 18.680 18.762 0.996 0.319 Null deviance: 23.0527 on 20 d.f. Residual deviance: 7.4646 on 18 d.f. Akaike informatio criterion: 13.465 Number of Fisher scoring iterations: 9 Model: zero-inflated negative binomial Call: zeroinfl(formula = Hc ~emerge_rk + AvTemp, data = anemone, dist = negbin, link = logit ) Pearson residuals 1.0067 0.5106 0.2718 0.3431 2.5022 Count model coefficients (negbin with log link) (Intercept) 131.68645 21.09320 6.243 4.29e-10*** emerge_rk 0.20212 0.04707 4.294 1.76e 05*** AvTemp 6.19892 0.97236 6.375 1.83e 10*** Log(theta) 1.94895 0.83850 2.324 0.0201* Zero-inflation model coefficients (binomial with logit link) (Intercept) 92.0926 67.6519 1.361 0.173 emerge_rk 0.3306 0.1657 1.995 0.046* AvTemp 4.2059 3.1336 1.342 0.180 Theta = 7.0213 Number of iterations in Broyden Fletcher Goldfarb Shanno (BFGS) optimisation: 61 Log-likelihood: 39.54 on 7 d.f. d.f. Akaike information criterion 7 93.08987 emerg rk + av temp 7 99.17366 emerg rk + maxtemp 5 103.15948 MaxTemp Page 6 of 11

Diagnostic test on non-zero sites Emergent rock Emergent rock + Av temp Call: glm(formula = I(Hc > 0) ~emerge_rk, family = binomial(link = logit ), data = anemone) 1.7068 0.6475 0.5759 0.6808 1.9383 (Intercept) 1.7127 0.7826 2.188 0.0286* emerge_rk 0.3057 0.1394 2.193 0.0283* Signif. codes 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Null deviance: 27.910 on 20 d.f. Residual deviance: 21.983 on 19 d.f. AIC: 25.983 glm(formula = I(Hc > 0) ~emerge_rk + AvTemp, family = binomial(link = logit ), data = anemone) 1.4027 0.7472 0.4278 0.4364 2.1703 (Intercept) 98.2909 65.3569 1.504 0.133 emerge_rk 0.3437 0.1639 2.097 0.036* AvTemp 4.4897 3.0289 1.482 0.138 Signif. codes 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Null deviance: 27.91 on 20 d.f. Residual deviance: 19.38 on 18 d.f. Akaike informatio criterion: 25.38 Number of Fisher scoring iterations: 5 Page 7 of 11

Amphiprion akindynos Model: zero-inflated negative binomial Diagnostic test on non-zero sites Emergent rock Call: zeroinfl(formula = Aa ~emerge_rk + MaxTemp, data = anemone, dist = negbin ) Pearson residuals 1.1880 0.5140 0.3180 0.1905 2.8810 Count model coefficients (negbin with log link) (Intercept) 9.93833 1.934 0.0531 19.21787 emerge_rk 0.10066 0.05013 2.008 0.0447* MaxTemp 0.76966 0.38042 2.023 0.0431* Log(theta) 9.58731 107.92486 0.089 0.9292 Zero-inflation model coefficients (binomial with logit link) (Intercept) 24.3071 31.2929 0.777 0.4373 emerge_rk 0.3131 0.1529 2.048 0.0405* MaxTemp 0.8630 1.1864 0.727 0.4670 Signif. codes 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Theta = 14578.6547 Number of iterations in BFGS optimisation: 138 Log-likelihood: 29.42 on 7 d.f. d.f. AIC 7 72.83895 emerg rk + maxt 5 73.01584 emerg rk 5 78.52842 maxt Call: glm(formula = I(Aa > 0) ~emerge_rk, family = binomial(link = logit ), data = anemone) 1.7068 0.6475 0.5759 0.6808 1.9383 (Intercept) 1.7127 0.7826 2.188 0.0286* emerge_rk 0.3057 0.1394 2.193 0.0283* Null deviance: 27.910 on 20 d.f. Residual deviance: 21.983 on 19 d.f. Akaike information criterion: 25.983 Page 8 of 11

Emergent rock + Max.Temp glm(formula = I(Aa > 0) ~emerge_rk + Max.Temp, family = binomial(link = logit ), data = anemone) 1.5456 0.7510 0.5110 0.4994 2.1076 (Intercept) 26.6429 30.9108 0.862 0.3887 emerge_rk 0.3204 0.1508 2.124 0.0336* MaxTemp 0.9488 1.1724 0.809 0.4183 Null deviance: 27.910 on 20 d.f. Residual deviance: 21.296 on 18 d.f. Akaike information criterion: 27.296 Number of Fisher scoring iterations: 4 Amphiprion latezonatus Model: zero-inflated negative binomial Call: zeroinfl(formula = Al ~emerge_rk + AvTemp, data = anemone, dist = negbin ) Pearson residuals 1.501525 0.065799 0.006035 0.002055 1.603183 Count model coefficients (negbin with log link) (Intercept) 92.90266 68.48319 1.357 0.1749 emerge_rk 0.17044 0.09543 1.786 0.0741. AvTemp 4.30423 3.13232 1.374 0.1694 Log(theta) 10.49798 n.a. n.a. n.a. Zero-inflation model coefficients (binomial with logit link) (Intercept) 407.509 495.143 0.823 0.411 emerge_rk 1.180 1.583 0.745 0.456 AvTemp 18.595 22.592 0.823 0.410 Theta = 36242.3641 Number of iterations in Broyden Fletcher Goldfarb Shanno (BFGS) optimisation: 117 Log-likelihood: 14.09 on 7 d.f. d.f. Akaike information criterion 5 49.62044 emerg rk 5 51.85001 av temp [1] 42.18975 emerg rock + av temp Page 9 of 11

Diagnostic test on non-zero sites Emergent rock glm(formula = I(Al > 0) ~emerge_rk, family = binomial(link = logit ), data = anemone) 1.4221 0.4576 0.3028 0.2914 2.3955 (Intercept) 3.1376 1.2710 2.469 0.0136* emerge_rk 0.3891 0.1749 2.225 0.0261* Null deviance: 23.053 on 20 d.f. Residual deviance: 16.013 on 19 d.f. Akaike information criterion: 20.013 Number of Fisher scoring iterations: 5 Emergent rock + Average Temp Call: glm(formula = I(Al > 0) ~emerge_rk + AvTemp, family = binomial(link = logit ), data = anemone) 1.22566 0.12100 0.01047 0.00007 1.73737 Estimate s.e. z-value Pr(> z ) (Intercept) 409.373 411.131 0.996 0.319 emerge_rk 1.140 1.303 0.875 0.381 AvTemp 18.680 18.762 0.996 0.319 Null deviance: 23.0527 on 20 d.f. Residual deviance: 7.4646 on 18 d.f. Akaike informatin criterion: 13.465 Number of Fisher scoring iterations: 9 Dascyllus trimaculatus Model: zero-inflated negative binomial Call: zeroinfl(formula = Dt ~emerge_rk + MinTemp, data = anemone, dist = negbin ) Pearson residuals 0.71491 0.42456 0.19496 0.06307 1.63963 Count model coefficients (negbin with log link) (Intercept) 51.0409 13.1541 3.880 0.000104*** emerge_rk 1.0776 0.2248 4.793 1.64e-06*** MinTemp 3.3528 0.8850 3.788 0.000152*** Log(theta) 0.4188 0.6817 0.614 0.538978 Zero-inflation model coefficients (binomial with logit link) (Intercept) 65.801 66.294 0.993 0.321 emerge_rk 1.174 1.297 0.905 0.365 MinTemp 4.664 4.710 0.990 0.322 Theta = 0.6578 Number of iterations in Broyden Fletcher Goldfarb Shanno (BFGS) optimisation: 151 Log-likelihood: 49.16 on 7 d.f. Akaike information criterion: 112.32 emergent rock + mintemp Page 10 of 11

Diagnostic test on non-zero sites Emergent rock Emergent rock + mintemp glm(formula = I(Dt > 0) ~emerge_rk, family = binomial(link = logit ), data = anemone) 1.7068 0.6475 0.5759 0.6808 1.9383 (Intercept) 1.7127 0.7826 2.188 0.0286* emerge_rk 0.3057 0.1394 2.193 0.0283* Signif. codes 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Null deviance: 27.910 on 20 d.f. Residual deviance: 21.983 on 19 d.f. Akaike information criterion: 25.983 Number of Fisher scoring iterations: 4 glm(formula = I(Dt > 0) ~emerge_rk + MinTemp, family = binomial(link = logit ), data = anemone) 1.8040 0.7584 0.3958 0.6886 1.6069 (Intercept) 8.0768 8.8291 0.915 0.3603 emerge_rk 0.4142 0.1886 2.196 0.0281* MinTemp 0.6462 0.5921 1.091 0.2751 Null deviance: 27.910 on 20 d.f. Residual deviance: 20.703 on 18 d.f. Akaike information criterion: 26.703 Number of Fisher scoring iterations: 4 Page 11 of 11