ENMTools: a toolbox for comparative studies of environmental niche models

Similar documents
Geographical Information Systems

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2014 University of California, Berkeley

C ommunications. Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria

Karsten Vennemann, Seattle. QGIS Workshop CUGOS Spring Fling 2015

Applying GIS to Hydraulic Analysis

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

ArcGIS Platform For NSOs

LAND CHANGE MODELER SOFTWARE FOR ARCGIS

Multi-scale modeling of species distributions, hydrology, & gene flow

The Implementation of Autocorrelation-Based Regioclassification in ArcMap Using ArcObjects

CyberGIS: What Still Needs to Be Done? Michael F. Goodchild University of California Santa Barbara

Outline. - Background of coastal and marine conservation - Species distribution modeling (SDM) - Reserve selection analysis. - Results & discussion

Gary G. Mittelbach Michigan State University

Write a report (6-7 pages, double space) on some examples of Internet Applications. You can choose only ONE of the following application areas:

ORIGINAL ARTICLE. John E. McCormack, 1,2 Amanda J. Zellmer, 1,3 and L. Lacey Knowles 1,4

GIS Options RELU Upland Moorland Scoping Study Project CCG/SoG Working Paper, February 2005 Andy Turner

Geodatabase An Overview

HIGH RESOLUTION BASE MAP: A CASE STUDY OF JNTUH-HYDERABAD CAMPUS

Managing Uncertainty in Habitat Suitability Models. Jim Graham and Jake Nelson Oregon State University

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week:

Lecture 2. A Review: Geographic Information Systems & ArcGIS Basics

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

LATITUDINAL VARIATION IN SPECIATION MECHANISMS IN FROGS

Land-Use Land-Cover Change Detector

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health

Package phyloclim. R topics documented: February 20, 2015

Geographic Systems and Analysis

Among various open-source GIS programs, QGIS can be the best suitable option which can be used across partners for reasons outlined below.

FIRE DEPARMENT SANTA CLARA COUNTY

Explanation and guidance for a decision-support tool to help manage post-fire Black-backed Woodpecker habitat

Chad Burrus April 6, 2010

LAB EXERCISE #4 Modeling Connectivity

The two foundation ideas or concepts that are of vital importance in GIS work (in my humble opinion) are:

Outline. Practical Point Pattern Analysis. David Harvey s Critiques. Peter Gould s Critiques. Global vs. Local. Problems of PPA in Real World

ArcGIS Pro: Essential Workflows STUDENT EDITION

SDMtoolbox 2.0 User Guide

Hydraulic Processes Analysis System (HyPAS)

Exelis and Esri Technologies for Defense and National Security. Cherie Muleh

Complex patterns of environmental niche evolution in Iberian columbines (genus Aquilegia, Ranunculaceae)

VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis

Geodatabase An Introduction

Lauren Jacob May 6, Tectonics of the Northern Menderes Massif: The Simav Detachment and its relationship to three granite plutons

Welcome! Text: Community Ecology by Peter J. Morin, Blackwell Science ISBN (required) Topics covered: Date Topic Reading

CHAPTER 7 PRODUCT USE AND AVAILABILITY

Diversity partitioning without statistical independence of alpha and beta

Lecture 5. GIS Data Capture & Editing. Tomislav Sapic GIS Technologist Faculty of Natural Resources Management Lakehead University

)UDQFR54XHQWLQ(DQG'tD]'HOJDGR&

QGIS FLO-2D Integration

Species distribution modelling with MAXENT Mikael von Numers Åbo Akademi

Introduction to Geographic Information Systems

What are the five components of a GIS? A typically GIS consists of five elements: - Hardware, Software, Data, People and Procedures (Work Flows)

WindNinja Tutorial 3: Point Initialization

GEOGRAPHIC INFORMATION SYSTEMS (GIS)

Spatial Effects on Current and Future Climate of Ipomopsis aggregata Populations in Colorado Patterns of Precipitation and Maximum Temperature

Data Dictionary for Network of Conservation Areas Transcription Reports from the Colorado Natural Heritage Program

YYT-C3002 Application Programming in Engineering GIS I. Anas Altartouri Otaniemi

Geodatabase An Introduction

MODELLING AND UNDERSTANDING MULTI-TEMPORAL LAND USE CHANGES

GENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax:

REFERENCING COORDINATE SYSTEMS MAP PROJECTIONS GEOREFERENCING

ArcGIS Role in Maxent Modeling

GIS at JLAB. T. Larrieu Jefferson Lab, Newport News, VA 23606, USA. Controls Group. October 2006 PCaPAC

Distance-based test for uncertainty hypothesis testing

NR402 GIS Applications in Natural Resources

Objectives Define spatial statistics Introduce you to some of the core spatial statistics tools available in ArcGIS 9.3 Present a variety of example a

Search for the Dubai in the remap search bar:

MERGING (MERGE / MOSAIC) GEOSPATIAL DATA

Projecting a Gully on Wilson Ranch Meadow, Eldorado National Forest By David Russell and Angelina Lasko Humboldt State University

Evaluating Corridors. Jeff Jenness Dan Majka Paul Beier. CorridorDesigner Workshop Fall 2007

WindNinja Tutorial 3: Point Initialization

Gridded Ambient Air Pollutant Concentrations for Southern California, User Notes authored by Beau MacDonald, 11/28/2017

An introduction to the picante package

Esri UC2013. Technical Workshop.

Overview Objectives Materials Background Activity Procedure

GEOGRAPHY MA, ASSESSMENT REPORT AY

A Temporal Hydrologic Database for Rapidly Changing Landscapes

BLAST: Target frequencies and information content Dannie Durand

K1D: Multivariate Ripley s K-function for one-dimensional data. Daniel G. Gavin University of Oregon Department of Geography Version 1.

CHAPTER 3 RESEARCH METHODOLOGY

Digitization of the Beatty Odonata Collection at the Frost Entomological Museum (PSUC): the Terrain of Ecological Niche Modeling

IndiFrag v2.1: An Object-based Fragmentation Analysis Software Tool

Enabling ENVI. ArcGIS for Server

PHYLOGENETIC ANALYSIS OF THE EVOLUTION OF THE NICHE IN LIZARDS OF THE ANOLIS SAGREI GROUP

Sleep data, two drugs Ch13.xls

LAB EXERCISE #2 Quantifying Patch Mosaics

Measuring earthquake-generated surface offsets from high-resolution digital topography

Inquiry: The University of Arkansas Undergraduate Research Journal. Volume 5 Article 15

Lecture 6 - Raster Data Model & GIS File Organization

Hydroacoustic survey and bathymetric map creation for Brant Lake, New York

PASS Sample Size Software. Poisson Regression

Using GIS to Delineate Watersheds Ed Poyer NRS 509, Fall 2010

Minimum Standards for Wetland Delineations

Louisiana Transportation Engineering Conference. Monday, February 12, 2007

An Introduction to the Community Maps Information Model

Giant Kangaroo Rat Dispersion Analysis

Search for the Gulf of Carpentaria in the remap search bar:

BUILDING A GRIDDED CLIMATOLOGICAL DATASET FOR USE IN THE STATISTICAL INTERPRETATION OF NUMERICAL WEATHER PREDICTION MODELS

Introduction. Project Summary In 2014 multiple local Otsego county agencies, Otsego County Soil and Water

Implementation of CLIMAP and GIS for Mapping the Climatic Dataset of Northern Iraq

Geometric Algorithms in GIS

Transcription:

Ecography 33: 607611, 2010 doi: 10.1111/j.1600-0587.2009.06142.x # 2010 The Authors. Journal compilation # 2010 Ecography Subject Editor: Thiago Rangel. Accepted 4 September 2009 ENMTools: a toolbox for comparative studies of environmental niche models Dan L. Warren, Richard E. Glor and Michael Turelli D. L. Warren (dan.l.warren@gmail.com) and M. Turelli, Dept of Evolution and Ecology, Univ. of California, Davis, CA 95616, USA. R. E. Glor, Dept of Biology, Univ. of Rochester, Rochester, NY 14627, USA. We present software that facilitates quantitative comparisons of environmental niche models (ENMs). Our software quantifies similarity of ENMs generated using the program Maxent and uses randomization tests to compare observed similarity to that expected under different null hypotheses. ENMTools is available online free of charge from /<http:// purl.oclc.org/enmtools/>. Environmental niche models are used to describe the ecological tolerances of populations or species. A range of methods generate ENMs from georeferenced occurrence data (i.e. sample localities associated with latitude and longitude coordinates) and environmental data in the form of geographic information system (GIS) data layers (reviewed in Elith et al. 2006). These ENMs can predict the suitability of habitats across a landscape and are routinely used in conservation planning and biodiversity inventories (Kremen et al. 2008). A growing number of studies compare predicted habitat suitabilities generated by ENMs from different species or populations to aid in species delimitation or test hypotheses about niche conservation (Buermann et al. 2008, Lentz et al. 2008, Warren et al. 2008). Comparative analyses of ENMs also permit studies of niche evolution in a phylogenetic framework. Warren et al. (2008) introduced a set of comparative similarity measures and statistical tests that permit quantitative comparison of ENMs. Here we provide an implementation of those methods called ENMTools. ENMTools interacts with the maximum entropy niche modeling program Maxent (Phillips et al. 2006, Phillips and Dudík 2008), allowing users to automate generation of ENMs, calculate similarity measures, and implement various statistical comparisons of ENMs. Although ENMTools is designed to interact directly only with Maxent, the pseudoreplicate data sets that it generates for hypothesis testing can be used with other methods of ENM construction. The similarity measures and tests discussed here are used on numerous examples in Warren et al. (2008). Other tools for making measurements on ENMs and comparing them are in development. Implementation and required input ENMTools is a Perl script with a graphical user interface written with the Tk package. It is also available as an executable file for Windows and Macintosh. In theory, the ENMTools Perl script may be implemented on any platform capable of running Perl and the associated Tkx package with minor adjustments. In practice, ENMTools has been implemented only on the Microsoft Windows and Mac OS X platforms. ENMTools requires working installations of Maxent and Java. Input for ENMTools is similar to that required by the widely used Maxent niche modeling software and consists of simple.asc and.csvformatted text files containing locality data for two or more populations and associated ASCII-formatted GIS raster layers of environmental data. Tests implemented in ENMTools Quantifying niche similarity Measurement of similarity among ENMs is central to comparative analyses. ENMTools quantifies niche similarity using the two measures introduced by Warren et al. (2008): Schoener s (1968) D and a measure derived from Hellinger distance called I. These similarity measures are obtained by comparing the estimates of habitat suitability calculated for each grid cell of a study area using a Maxentgenerated ENM, after normalizing each ENM so that all suitability scores within the geographic space sum to 1. Schoener s D is defined as D(p X ; p Y )1 1 X jp X;i p Y;i j; 2 i 607

where p X,i and p Y,i are the normalized suitability scores for species X and Y in grid cell i. I is defined as sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi I (p X ; p Y )1 1 X p ffiffiffiffiffiffiffi pffiffiffiffiffiffi 2 p X;i p Y;i : 2 i Note that I is 1 Hellinger s distance from probability theory; similarly, D is 1 total variation distance. Both similarity measures range from 0, when species predicted environmental tolerances do not overlap at all (i.e. p X,i p Y,i 0 for all i), to 1, when all grid cells are estimated to be equally suitable for both species (i.e. p X,i p Y,i for all i). We note that an actual suitability of 0 is rarely produced by Maxent for most real data sets unless a minimum suitability threshold is applied, as Maxent ENMs predict a nonzero suitability score at every grid cell. However, values arbitrarily close to 0 are possible. While these two similarity measures are calculated in a very similar fashion and are both simply related to standard measures of distance from probability theory, Schoener s D carries an historical ecological interpretation that is not attached to I. The ecological interpretation of D implicitly assumes that the suitability scores p X,i produced by Maxent are proportional to species abundance, whereas I simply treats the two ENMs as probability distributions. If localities for only two populations are loaded into ENMTools, single values of D and I will be produced. If three or more populations are loaded into ENMTools, pairwise D and I values will be calculated for each possible comparison. Pairwise similarity values or tables of pairwise similarity values can be opened and viewed in a simple text editor or in commercially available spreadsheet software such as Microsoft Excel. Tests of niche similarity ENMTools implements two quantitative tests of niche similarity introduced by Warren et al. (2008). These tests ask whether the ENMs generated from two populations are identical, or, at the other extreme, merely more similar than expected by chance. Identity test The first comparative hypothesis tested by ENMTools is whether the ENMs produced by two populations are identical. This hypothesis is addressed with the niche identity test. The test begins by pooling the georeferenced data points for a pair of populations, randomizing the population identities of these data points, and extracting two new population samples with the same sizes as the two original samples. This process is repeated to generate a user-specified number of pseudoreplicated data sets. For each pseudoreplicate, ENMTools calls on Maxent to generate an ENM from each of the two populations. ENMTools then uses the model and predicted suitability scores generated by Maxent from each population to calculate niche similarity indices (e.g. I and Schoener s D). In this manner, ENMTools obtains a distribution of overlap scores between populations drawn from a shared distribution, assuming that the populations are interchangeable in their use of niche space. The actual observed measures of niche similarity between populations can be compared to this null distribution to test whether they are significantly different from those generated assuming no niche differentiation: the hypothesis of niche identity is rejected when the empirically observed value for I and/or D is significantly lower than the values expected from the pseudoreplicated data sets. As noted in Warren et al. (2008), the identity test will accurately reflect the probabilities under the null hypothesis if sampling is unbiased with respect to the species environmental tolerances (i.e. spatial sampling is such that the relationship between environmental variables and species occurrence accurately represents the suitability of habitat). However, biases can be introduced by many factors, including differences in sampling effort between populations, spatial bias in sampling, and differences in the habitat available to populations in geographic regions where they do not overlap. The identity test should be interpreted with these caveats in mind. Background test In addition to testing whether the ENMs generated from two species are identical, one may also want to determine whether ENMs are more similar than expected by chance, based on the geographical regions in which they reside (as opposed to the exact locations in which specimens are collected). This type of analysis is particularly important when allopatric populations are being compared, because some niche differentiation may follow inevitably from the fact that distinct geographic regions rarely encompass identical distributions of environmental variables. The background test in ENMTools may be used to ask whether the ENMs obtained from two allopatrically distributed populations are more different than would be expected given the underlying environmental differences between the regions in which they occur. The background test addresses this hypothesis by generating a null distribution for the ENM difference expected between one population and occurrence points placed at random within the range of another population (i.e. not the specific locations at which the other population was sampled, but within a user-specified area that includes all of those samples). The points to be treated as environmental background can either be specified as a set of commaseparated coordinates, or can be sampled from an ASCII raster file. If the observed values of the niche similarity measures obtained from the two original populations are significantly higher (or lower) than expected from this null distribution, the null hypothesis that similarity (or divergence) between species is no more than expected based on the availability of habitat is rejected. While the null hypothesis addressed by the background test may be more biologically meaningful than that of the identity test for allopatric populations, the outcome and interpretation of the test may be sensitive to the definition of the environmental background from which each species is presumed to be selecting habitat. Whenever possible, a biological justification should be made for the definition of background used for a particular study. In the absence of a compelling biological reason for selecting background regions to sample from, background samples can be taken 608

from buffer zones around known occurrences or from range maps estimated using one of the many algorithms available for this purpose, such as minimum convex polygon (e.g. as implemented in ArcGIS (ESRI 2006)) or localized convex hull (Getz and Wilmers 2004). When there are questions about the definition of environmental background for a particular study, users are advised to assess the sensitivity of their conclusions to selection of alternative background regions. Empirical examples of both tests are provided in Fig. 1 and 2, using data from Anolis ahli and Anolis allogus (Knouft et al. 2006). The tests show that the two species are Figure 1. Sampling for identity test. In the identity test, the similarity score for ENMs built from known occurrences of two species (panel A) is compared to the distribution of similarity scores between ENMs built from occurrences drawn randomly from the pooled occurrences for the two species (panels BD). The entirety of Cuba was treated as the study area for ENM construction. This example compares the environmental niches of Anolis allogus (west) and Anolis ahli. As seen in panel D, the similarity score for ENMs built for the actual occurrences of the two species (black arrow) is much lower than expected based on the null hypothesis of niche equivalency, indicating that the two species environmental niches are not equivalent. 609

not drawn from identical distributions of environmental variables (identity test), and are in fact more different from each other than would be expected under the hypothesis that the differences between the two species are due only to differences in the habitat available to them. Users should note that the background test can yield different results when the comparison is reversed; it is possible for species to be more similar than expected based on the habitat available for one species, but less than expected based on the habitat available for the other. Figure 2. Sampling for background test. In the background test, the similarity score for ENMs built from known occurrences of two species (panel A) is compared to similarity scores for ENMs constructed using points drawn at random from the region defined as environmental background for one of the species (panels BD). In this example, occurrence points for Anolis allogus (west) are being compared to points drawn at random from the environmental background that has been chosen for Anolis ahli. In this case, the background (blue region, panel B) was chosen arbitrarily to be all points within 0.5 decimal degrees of a known occurrence of A. ahli. As seen in panel D, the observed overlap between the two species (black arrow) is lower than expected under the null hypothesis, indicating that the two species are more divergent than expected based on the habitat available to A. ahli. 610

The ENMTools software, manual, and sample data are available at the ENMTools web site /<http://purl.oclc.org/ enmtools/>. To cite ENMTools or acknowledge its use, cite this Software note as follows, substituting the version of the application that you used for Version 0 : Warren, D. L., Glor, R. E. and Turelli, M. 2009. ENMTools: a toolbox for comparative studies of environmental niche models. Ecography 33: 607611 (Version 0). Acknowledgements The authors would like to thank the reviewers for useful comments and early users of ENMTools who have been indispensible in suggesting features and bug fixes. This work was funded in part by the Center for Population Biology at UC Davis, the John Muir Inst. for the Environment, the California Dept of Fish and Game, NSF grant DEB 0920892 to REG, and NSF grant DEB 0815145 to MT. References Buermann, W. et al. 2008. Predicting species distributions across the Amazonian and Andean regions using remote sensing data. J. Biogeogr. 35: 11601176. Elith, J. et al. 2006. Novel methods improve prediction of species distributions from occurrence data. Ecography 29: 129151. ESRI 2006. ArcGIS 9.2. Environmental Systems Research Inst., Redlands, CA. Getz, W. and Wilmers, C. 2004. A local nearest-neighbor convexhull construction of home ranges and utilization distributions. Ecography 27: 489505. Knouft, J. H. et al. 2006. Phylogenetic analysis of the evolution of the niche in lizards of the Anolis sagrei group. Ecology 87: S29S38. Kremen, C. et al. 2008. Aligning conservation priorities across taxa in Madagascar with high-resolution planning tools. Science 320: 222226. Lentz, D. L. et al. 2008. Ecological niche modeling and distribution of wild sunflower (Helianthus anuus l.) in Mexico. Int. J. Plant Sci. 169: 541549. Phillips, S. J. et al. 2006. Maximum entropy modeling of species geographic distributions. Ecol. Model. 190: 231259. Phillips, S. J. and Dudík, M. 2008. Modeling of species distributions with Maxent: new extensions and a comprehensive evaluation. Ecography 31: 161175. Schoener, T. W. 1968. The Anolis lizards of Bimini: resource partitioning in a complex fauna. Ecology 49: 704726. Warren, D. L. et al. 2008. Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evolution 62: 28682883. 611