Biotic Element Analysis in Biogeography

Size: px
Start display at page:

Download "Biotic Element Analysis in Biogeography"

Transcription

1 Syst. Biol. 52(5): , 2003 Copyright c Society of Systematic Biologists ISSN: print / X online DOI: / Biotic Element Analysis in Biogeography BERNHARD HAUSDORF 1 AND CHRISTIAN HENNIG 2 1 Zoologisches Museum der Universität Hamburg, Martin-Luther-King-Platz 3, Hamburg, Germany; hausdorf@zoologie.uni-hamburg.de 2 ETH Zürich (LEO), Seminar für Statistik, 8092 Zürich, Switzerland; hennig@stat.math.ethz.ch, and Universität Hamburg, Fachbereich Mathematik SPST, Bundesstrasse 55, Hamburg, Germany Abstract. Biotic element analysis is an alternative to the areas-of-endemism approach for recognizing the presence or absence of vicariance events in a given region. If an ancestral biota was fragmented by vicariance events, biotic elements or clusters of distribution areas should emerge. We propose a statistical test for clustering of distribution areas based on a Monte Carlo simulation with a null model that considers the spatial autocorrelation in the data. The hypothesis tested is that the observed degree of clustering of ranges can be explained by the range size distribution, the varying number of taxa per cell, and the spatial autocorrelation of the occurrences of a taxon alone. A method for the delimitation of biotic elements which uses model-based Gaussian clustering is introduced. We demonstrate our methods and show the importance of grid size by means of a case study, an analysis of the distribution patterns of southern African species of the weevil genus Scobius. The example highlights the difficulties in delimiting areas of endemism if dispersal has occurred and illustrates the advantages of the biotic element approach. [Area of endemism; biogeography; biotic elements; null model; Scobius; South Africa; vicariance.] According to the vicariance model (Croizat et al., 1974; Rosen, 1976, 1978; Platnick and Nelson, 1978; Nelson and Platnick, 1981; Wiley, 1988; Humphries and Parenti, 1999), an ancestral biota was fragmented by the appearance of a barrier. The barrier interrupted the gene flow between the populations separated by the barrier and, consequently, this vicariance event resulted in allopatric speciation of many of the species formerly constituting the ancestral biota. In this way, two new biotas originate, separated by the barrier. By repetitions of this process, areas of endemism with distinct biotas, i.e., with many species restricted to the individual areas, emerge. On average, the ranges of the species that have originated in the same area of endemism will be more similar to each other than to ranges of species that have originated in other areas of endemism. Thus, the vicariance model predicts a nonrandom congruence of species ranges (Morrone, 1994; Hausdorf, 2002). The existence of such a distribution pattern has not been tested in most biogeographic analysis. Only recently, Mast and Nyffeler (2003) introduced a test for the nonrandom co-occurrence of pairs of taxa into the vicariance biogeography framework. It must be emphasized that the pattern for which we can examine distribution data is nonrandom congruence, i.e., clustering of ranges. Areas of endemism as such cannot be found in distribution data. They can only be inferred from clusters of species ranges. This distinction is obscurred in the operational methods for identifying and delimiting areas of endemism (Morrone, 1994; Linder, 2001; Szumik et al., 2002; Mast and Nyffeler, 2003). The delimitation of areas of endemism is not problematic as long as species do not disperse across the barriers separating the areas of endemism. Under these conditions, the borders of the areas of endemism can be drawn between the range clusters. However, usually there is stochastic dispersal of species across barriers with time. Moreover, many barriers weaken or disappear with time. If dispersal across the barriers that separated the areas of endemism resulted in an overlap of ranges of species which originated in different areas of endemism, biogeographical data alone are insufficient for delimiting areas of endemism (Hausdorf, 2002). However, biotic elements, i.e., groups of taxa whose ranges are more similar to each other than to those of taxa of other such groups (Hausdorf, 2002), can be recognized, even if some of the species that originated by vicariance dispersed across the barriers that separated the areas of endemism. Previous computational approaches for the determination of biotic elements did not provide a decision about how many biotic elements should be recognized and which species cannot be adequately assigned to any biotic element (e.g., Holloway and Jardine, 1968; Jardine, 1972; Dennis et al., 1998). We propose a statistical test for clustering of distribution areas based on a Monte Carlo simulation and a method for the delimitation of biotic elements which uses model-based Gaussian clustering. MATERIAL AND METHODS Distribution Data As a case study, we analyzed the ranges of the 47 southern African weevil species of the genus Scobius Schoenherr, because this example was previously used to introduce three other methods for the analysis of distribution patterns (Morrone, 1994; Szumik et al., 2002; Mast and Nyffeler, 2003). We used Morrone s (1994) presence/absence matrix for 2 latitude 2 longitude grid cells as corrected by Mast and Nyffeler (2003) as well as the presence/absence matrix for 1 latitude 1 longitude grid cells generated by Mast and Nyffeler (2003). Test for Clustering of Distribution Areas As first step in the analysis, we investigated whether distribution areas are clustered. We test the hypothesis that the observed degree of clustering of ranges can be 717

2 718 SYSTEMATIC BIOLOGY VOL. 52 explained by the varying number of taxa per cell and the spatial autocorrelation of the occurrences of a taxon alone. This test is based on distances between the ranges of the examined taxa. Three specifications must be made: a distance measure, a test statistic, and a null model for the generation of sets of ranges. The distribution of the test statistics under the null hypothesis is approximated by Monte Carlo simulation, which depends on some parameters which are estimated from the data. Details of the test procedure are described by Hennig and Hausdorf (in press). The tests as well as the subsequently described method for the delimitation of biotic elements is implemented in the program package PRABCLUS which is an add-on package for the statistical software R. These programs are available at Distance measure. R ={1,...,c} is a set of geographic units (e.g., c = 25 cells). The range of a taxon A is the subset of R where the taxon is present. Let A denote the number of elements of a set A, e.g., the number of cells occupied by the taxon. If A 1,A 2 R are the ranges of two taxa, A 1 A 2 is the number of cells where both taxa are present. Often, the Jaccard distance d J is used in biogeographic analyses (e.g., Jardine, 1972; Dennis et al., 1998): d J (A 1,A 2 ) = 1 A 1 A 2 A 1 A 2. (1) We found that this distance measure is not appropriate for our analyses because it is approximately equal to 1 if one range is much smaller than the other, even if the smaller range is a subset of the larger one. However, it might well be that one species occupies only a small part of an area of endemism (e.g., because it is restricted to a rare habitat), whereas another species that originated in the same area of endemism has a much larger range because it occurs in several habitats. In such a case, the distance between the ranges should not approach 1, because we would like to recognize that they belong to the same biotic element. Therefore the Jaccard distance is misleading because it relates the common distribution area only to the size of the more widely distributed species (which equals the union if the smaller area is a subset of the larger one). We used the Kulczynski distance d K (1 Kulczynski unnamed 2 ; Shi, 1993): d K (A 1,A 2 ) = ( A1 A 2 A 1 + A ) 1 A 2, (2) A 2 which relates the common area to both ranges of the individual species in a properly balanced way. This distance approximates 0.5 when one range is small subset of the other, as opposed to 1 for species whose ranges do not overlap. Thus, it is better suited for our analysis. Test statistics. A significant clustering of ranges means that the distances are small between ranges of the same cluster, while the distances between ranges of distinct clusters are large. The variation of distances of a homogeneously distributed set of ranges is expected to be lower, since there is no clear distinction between ranges that belong together and ranges that should be separated. Based on these considerations, we propose a new test statistic. If n is the number of ranges, then there are n(n 1)/2 distances between ranges. Let d 1:n(n 1)/2 d 2:n(n 1)/2 d n(n 1)/2:n(n 1)/2 denote the ordered Kulczynski distances. Let 0 <π<0.5 be a proportion. Then the test statistic T := i πn(n 1)/2 d i:n(n 1)/2 i (1 π)n(n 1)/2 d, (3) i:n(n 1)/2 i.e., the ratio between the π smallest and the π largest distances, can be expected to be small for clustered data compared to homogeneous data. The proportion of distances that occur inside clusters is not known, so it is not immediately clear how to choose π. However, in our setup the result does not turn out to be very sensitive to the choice of π.weuseπ = 0.25, i.e., half of the distances are used for the test. See also Hennig and Hausdorf, in press, for a comparison of test statistics for clustering including the present proposal. Null model. The null model should simulate the case in which all inhomogeneities of the data can be attributed to varying range sizes, to varying numbers of taxa per geographic unit, and the spatial autocorrelation of the occurrences of a taxon. We developed a null model in which the nonoccurrence of range clusters is modeled so that all ranges are generated independently according to the same probabilistic routine. This routine yields ranges such that their cell number distribution approximates the actual distribution of the number of cells per range, the richness distribution of the cells approximates the actual richness distribution of the cells, and the tendency to form disjunct areas is governed by a parameter which is estimated from the real data set. Computational details have been described elsewhere (Hausdorf and Hennig, 2003; Hennig and Hausdorf, in press). Determination of Biotic Elements For the determination of biotic elements, we used model-based Gaussian clustering (MBGC) as implemented in the software MCLUST (Fraley and Raftery, 1998), because this method provides a decision about the number of meaningful clusters and about ranges that cannot be assigned adequately to any biotic elements. MBGC operates on a data set where the cases are defined by variables of metric scale. Therefore, we performed nonmetric multidimensional scaling (MDS; Kruskal, 1964) on the matrix of Kulczynski distances. The principle of MBGC is as follows. For a given number of clusters k and a number of variables p, MBGC fits

3 2003 HAUSDORF AND HENNIG BIOTIC ELEMENT ANALYSIS 719 a probability mixture model of the form ε 0 U C + k ε i N p (µ i, i ), i=1 k ε i = 0, ε i 0, i=1 i = 0,...,k (4) to the data by means of maximum likelihood estimation. N p (µ, ) denotes a p-dimensional normal distribution with mean vector µ and covariance matrix, and U C denotes the uniform distribution on a convex set C containing all data points. This means that the k clusters are modeled by distinct p-dimensional normal distributions (ε i gives the proportion of the ith cluster). All data points that do not fit well into any normal cluster are assigned to the mixture component defined by U C ( noise component ). A data point x is assigned to the mixture component j that maximizes ε j f j (x) over j = 0,...,k, where f j (x) denotes its estimated probability density. Such a fit is calculated for various values of k, the number of clusters. An optimal k can be found by means of the Bayesian information criterion. The same criterion is used to choose an optimal model for the relation between the covariance matrices of the different clusters (see Fraley and Raftery, 1998, for further details). MCLUST needs an initial estimate of noise, which was performed by the software NNCLEAN (Byers and Raftery, 1998) as suggested by Fraley and Raftery (1998). We used as NNCLEAN constant k = number of species/40 and four MDS dimensions. RESULTS Test for Clustering of Distribution Areas The test statistic T, i.e., the ratio of the sum of the 25% smallest d K distances to the sum of the 25% largest dis- tances, is for the southern African Scobius species data set generated with a 2 grid. It is significantly smaller (P = 0.01) than should be expected under our null model (for 1000 artificial populations, T varied between and 0.446, mean = 0.335). The test indicates that the distribution areas of the southern African Scobius species are significantly clustered. In contrast, the test with the data set generated with a 1 grid indicates that there is no significant clustering of distribution areas (T = 0.415; T varied between and for 1,000 artificial populations, mean = 0.478; P = 0.09). Determination of Biotic Elements A nonmetric MDS representation of the partition of the 47 southern African Scobius species to biotic elements is shown in Figure 1. If 2 grid squares are used, as in the analysis of Morrone (1994), most species ranges were assigned to four biotic elements. Only four ranges (8.5%) are classified in the noise component. The geographic distribution of the four biotic elements is shown in Figure 2. Biotic element 1 (seven species) is restricted to northeastern South Africa. The highest species richness can be found in grid squares B, E, and F (nomenclature of the grid cells after Morrone, 1994; Mast and Nyffeler, 2003). The geographic center of biotic element 2 (18 species) is in Natal, in the grid squares I, J, L, and M. The highest species richness of biotic element 3 (10 species) can be found in grid squares N and O in the Eastern Cape Province. Most species of biotic element 4 (eight species) are restricted to grid square P in the Eastern Cape Province. Only one species is more widespread. If 1 grid squares are used, there is greater scatter in the occupancy of grid cells. Biotic elements 1 and 3 found in the analysis based on the 2 grid are no longer recognized as clusters due to the greater scatter. The species that were FIGURE 1. First two dimensions of the nonmetric MDS of the range data of the southern African Scobius species. 1 4-Biotic elements found by PRABCLUS; N = noise component. (a) Result if the distribution areas are mapped on a 2 grid. (b) Result if the distribution areas are mapped ona1 grid.

4 720 SYSTEMATIC BIOLOGY VOL. 52 FIGURE 2. Distribution maps of four biotic elements found if the distribution areas of the southern African Scobius species are mapped on a 2 grid (some grid squares in Namibia are not shown). The different shadings indicate the areas where >70%, >30%, and >0% of the species of an element are present. assigned to these biotic elements in the previous analysis are now classified in the noise component, which includes more than half of the species ranges (57.4%). Two biotic elements are recognized (Figs. 1, 3). Biotic element 1 (seven species) is restricted to grid square J15. It corresponds to biotic element 4 of the previous analysis with the exception that the single more widespread species is excluded. The geographic center of biotic element 2 (13 species) is in Natal. It corresponds to biotic element 2 but includes fewer species. DISCUSSION Test for Clustering of Distribution Areas Most methods for the analysis of areas of endemism or biotic elements proposed so far take for granted that there is nonrandom structure in the data without testing this hypothesis. We propose a test with which it can be examined whether the analyzed distribution areas are clustered. Only if this is the case, it is meaningful to determine biotic elements, which would be a consequence of a subdivision of an ancestral biota by vicariance events (Hausdorf, 2002). Our test for clustering of distribution areas should not be confounded with the test of Mast and Nyffeler (2003) for the significance of pairwise co-occurences of taxa. The objective of our test is to check whether the entire set of ranges shows a degree of clustering that differs significantly from the degree of clustering observed in a set of similar ranges positioned independently of each other in the same area. Even if the null hypothesis of independent distribution of ranges is not rejected by our test, there may be some pairs of taxa whose ranges are more similar than should be expected by chance. On the other hand, no conclusion about the degree of clustering of the entire set of ranges is possible by a rejection of the null hypothesis of independent distribution for some pairs of taxa by the test of Mast and Nyffeler (2003). The tests proposed by Mast and Nyffeler (2003) and by us are not directly comparable, but the null models to generate ranges are. The major differences between

5 2003 HAUSDORF AND HENNIG BIOTIC ELEMENT ANALYSIS 721 the null model of Mast and Nyffeler (2003) and our null model are that our null model can consider the richness distribution of the cells and can produce ranges with a similar amount of disjunctions as in the real range set, whereas the null model of Mast and Nyffeler (2003) does not consider the richness distribution of the cells and cannot consider disjunctions of ranges (or of grid cells). Our null model was already applied in a test for nestedness of distribution areas (Hausdorf and Hennig, 2003). Scobius Example The Scobius example demonstrates our test for clustering of distribution areas and our method for the delimitation of biotic elements. If 2 grid squares are used, the test indicates a significant clustering of distribution areas. Four biotic elements are found with our method. We can investigate how far these biotic elements correspond to areas of endemism as identified and delimited in previous studies. Our biotic element 1 corresponds more or less to sets 2, 10, and 11 of Szumik et al. (2002), biotic element 2 corresponds to area 1 of Morrone (1994) and Mast and Nyffeler (2003) and sets 1, 4, 5, 6, 7, 8, and 9 of Szumik et al. (2002), biotic element 3 corresponds to area 2 (or 2 ) of Morrone (1994) and Mast and Nyffeler (2003) and sets 3, 12, 13, and 14 of Szumik et al. (2002), and biotic element 4 corresponds to area 3 of Morrone (1994) and Mast and Nyffeler (2003). For the 1 grid data set, our test resulted in a P value of 0.09, which means that a clustering of distribution areas cannot significantly be established. However, the value of the test statistic is still clearly below the average of the values for the simulated populations, and therefore it may be interesting to consider the result of the cluster analysis also for this grid. We found two biotic elements. All species belonging to biotic element 1 belong also to biotic element 4 of the 2 grid data set, which corresponds to area iv of Mast and Nyffeler (2003). All species belonging to biotic element 2 belong also to biotic element 2 of the 2 grid data set, which corresponds to areas i and ii of Mast and Nyffeler (2003). No biotic elements were found in the 1 grid data set that would correspond to the biotic elements 1 and 3 of the 2 grid data set. The distribution areas of the species that form these biotic elements are not similar enough at the finer scale to form distinct clusters. The differences between the results of the analysis of the 1 grid and those of the 2 grid data set show the importance of grid size (Jardine, 1972; Mast and Nyffeler, 2003). Distinct biotic elements may be amalgamated if the grid used is too coarse. In the Scobius example, no elements are found in the analysis with the 1 grid data set that are not also found with the 2 grid data set. Thus, there is no indication that the 2 grid is too coarse. If the grid used is too fine and the distribution data are not interpolated, insufficient sampling may introduce artificial noise in the data set. This can cause a failure to recognize existing biotic elements. This seems to be the case in the analysis of the 1 grid Scobius data set, where the clustering is not significant at the 5% level, the cluster analysis does not find two biotic elements that are recognized with the 2 grid data set, and the MDS representation (Fig. 1b) indicates a much weaker clustering pattern. The distribution maps of the biotic elements (Figs. 2, 3) show that the elements often overlap geographically and that the species belonging to one element are usually not homogeneously distributed. In several cases, a center can be discerned, where most species belonging to an element occur. With increasing distance from the center less and less of the species belonging to the element are present. One process that can create such a pattern is diversification by vicariance and subsequent chance dispersal of the species that originated in an area of endemism. The problems in delimitating areas of endemism are due to this pattern. FIGURE 3. Distribution maps of two biotic elements found if the distribution areas of the southern African Scobius species are mapped on a 1 grid (some grid squares in Namibia are not shown). The different shadings indicate the areas where >70%, >30%, and >0% of the species of an element are present.

6 722 SYSTEMATIC BIOLOGY VOL. 52 Biotic Elements versus Areas of Endemism Both biotic element analysis and the areas of endemism approach try to identify similarities in distribution areas, which might be the result of a common biogeographic history (primary biogeographic homology, sensu Morrone, 2001). The Scobius example illustrates two major problems with the areas-of-endemism approach. First, a single set of nonoverlapping areas of endemism has to be identified. But the application of an optimality criterion, namely the number of the species that can be considered endemic, shows that there are many combinations of grid squares with similar scores of endemicity (Szumik et al., 2002). Optimality criteria can be formulated to select a set of nonoverlapping areas of endemism. However, in this way the information about overlapping distribution patterns gets lost. In the next step of the usual vicarance biogeography protocol, an area cladogram is constructed based on the previously selected set of areas of endemism. In this step it is tested whether the taxa that are supposed to have been originated in a given set of areas of endemism actually had a common biogeographic history (secondary biogeographic homology, sensu Morrone, 2001). This step depends crucially on the correct identification and delimitation of areas of endemism. Different delimitations of areas of endemism can result in different outcomes concerning area relationships (Henderson, 1991). Furthermore, the comparison of area cladograms is a possibility to check which candidate areas of endemism might actually be historical units that were separated by vicariance events (Harold and Mooi, 1994). However, such a test is not very effective, if many candidate areas of endemism have been excluded already during the selection of a set of nonoverlapping areas of endemism based on other criteria. The second major problem of the areas-of-endemism approach is that it is hardly possible to reconstruct the borders between areas of endemism if dispersal resulted in overlapping distribution patterns even if we knew the true number and the approximate position of these areas (Hausdorf, 2002). This difficulty is illustrated in the Scobius example by the different delimitations of areas of endemism (Morrone, 1994; Szumik et al., 2002; Mast and Nyffeler, 2003), which are a consequence of the overlap of the biotic elements (Fig. 2). Another well-known example which highlights that problem are the numerous lines which have been proposed to delimit the Oriental and Australian regions (Mayr, 1944; Holloway and Jardine, 1968; Simpson, 1977; Vane-Wright, 1991). It is possible to formulate optimality criteria to choose among different delimitations of areas (Szumik et al., 2002). But the resulting borders vary with the choice of the criteria and there is no rationale for how to find the true borders. Again, this might create problems in the reconstruction of an area cladogram. Our approach based on biotic elements presents solutions to both problems. First, there is no necessity to select a single set of nonoverlapping biotic elements. Overlap between the areas occupied by different biotic elements is the rule rather than the exception (Fig. 2) and indicates that processes other than vicariance (e.g., dispersal across barriers) were frequently involved in the origination of these distribution patterns. It can be investigated which biotic elements represent historical units that originated by vicariance, even if they overlap geographically. For that purpose, we proposed to convert a taxon cladogram into an element cladogram by substituting the names of the taxa with the respective biotic elements and looking for recurrent patterns in the resulting element cladograms (Hausdorf, 2002). Thus, overlapping distribution patterns are not ignored, as in the areas-of-endemism approach. Further elaboration of this topic is beyond the topic of this paper. Second, in contrast to the area-of-endemism approach, where the correct delimitation of the borders of the areas of endemism is crucial for the conversion of a taxon cladogram into an area cladogram, it is not even necessary to reconstruct the borders between areas of endemism in the biotic element approach. Biotic element cladograms can be constructed and searched for patterns without fixing borders. Other Methods for the Analysis of Biotic Elements A well-known problem in cluster analysis is that most cluster analysis methods assign all units to clusters, i.e., they always yield an exhaustive clustering, if this is adequate for the data at hand or not, and no automatic decision about the number of clusters is possible. It is not clear from the results of previous analyses of biotic elements, which employed hierarchical distance based cluster analyses such as single or complete linkage (e.g., Holloway and Jardine, 1968; Jardine, 1972; Dennis et al., 1998), whether all biotic elements which were delimited are meaningful. In contrast to such methods, MBGC provides a decision about the number of meaningful biotic elements and about ranges that cannot be assigned adequately to any biotic element. There are attempts to decide about the meaningfulness of biotic elements by the use of pairwise significance tests of co-occurrence. Márquez et al. (1997) combined a hierarchical average linkage (UPGMA) clustering with a sequence of tests concerning the number of significantly similar species in different branches of the dendrogram. Mast and Nyffeler (2003) used an informal clustering approach equivalent to single linkage for the discussed examples, where the P values of the pairwise comparisons are used as dissimilarities between species and the resulting tree is cutted at a level of The appropriateness of P values as a measure of similarity between ranges is doubtful because P values do not only depend on the similarity between ranges, but also on the power of the test, which changes, e.g., with the scales of the grids. Moreover, the statistical validity of both approaches is questionable because they lead to multiple testing procedures, where no adequate adjustment of the P values is possible (see Mast and Nyffeler, 2003, for arguments

7 2003 HAUSDORF AND HENNIG BIOTIC ELEMENT ANALYSIS 723 why the Bonferroni adjustment is too conservative for this task). ACKNOWLEDGMENT We are grateful to Austin Mast for communicating details of an unpublished manuscript. REFERENCES BYERS,S.,AND A. E. RAFTERY Nearest neighbor clutter removal for estimating features in spatial point processes. J. Am. Stat. Assoc. 95: CROIZAT, L., G. NELSON, AND D. E. ROSEN Centers of origin and related concepts. Syst. Zool. 23: DENNIS, R. L. H., W. R. WILLIAMS, AND T. G. S HREEVE Faunal structures among European butterflies: Evolutionary implications of bias for geography, endemism and taxonomic affilation. Ecography 21: FRALEY, C., AND A. E. RAFTERY How many clusters? Which clustering method? Answers via model based cluster analysis. Comput. J. 41: HAROLD, A. S., AND R. D. MOOI Areas of endemism: Definition and recognition criteria. Syst. Biol. 43: HAUSDORF, B Units in biogeography. Syst. Biol. 51: HAUSDORF, B., AND C. HENNIG Nestedness of north-west European land snail ranges as a consequence of differential immigration from Pleistocene glacial refuges. Oecologia 135: HENDERSON, I. M Biogeography without area? Aust. Syst. Bot. 4: HENNIG, C., AND B. HAUSDORF. In press. Distance-based parametric bootstrap tests for clustering of species ranges. Comput. Stat. Data Anal. HOLLOWAY, J. D., AND N. JARDINE Two approaches to zoogeography: A study based on the distributions of butterflies, birds and bats in the Indo-Australian area. Proc. Linn. Soc. Lond. 179: HUMPHRIES, C. J., AND L. R. PARENTI Cladistic biogeography. Oxford Univ. Press, Oxford, UK. JARDINE, N Computational methods in the study of plant distributions. Pages in Taxonomy, phytogeography and evolution (D. H. Valentine, ed.). Academic Press, London. KRUSKAL, J. B Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29:1 27. LINDER, H. P On areas of endemism, with an example from the African Restionaceae. Syst. Biol. 50: MÁRQUEZ, A. L., R. REAL, J.M.VARGAS, AND A. E. SALVO On identifying common distribution patterns and their causal factors: A probabilistic method applied to pteridophytes in the Iberian Peninsula. J. Biogeogr. 24: MAST, A. R., AND R. NYFFELER Using a null model to recognize significant co-occurrence prior to identifying candidate areas of endemism. Syst. Biol. 52: MAYR, E Wallace s line in the light of recent zoogeographic studies. Q. Rev. Biol. 19:1 14. MORRONE, J. J On the identification of areas of endemism. Syst. Biol. 43: MORRONE, J. J Homology, biogeography and areas of endemism. Divers. Distrib. 7: NELSON, G., AND N. PLATNICK Systematics and biogeography: Cladistics and vicariance. Columbia Univ. Press, New York. PLATNICK, N. I., AND G. NELSON A method of analysis for historical biogeography. Syst. Zool. 27:1 16. ROSEN, D. E A vicariance model of Caribbean biogeography. Syst. Zool. 24: ROSEN, D. E Vicariant patterns and historical explanations in biogeography. Syst. Zool. 27: SHI, G. R Multivariate data analysis in palaeoecology and palaeobiogeography A review. Palaeogeogr. Palaeoclimatol. Palaeoecol. 105: SIMPSON, G. G Too many lines; the limits of the Oriental and Australian zoogeographic regions. Proc. Am. Philos. Soc. 121: SZUMIK, C. A., F. CUEZZO, P.A.GOLOBOFF, AND A. E. CHALUP An optimality criterion to determine areas of endemism. Syst. Biol. 51: VANE-WRIGHT, R. I Transcending the Wallace line: Do the western edges of the Australian region and the Australian Plate coincide? Aust. Syst. Bot. 4: WILEY, E. O Vicariance biogeography. Annu. Rev. Ecol. Syst. 19: First submitted 26 February 2003; reviews returned 23 May 2003; final acceptance 19 June 2003 Associate Editor: Roderic Page

Units in Biogeography

Units in Biogeography 648 SYSTEMATIC BIOLOGY VOL. 51 Syst. Biol. 51(4):648 652, 2002 DOI: 10.1080/10635150290102320 Units in Biogeography BERNHARD HAUSDORF Zoologisches Institut und Zoologisches Museum der Universität Hamburg,

More information

A Robust Distance Coefficient between Distribution Areas Incorporating Geographic Distances

A Robust Distance Coefficient between Distribution Areas Incorporating Geographic Distances 170 SYSTEMATI BIOLOGY VOL. 55 Syst. Biol. 55(l):170-175,2006 opyright Society of Systematic Biologists ISSN: 1063-5157 print / 1076-36X online DO1:10.100/1063515050041523 A Robust Distance oefficient between

More information

,...,.,.,,.,...,.,...,...,.,.,...

,...,.,.,,.,...,.,...,...,.,.,... Areas of Endemism The definitions and criteria for areas of endemism are complex issues (Linder 2001; Morrone 1994b; Platnick 1991; Szumik et al. 2002; Viloria 2005). There are severa1 definitions of areas

More information

Reconstructing the history of lineages

Reconstructing the history of lineages Reconstructing the history of lineages Class outline Systematics Phylogenetic systematics Phylogenetic trees and maps Class outline Definitions Systematics Phylogenetic systematics/cladistics Systematics

More information

Some thoughts on the design of (dis)similarity measures

Some thoughts on the design of (dis)similarity measures Some thoughts on the design of (dis)similarity measures Christian Hennig Department of Statistical Science University College London Introduction In situations where p is large and n is small, n n-dissimilarity

More information

Species diversification in space: biogeographic patterns

Species diversification in space: biogeographic patterns Species diversification in space: biogeographic patterns Outline Endemism and cosmopolitanism Disjunctions Biogeographic regions Barriers and interchanges Divergence and convergence Biogeographic patterns

More information

IB 200B Phylogenetics 21 April II. Vicariance Biogeography

IB 200B Phylogenetics 21 April II. Vicariance Biogeography II. Vicariance Biogeography The breakthrough in the application of cladistic reasoning to biogeography was led by the ichthyologists Gary Nelson and Donn Rosen at the American Museum of Natural History

More information

Multiple regression and inference in ecology and conservation biology: further comments on identifying important predictor variables

Multiple regression and inference in ecology and conservation biology: further comments on identifying important predictor variables Biodiversity and Conservation 11: 1397 1401, 2002. 2002 Kluwer Academic Publishers. Printed in the Netherlands. Multiple regression and inference in ecology and conservation biology: further comments on

More information

PCB6675C, BOT6935, ZOO6927 Evolutionary Biogeography Spring 2014

PCB6675C, BOT6935, ZOO6927 Evolutionary Biogeography Spring 2014 PCB6675C, BOT6935, ZOO6927 Evolutionary Biogeography Spring 2014 Credits: 3 Schedule: Wednesdays and Fridays, 4 th & 5 th Period (10:40 am - 12:35 pm) Location: Carr 221 Instructors Dr. Nico Cellinese

More information

Geography of Evolution

Geography of Evolution Geography of Evolution Biogeography - the study of the geographic distribution of organisms. The current distribution of organisms can be explained by historical events and current climatic patterns. Darwin

More information

Historical Biogeography. Historical Biogeography. Systematics

Historical Biogeography. Historical Biogeography. Systematics Historical Biogeography I. Definitions II. Fossils: problems with fossil record why fossils are important III. Phylogeny IV. Phenetics VI. Phylogenetic Classification Disjunctions debunked: Examples VII.

More information

EFFECTS OF TAXONOMIC GROUPS AND GEOGRAPHIC SCALE ON PATTERNS OF NESTEDNESS

EFFECTS OF TAXONOMIC GROUPS AND GEOGRAPHIC SCALE ON PATTERNS OF NESTEDNESS EFFECTS OF TAXONOMIC GROUPS AND GEOGRAPHIC SCALE ON PATTERNS OF NESTEDNESS SFENTHOURAKIS Spyros, GIOKAS Sinos & LEGAKIS Anastasios Zoological Museum, Department of Biology, University of Athens, Greece

More information

Three Monte Carlo Models. of Faunal Evolution PUBLISHED BY NATURAL HISTORY THE AMERICAN MUSEUM SYDNEY ANDERSON AND CHARLES S.

Three Monte Carlo Models. of Faunal Evolution PUBLISHED BY NATURAL HISTORY THE AMERICAN MUSEUM SYDNEY ANDERSON AND CHARLES S. AMERICAN MUSEUM Notltates PUBLISHED BY THE AMERICAN MUSEUM NATURAL HISTORY OF CENTRAL PARK WEST AT 79TH STREET NEW YORK, N.Y. 10024 U.S.A. NUMBER 2563 JANUARY 29, 1975 SYDNEY ANDERSON AND CHARLES S. ANDERSON

More information

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Integrative Biology 200 PRINCIPLES OF PHYLOGENETICS Spring 2018 University of California, Berkeley Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;

More information

Lecture V Phylogeny and Systematics Dr. Kopeny

Lecture V Phylogeny and Systematics Dr. Kopeny Delivered 1/30 and 2/1 Lecture V Phylogeny and Systematics Dr. Kopeny Lecture V How to Determine Evolutionary Relationships: Concepts in Phylogeny and Systematics Textbook Reading: pp 425-433, 435-437

More information

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2014 University of California, Berkeley

PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION Integrative Biology 200 Spring 2014 University of California, Berkeley "PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2014 University of California, Berkeley D.D. Ackerly April 16, 2014. Community Ecology and Phylogenetics Readings: Cavender-Bares,

More information

The practice of naming and classifying organisms is called taxonomy.

The practice of naming and classifying organisms is called taxonomy. Chapter 18 Key Idea: Biologists use taxonomic systems to organize their knowledge of organisms. These systems attempt to provide consistent ways to name and categorize organisms. The practice of naming

More information

Does vicariance shape biotas? Biogeographical tests of the vicariance model in the north-west European land snail fauna

Does vicariance shape biotas? Biogeographical tests of the vicariance model in the north-west European land snail fauna Journal of Biogeography (J. Biogeogr.) (2004) 31, 1751 1757 ORIGINAL ARTICLE Does vicariance shape biotas? Biogeographical tests of the vicariance model in the north-west European land snail fauna Bernhard

More information

What is the range of a taxon? A scaling problem at three levels: Spa9al scale Phylogene9c depth Time

What is the range of a taxon? A scaling problem at three levels: Spa9al scale Phylogene9c depth Time What is the range of a taxon? A scaling problem at three levels: Spa9al scale Phylogene9c depth Time 1 5 0.25 0.15 5 0.05 0.05 0.10 2 0.10 0.10 0.20 4 Reminder of what a range-weighted tree is Actual Tree

More information

Welcome! Text: Community Ecology by Peter J. Morin, Blackwell Science ISBN (required) Topics covered: Date Topic Reading

Welcome! Text: Community Ecology by Peter J. Morin, Blackwell Science ISBN (required) Topics covered: Date Topic Reading Welcome! Text: Community Ecology by Peter J. Morin, Blackwell Science ISBN 0-86542-350-4 (required) Topics covered: Date Topic Reading 1 Sept Syllabus, project, Ch1, Ch2 Communities 8 Sept Competition

More information

ENMTools: a toolbox for comparative studies of environmental niche models

ENMTools: a toolbox for comparative studies of environmental niche models Ecography 33: 607611, 2010 doi: 10.1111/j.1600-0587.2009.06142.x # 2010 The Authors. Journal compilation # 2010 Ecography Subject Editor: Thiago Rangel. Accepted 4 September 2009 ENMTools: a toolbox for

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Chapter 17A. Table of Contents. Section 1 Categories of Biological Classification. Section 2 How Biologists Classify Organisms

Chapter 17A. Table of Contents. Section 1 Categories of Biological Classification. Section 2 How Biologists Classify Organisms Classification of Organisms Table of Contents Section 1 Categories of Biological Classification Section 1 Categories of Biological Classification Classification Section 1 Categories of Biological Classification

More information

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the

More information

Using a Null Model to Recognize Significant Co-Occurrence Prior to Identifying Candidate Areas of Endemism

Using a Null Model to Recognize Significant Co-Occurrence Prior to Identifying Candidate Areas of Endemism 2003 POINTS OF VIEW 271 EDMONDS, D., AND J. EIDINOW. 2001. Wittgenstein s poker. Harper- Collins, New York. EDWARDS, A. W. F. 1992. Likelihood. Expanded edition. Johns Hopkins Univ. Press, Baltimore, Maryland.

More information

Non-independence in Statistical Tests for Discrete Cross-species Data

Non-independence in Statistical Tests for Discrete Cross-species Data J. theor. Biol. (1997) 188, 507514 Non-independence in Statistical Tests for Discrete Cross-species Data ALAN GRAFEN* AND MARK RIDLEY * St. John s College, Oxford OX1 3JP, and the Department of Zoology,

More information

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis

More information

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size Berkman Sahiner, a) Heang-Ping Chan, Nicholas Petrick, Robert F. Wagner, b) and Lubomir Hadjiiski

More information

The Tempo of Macroevolution: Patterns of Diversification and Extinction

The Tempo of Macroevolution: Patterns of Diversification and Extinction The Tempo of Macroevolution: Patterns of Diversification and Extinction During the semester we have been consider various aspects parameters associated with biodiversity. Current usage stems from 1980's

More information

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Statistical Methods in Particle Physics Lecture 1: Bayesian methods Statistical Methods in Particle Physics Lecture 1: Bayesian methods SUSSP65 St Andrews 16 29 August 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Ontology of areas of endemism

Ontology of areas of endemism Journal of Biogeography (J. Biogeogr.) (2011) GUEST EDITORIAL Ontology of areas of endemism Brian I. Crother* and Christopher M. Murray Department of Biological Sciences, Southeastern Louisiana University,

More information

Unsupervised machine learning

Unsupervised machine learning Chapter 9 Unsupervised machine learning Unsupervised machine learning (a.k.a. cluster analysis) is a set of methods to assign objects into clusters under a predefined distance measure when class labels

More information

Consensus Methods. * You are only responsible for the first two

Consensus Methods. * You are only responsible for the first two Consensus Trees * consensus trees reconcile clades from different trees * consensus is a conservative estimate of phylogeny that emphasizes points of agreement * philosophy: agreement among data sets is

More information

Phylogenetic Analysis

Phylogenetic Analysis Phylogenetic Analysis Aristotle Through classification, one might discover the essence and purpose of species. Nelson & Platnick (1981) Systematics and Biogeography Carl Linnaeus Swedish botanist (1700s)

More information

Phylogenetic Analysis

Phylogenetic Analysis Phylogenetic Analysis Aristotle Through classification, one might discover the essence and purpose of species. Nelson & Platnick (1981) Systematics and Biogeography Carl Linnaeus Swedish botanist (1700s)

More information

BRIEF COMMUNICATIONS

BRIEF COMMUNICATIONS Evolution, 59(12), 2005, pp. 2705 2710 BRIEF COMMUNICATIONS THE EFFECT OF INTRASPECIFIC SAMPLE SIZE ON TYPE I AND TYPE II ERROR RATES IN COMPARATIVE STUDIES LUKE J. HARMON 1 AND JONATHAN B. LOSOS Department

More information

Data analysis and Ecological Classification

Data analysis and Ecological Classification Data analysis and Ecological Classification Gabriela Augusto Marco Painho Abstract Classification is a basic tool for knowledge conceptualisation in any science. Ecology is not an exception. There are

More information

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups

More information

(Stevens 1991) 1. morphological characters should be assumed to be quantitative unless demonstrated otherwise

(Stevens 1991) 1. morphological characters should be assumed to be quantitative unless demonstrated otherwise Bot 421/521 PHYLOGENETIC ANALYSIS I. Origins A. Hennig 1950 (German edition) Phylogenetic Systematics 1966 B. Zimmerman (Germany, 1930 s) C. Wagner (Michigan, 1920-2000) II. Characters and character states

More information

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor Spatial Statistics Module 4 Geographers are very interested in studying, understanding, and quantifying the patterns we can see on maps Q: What kinds of map patterns can you think of? There are so many

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

Phylogenetic Analysis

Phylogenetic Analysis Phylogenetic Analysis Aristotle Through classification, one might discover the essence and purpose of species. Nelson & Platnick (1981) Systematics and Biogeography Carl Linnaeus Swedish botanist (1700s)

More information

Phylogeny: building the tree of life

Phylogeny: building the tree of life Phylogeny: building the tree of life Dr. Fayyaz ul Amir Afsar Minhas Department of Computer and Information Sciences Pakistan Institute of Engineering & Applied Sciences PO Nilore, Islamabad, Pakistan

More information

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature11226 Supplementary Discussion D1 Endemics-area relationship (EAR) and its relation to the SAR The EAR comprises the relationship between study area and the number of species that are

More information

Chapter 19: Taxonomy, Systematics, and Phylogeny

Chapter 19: Taxonomy, Systematics, and Phylogeny Chapter 19: Taxonomy, Systematics, and Phylogeny AP Curriculum Alignment Chapter 19 expands on the topics of phylogenies and cladograms, which are important to Big Idea 1. In order for students to understand

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

ANALYSIS OF CHARACTER DIVERGENCE ALONG ENVIRONMENTAL GRADIENTS AND OTHER COVARIATES

ANALYSIS OF CHARACTER DIVERGENCE ALONG ENVIRONMENTAL GRADIENTS AND OTHER COVARIATES ORIGINAL ARTICLE doi:10.1111/j.1558-5646.2007.00063.x ANALYSIS OF CHARACTER DIVERGENCE ALONG ENVIRONMENTAL GRADIENTS AND OTHER COVARIATES Dean C. Adams 1,2,3 and Michael L. Collyer 1,4 1 Department of

More information

Linking species-compositional dissimilarities and environmental data for biodiversity assessment

Linking species-compositional dissimilarities and environmental data for biodiversity assessment Linking species-compositional dissimilarities and environmental data for biodiversity assessment D. P. Faith, S. Ferrier Australian Museum, 6 College St., Sydney, N.S.W. 2010, Australia; N.S.W. National

More information

PATTERN CLASSIFICATION

PATTERN CLASSIFICATION PATTERN CLASSIFICATION Second Edition Richard O. Duda Peter E. Hart David G. Stork A Wiley-lnterscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane Singapore Toronto CONTENTS

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 5, Issue 1 2006 Article 28 A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments Hongmei Jiang Rebecca

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

Nebraska Conservation and Environmental Review Tool (CERT): Terminology used in the Tables of the CERT Report

Nebraska Conservation and Environmental Review Tool (CERT): Terminology used in the Tables of the CERT Report Nebraska Conservation and Environmental Review Tool (CERT): Terminology used in the Tables of the CERT Report Nebraska Natural Heritage Program Nebraska Game and Parks Commission February 8, 2018 Contents

More information

The Life System and Environmental & Evolutionary Biology II

The Life System and Environmental & Evolutionary Biology II The Life System and Environmental & Evolutionary Biology II EESC V2300y / ENVB W2002y Laboratory 1 (01/28/03) Systematics and Taxonomy 1 SYNOPSIS In this lab we will give an overview of the methodology

More information

Heaving Toward Speciation

Heaving Toward Speciation Temporal Waves of Genetic Diversity in a Spatially Explicit Model of Evolution: Heaving Toward Speciation Guy A. Hoelzer 1, Rich Drewes 2 and René Doursat 2,3 1 Department of Biology, 2 Brain Computation

More information

GIS Applications to Museum Specimens

GIS Applications to Museum Specimens GIS Applications to Museum Specimens Joseph Grinnell (1877 1939) At this point I wish to emphasize what I believe will ultimately prove to be the greatest value of our museum. This value will not, however,

More information

Wake Acceleration Academy - Biology Note Guide Unit 6: Evolution & The Diversity of Life

Wake Acceleration Academy - Biology Note Guide Unit 6: Evolution & The Diversity of Life Wake Acceleration Academy - Biology Note Guide Unit 6: Evolution & The Diversity of Life Extra Resources Website: http://waa-science.weebly.com Module 1: Darwin and Natural Selection Vocabulary Term Charles

More information

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004,

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004, Tracing the Evolution of Numerical Phylogenetics: History, Philosophy, and Significance Adam W. Ferguson Phylogenetic Systematics 26 January 2009 Inferring Phylogenies Historical endeavor Darwin- 1837

More information

A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling

A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling A Statistical Input Pruning Method for Artificial Neural Networks Used in Environmental Modelling G. B. Kingston, H. R. Maier and M. F. Lambert Centre for Applied Modelling in Water Engineering, School

More information

Input data, analytical methods and biogeography of Elegia (Restionaceae)

Input data, analytical methods and biogeography of Elegia (Restionaceae) Journal of Biogeography (J. Biogeogr.) (2006) 33, 47 62 ORIGINAL ARTICLE Input data, analytical methods and biogeography of Elegia (Restionaceae) P. M. Moline* and H. P. Linder Institute of Systematic

More information

Remote sensing of biodiversity: measuring ecological complexity from space

Remote sensing of biodiversity: measuring ecological complexity from space Remote sensing of biodiversity: measuring ecological complexity from space Duccio Rocchini Fondazione Edmund Mach Research and Innovation Centre Department of Biodiversity and Molecular Ecology San Michele

More information

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26 Phylogeny Chapter 26 Taxonomy Taxonomy: ordered division of organisms into categories based on a set of characteristics used to assess similarities and differences Carolus Linnaeus developed binomial nomenclature,

More information

More on Unsupervised Learning

More on Unsupervised Learning More on Unsupervised Learning Two types of problems are to find association rules for occurrences in common in observations (market basket analysis), and finding the groups of values of observational data

More information

On the biogeography of Romanian spiders (Araneae)

On the biogeography of Romanian spiders (Araneae) European Arachnology 2005 (Deltshev, C. & Stoev, P., eds) Acta zoologica bulgarica, Suppl. No. 1: pp. 281-285. On the biogeography of Romanian spiders (Araneae) Andreea Tatole 1 Abstract: The biogeographical

More information

Integrating Fossils into Phylogenies. Throughout the 20th century, the relationship between paleontology and evolutionary biology has been strained.

Integrating Fossils into Phylogenies. Throughout the 20th century, the relationship between paleontology and evolutionary biology has been strained. IB 200B Principals of Phylogenetic Systematics Spring 2011 Integrating Fossils into Phylogenies Throughout the 20th century, the relationship between paleontology and evolutionary biology has been strained.

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

Estimating Evolutionary Trees. Phylogenetic Methods

Estimating Evolutionary Trees. Phylogenetic Methods Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent

More information

Phylogenetic Tree Reconstruction

Phylogenetic Tree Reconstruction I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven

More information

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan Clustering CSL465/603 - Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Supervised vs Unsupervised Learning Supervised learning Given x ", y " "%& ', learn a function f: X Y Categorical output classification

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

1 The spatial component of evolution

1 The spatial component of evolution 1 The spatial component of evolution Molecular studies have documented high levels of geographic structure in most plant and animal groups and this finding has fundamental implications for the science

More information

ELE4120 Bioinformatics Tutorial 8

ELE4120 Bioinformatics Tutorial 8 ELE4120 ioinformatics Tutorial 8 ontent lassifying Organisms Systematics and Speciation Taxonomy and phylogenetics Phenetics versus cladistics Phylogenetic trees iological classification Goal: To develop

More information

Integrative Biology 200A "PRINCIPLES OF PHYLOGENETICS" Spring 2012 University of California, Berkeley

Integrative Biology 200A PRINCIPLES OF PHYLOGENETICS Spring 2012 University of California, Berkeley Integrative Biology 200A "PRINCIPLES OF PHYLOGENETICS" Spring 2012 University of California, Berkeley B.D. Mishler Feb. 7, 2012. Morphological data IV -- ontogeny & structure of plants The last frontier

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Statistical Machine Learning

Statistical Machine Learning Statistical Machine Learning Christoph Lampert Spring Semester 2015/2016 // Lecture 12 1 / 36 Unsupervised Learning Dimensionality Reduction 2 / 36 Dimensionality Reduction Given: data X = {x 1,..., x

More information

Letter to the Editor. The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1

Letter to the Editor. The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1 Letter to the Editor The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1 Department of Zoology and Texas Memorial Museum, University of Texas

More information

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2016 University of California, Berkeley. Parsimony & Likelihood [draft]

Integrative Biology 200 PRINCIPLES OF PHYLOGENETICS Spring 2016 University of California, Berkeley. Parsimony & Likelihood [draft] Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2016 University of California, Berkeley K.W. Will Parsimony & Likelihood [draft] 1. Hennig and Parsimony: Hennig was not concerned with parsimony

More information

Diversity partitioning without statistical independence of alpha and beta

Diversity partitioning without statistical independence of alpha and beta 1964 Ecology, Vol. 91, No. 7 Ecology, 91(7), 2010, pp. 1964 1969 Ó 2010 by the Ecological Society of America Diversity partitioning without statistical independence of alpha and beta JOSEPH A. VEECH 1,3

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 .0.0 5 5 1.0 7 5 X2 X2 7 1.5 1.0 0.5 3 1 2 Hierarchical clustering

More information

Outline. Classification of Living Things

Outline. Classification of Living Things Outline Classification of Living Things Chapter 20 Mader: Biology 8th Ed. Taxonomy Binomial System Species Identification Classification Categories Phylogenetic Trees Tracing Phylogeny Cladistic Systematics

More information

Lecture 11 Friday, October 21, 2011

Lecture 11 Friday, October 21, 2011 Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system

More information

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata

Principles of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata Principles of Pattern Recognition C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata e-mail: murthy@isical.ac.in Pattern Recognition Measurement Space > Feature Space >Decision

More information

The Tree of Life. Chapter 17

The Tree of Life. Chapter 17 The Tree of Life Chapter 17 1 17.1 Taxonomy The science of naming and classifying organisms 2000 years ago Aristotle Grouped plants and animals Based on structural similarities Greeks and Romans included

More information

Biologists use a system of classification to organize information about the diversity of living things.

Biologists use a system of classification to organize information about the diversity of living things. Section 1: Biologists use a system of classification to organize information about the diversity of living things. K What I Know W What I Want to Find Out L What I Learned Essential Questions What are

More information

Community phylogenetics review/quiz

Community phylogenetics review/quiz Community phylogenetics review/quiz A. This pattern represents and is a consequent of. Most likely to observe this at phylogenetic scales. B. This pattern represents and is a consequent of. Most likely

More information

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation Speciation Today s OUTLINE: (1) Geographic Mechanisms of Speciation (What circumstances lead to the formation of new species?) (2) Species Concepts (How are Species Defined?) Mechanisms of Speciation Last

More information

Applying cluster analysis to 2011 Census local authority data

Applying cluster analysis to 2011 Census local authority data Applying cluster analysis to 2011 Census local authority data Kitty.Lymperopoulou@manchester.ac.uk SPSS User Group Conference November, 10 2017 Outline Basic ideas of cluster analysis How to choose variables

More information

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them? Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them? Carolus Linneaus:Systema Naturae (1735) Swedish botanist &

More information

10-810: Advanced Algorithms and Models for Computational Biology. Optimal leaf ordering and classification

10-810: Advanced Algorithms and Models for Computational Biology. Optimal leaf ordering and classification 10-810: Advanced Algorithms and Models for Computational Biology Optimal leaf ordering and classification Hierarchical clustering As we mentioned, its one of the most popular methods for clustering gene

More information

ESS 345 Ichthyology. Systematic Ichthyology Part II Not in Book

ESS 345 Ichthyology. Systematic Ichthyology Part II Not in Book ESS 345 Ichthyology Systematic Ichthyology Part II Not in Book Thought for today: Now, here, you see, it takes all the running you can do, to keep in the same place. If you want to get somewhere else,

More information

Biogeography. An ecological and evolutionary approach SEVENTH EDITION. C. Barry Cox MA, PhD, DSc and Peter D. Moore PhD

Biogeography. An ecological and evolutionary approach SEVENTH EDITION. C. Barry Cox MA, PhD, DSc and Peter D. Moore PhD Biogeography An ecological and evolutionary approach C. Barry Cox MA, PhD, DSc and Peter D. Moore PhD Division of Life Sciences, King's College London, Fmnklin-Wilkins Building, Stamford Street, London

More information

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints

NEW YORK DEPARTMENT OF SANITATION. Spatial Analysis of Complaints NEW YORK DEPARTMENT OF SANITATION Spatial Analysis of Complaints Spatial Information Design Lab Columbia University Graduate School of Architecture, Planning and Preservation November 2007 Title New York

More information

LAB EXERCISE #3 Neutral Landscape Analysis Summary of Key Results and Conclusions

LAB EXERCISE #3 Neutral Landscape Analysis Summary of Key Results and Conclusions LAB EXERCISE #3 Neutral Landscape Analysis Summary of Key Results and Conclusions Below is a brief summary of the key results and conclusions of this exercise. Note, this is not an exhaustive summary,

More information

A (short) introduction to phylogenetics

A (short) introduction to phylogenetics A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field

More information

Supplementary Information

Supplementary Information Supplementary Information For the article"comparable system-level organization of Archaea and ukaryotes" by J. Podani, Z. N. Oltvai, H. Jeong, B. Tombor, A.-L. Barabási, and. Szathmáry (reference numbers

More information

Classification, Phylogeny yand Evolutionary History

Classification, Phylogeny yand Evolutionary History Classification, Phylogeny yand Evolutionary History The diversity of life is great. To communicate about it, there must be a scheme for organization. There are many species that would be difficult to organize

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 Hierarchical clustering Most algorithms for hierarchical clustering

More information

Applications of Genetics to Conservation Biology

Applications of Genetics to Conservation Biology Applications of Genetics to Conservation Biology Molecular Taxonomy Populations, Gene Flow, Phylogeography Relatedness - Kinship, Paternity, Individual ID Conservation Biology Population biology Physiology

More information

The Nature of Geographic Data

The Nature of Geographic Data 4 The Nature of Geographic Data OVERVIEW Elaborates on the spatial is special theme Focuses on how phenomena vary across space and the general nature of geographic variation Describes the main principles

More information

Statistical Analysis of fmrl Data

Statistical Analysis of fmrl Data Statistical Analysis of fmrl Data F. Gregory Ashby The MIT Press Cambridge, Massachusetts London, England Preface xi Acronyms xv 1 Introduction 1 What Is fmri? 2 The Scanning Session 4 Experimental Design

More information