Weighted compromise trees: a method to summarize competing phylogenetic hypotheses

Similar documents
Letter to the Editor. The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1

Points of View Matrix Representation with Parsimony, Taxonomic Congruence, and Total Evidence

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2009 University of California, Berkeley

Parsimony via Consensus

Effects of Gap Open and Gap Extension Penalties

Consensus Methods. * You are only responsible for the first two

(Stevens 1991) 1. morphological characters should be assumed to be quantitative unless demonstrated otherwise

Evaluating phylogenetic hypotheses

Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30

Ratio of explanatory power (REP): A new measure of group support

Letter to the Editor. Department of Biology, Arizona State University

Distinctions between optimal and expected support. Ward C. Wheeler

Phylogenetic inference

Molecular Phylogenetics and Evolution

Axiomatic Opportunities and Obstacles for Inferring a Species Tree from Gene Trees

Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies

C3020 Molecular Evolution. Exercises #3: Phylogenetics

How to read and make phylogenetic trees Zuzana Starostová

Dr. Amira A. AL-Hosary

A data based parsimony method of cophylogenetic analysis

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Can taxon-sampling effects be minimized by using branch supports? P. Hovenkamp

A Fitness Distance Correlation Measure for Evolutionary Trees

A Chain Is No Stronger than Its Weakest Link: Double Decay Analysis of Phylogenetic Hypotheses

Reconstructing the history of lineages

Inferring phylogeny. Today s topics. Milestones of molecular evolution studies Contributions to molecular evolution

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

X X (2) X Pr(X = x θ) (3)

Increasing Data Transparency and Estimating Phylogenetic Uncertainty in Supertrees: Approaches Using Nonparametric Bootstrapping

Assessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition

Assessing Congruence Among Ultrametric Distance Matrices

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004,

A Phylogenetic Network Construction due to Constrained Recombination

arxiv: v1 [q-bio.pe] 6 Jun 2013

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Cladistics. Measures of stratigraphic fit to phylogeny and their sensitivity to tree size, tree shape, and scale

Phylogenetic Tree Reconstruction

The Information Content of Trees and Their Matrix Representations

Constructing Evolutionary/Phylogenetic Trees

8/23/2014. Phylogeny and the Tree of Life

Phylogenies Scores for Exhaustive Maximum Likelihood and Parsimony Scores Searches

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2016 University of California, Berkeley. Parsimony & Likelihood [draft]

Assessing Progress in Systematics with Continuous Jackknife Function Analysis

Constructing Evolutionary/Phylogenetic Trees

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Combining Data Sets with Different Phylogenetic Histories

Consensus methods. Strict consensus methods

Computational aspects of host parasite phylogenies Jamie Stevens Date received (in revised form): 14th June 2004

Integrating Fossils into Phylogenies. Throughout the 20th century, the relationship between paleontology and evolutionary biology has been strained.

Concepts and Methods in Molecular Divergence Time Estimation

How should we organize the diversity of animal life?

Non-independence in Statistical Tests for Discrete Cross-species Data

Bootstrapping and Tree reliability. Biol4230 Tues, March 13, 2018 Bill Pearson Pinn 6-057

What is Phylogenetics

Phylogenetic analyses. Kirsi Kostamo

Consistency Index (CI)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Phylogenetics in the Age of Genomics: Prospects and Challenges

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

A (short) introduction to phylogenetics

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

,...,.,.,,.,...,.,...,...,.,.,...

Towards a general model of superorganism life history

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Lecture 6 Phylogenetic Inference

Algorithms in Bioinformatics

reconciling trees Stefanie Hartmann postdoc, Todd Vision s lab University of North Carolina the data

Integrative Biology 200A "PRINCIPLES OF PHYLOGENETICS" Spring 2008

Total Evidence Or Taxonomic Congruence: Cladistics Or Consensus Classification

THE THREE-STATE PERFECT PHYLOGENY PROBLEM REDUCES TO 2-SAT

ESS 345 Ichthyology. Systematic Ichthyology Part II Not in Book

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

OEB 181: Systematics. Catalog Number: 5459

Isolating - A New Resampling Method for Gene Order Data

INCREASED RATES OF MOLECULAR EVOLUTION IN AN EQUATORIAL PLANT CLADE: AN EFFECT OF ENVIRONMENT OR PHYLOGENETIC NONINDEPENDENCE?

THE TRIPLES DISTANCE FOR ROOTED BIFURCATING PHYLOGENETIC TREES

arxiv: v1 [q-bio.pe] 1 Jun 2014

Bootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996

Phylogenetic hypotheses and the utility of multiple sequence alignment

Phylogeny of the Drosophila saltans Species Group Based on Combined Analysis of Nuclear and Mitochondrial DNA Sequences

Macroevolution Part I: Phylogenies

Chapter 9 BAYESIAN SUPERTREES. Fredrik Ronquist, John P. Huelsenbeck, and Tom Britton

Lecture V Phylogeny and Systematics Dr. Kopeny

Bayesian support is larger than bootstrap support in phylogenetic inference: a mathematical argument

ASYMMETRIES IN PHYLOGENETIC DIVERSIFICATION AND CHARACTER CHANGE CAN BE UNTANGLED

A phylogenomic toolbox for assembling the tree of life

COMPUTING LARGE PHYLOGENIES WITH STATISTICAL METHODS: PROBLEMS & SOLUTIONS

On conditioned reconstruction, gene content data, and the recovery of fusion genomes

PHYLOGENY & THE TREE OF LIFE

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Classification, Phylogeny yand Evolutionary History

Cladistics and Bioinformatics Questions 2013

RECONSTRUCTING THE HISTORY OF HOST-PARASITE ASSOCIATIONS USING GENF, RAIJRED PARSIMONY

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

Bayesian Models for Phylogenetic Trees

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Transcription:

Cladistics Cladistics 29 (2013) 309 314 10.1111/cla.12000 Weighted compromise trees: a method to summarize competing phylogenetic hypotheses Michael J. Sharkey a, *, Stephanie Stoelb a, Daniel R. Miranda-Esquivel b and Barabara J. Sharanowski c a University of Kentucky, Department of Entomology, S225 Agricultural Science Center North, Lexington, KY 40546-0091, USA; b Laboratorio de Sistema tica y Biogeografı a, Escuela de Biologı a, Universidad Industrial de Santander, Bucaramanga, Santander, Colombia; c University of Manitoba, Department of Entomology, Winnipeg, Manitoba, Canada, R3T 2N Accepted 20 August 2012 Abstract A new consensus method for summarizing competing phylogenetic hypotheses, ed compromise, is described. The method corrects for a bias inherent in majority-rule consensus compromise trees when the source trees exhibit non-independence due to ambiguity in terminal clades. Suggestions are given for its employment in parsimony analyses and tree resampling strategies such as bootstrapping and jackknifing. An R function is described that can be used with the programming language R to produce the consensus. Ó The Willi Hennig Society 2012. *Corresponding author: E-mail address: msharkey@uky.edu Nixon and Carpenter (1996) wrote that the only true consensus method is a strict consensus, whereas alternatives such as majority-rule consensus should rightfully be referred to as compromise trees; we follow their terminology here. These other methods, which include Adams (Adams, 1972), majority-rule (Margush and McMorris, 1981), Nelson (Nelson, 1979; Page, 1989; Nixon and Carpenter, 1996) and Bremer or semi-strict (Bremer, 1990; Swofford, 1990) compromise methods, are generally employed when a strict consensus is too poorly resolved for the purposes of the study and a compromise must be made to effect the desired resolution. Consensus and compromise trees may also be useful for highlighting agreement or ambiguity among source cladograms, or summarizing phylogenetic information from multiple lines of evidence (Miyamoto, 1985; Carpenter, 1988; Anderberg and Tehler, 1990; Wilkinson, 1994). Strict consensus trees retain only those nodes that are present in all source trees, and thus represent unequivocal statements about phylogenetic relationships (Nixon and Carpenter, 1996). However, almost all resolution of evolutionary relationships can be lost when a rogue or wildcard taxon is variably placed throughout the source trees (Page, 1989; Bremer, 1990; Nixon and Wheeler, 1991; Wilkinson, 1994; Sumrall et al., 2001; Wilkinson and Thorley, 2001). Adams compromise is particularly useful for identifying rogue taxa, but has been criticized (Page, 1989; Swofford, 1991) for recovering nodes that are not present in any of the source trees (but see Wilkinson, 1994). Majority-rule compromise (MRC) trees retain clades found in a userdefined frequency of source trees (typically 50%) and they are often more resolved than strict consensus trees. They are widely employed in statistically based phylogenetic methods such as Bayesian inference (Huelsenbeck and Ronquist, 2001; Ronquist and Huelsenbeck, 2003) as well as for summarizing trees from resampling techniques, such as bootstrap and jackknife (Efron, 1982; Felsenstein, 1985). However, the frequency of nodes present in the majority of source trees is not always an adequate criterion for inclusion in a compromise tree (Barrett et al., 1991; Sharkey and Leathers, 2001). Comprehensive reviews on consensus methods, Ó The Willi Hennig Society 2012

310 M.J. Sharkey et al. / Cladistics 29 (2013) 309 314 particularly for summarizing information from multiple data sets, can be found in Swofford (1991), Nixon and Carpenter (1996), Steel et al. (2000) and Bryant (2003). A short definition is warranted to assist readers of this article. A component is any group of taxa defined by a node in a tree, regardless of the interrelationships among the taxa included in the component. Thus in Fig. 1 the clades (E(FG)) and (F(EG)) both define the component (EFG). MRC has been criticized for including components that are only present in the majority of trees due to nonindependence of the source trees (Sharkey and Leathers, 2001; Sumrall et al., 2001). To demonstrate the lack of independence in MRC, a contrived dataset is presented in Table 1, which results in three most-parsimonious trees (Fig. 1a, trees 1 3) from a traditional heuristic search in TNT (Goloboff et al., 2008) (50 replications RAS, 2 trees saved per replication, TBR, rseed = 0). The strict consensus and MRC trees are presented in Fig. 1b and c, respectively. An examination of the three source trees (Fig. 1a) reveals changes in the placement of terminal taxa C, D, E, F and G leading to little resolution in the strict consensus (Fig. 1b). In the MRC tree the relationships (BC) and (D(E(FG))) are recovered because they are present in two of the three trees (trees 1 and 2) and thus meet the majority criterion. Tree 1 and tree 2 differ only in the placement of taxon G. The basal split, (A((BC) (DEFG))), is identical in these two trees, but the interrelationships within the more apical component (EFG) are ambiguous. Source trees 1 and 2 are not independent, i.e. their shared basal split results in two solutions for the more derived clade (EFG). The alternative basal split, ((A)(DBCEFG)), found in source Table 1 Contrived data matrix for the trees presented in Fig. 1 111111111122222 Taxon 123456789012345678901234 A 000000000000000000000000 B 111001110111110000100000 C 111000010111110000011010 D 111110000001111000000101 E 111111010111111110111111 F 111111101011101101011110 G 111111000011111101111110 tree 3 gives a unique solution to clade (EFG) and thus this basal topology is recovered only once as a mostparsimonious solution. The lack of independence between trees 1 and 2 affects the MRC, leading to a higher frequency of the basal split shared by trees 1 and 2 in the source trees. Thus the relationship (A((BC)(D(E(FG))))) is recovered in the MRC (Fig. 1c). This can be argued in evolutionary terms. In this example there are two equally supported basal splits. The first is that (BC) is sister to (DEFG), and in this case the latter clade can be resolved in two maximally parsimonious ways. The second basal split, the alternative hypothesis (Tree 3), is that D is the sister of (BCEFG), and in this case the latter clade has a unique maximally parsimonious solution. We see no justification for equivocal terminal clades, as in trees 1 and 2, adding support to a basal split. Viewed from an historical perspective, there are two equally supported resolutions of the outgroup, and the resolution of the ingroup (BCDEFG) is dependent on each of these. (a) (b) (c) (d) Fig. 1. (a) Three minimum-length trees from the data set of Table 1. Numbers in the boxes surrounding the trees are the s of the trees. (b) Strict consensus of the three trees in (a). (c) Majority-rule compromise of the three trees in (a). (d) Weighted compromise of the three trees in (a).

M.J. Sharkey et al. / Cladistics 29 (2013) 309 314 311 MRC (and majority-rule bootstrap and jackknife compromise) continues to be published as a preferred tree (e.g. Highton et al., 2002; Martin and Tooley, 2003; Cadotte et al., 2008; Grande et al., 2008) and is frequently utilized to summarize the posterior distribution of trees in Bayesian inference (Huelsenbeck et al., 2002). This paper introduces a compromise method, entitled ed compromise, with the aim of correcting a bias in MRC. We hope that ed compromise will be employed as a useful way to summarize the source trees resulting from phylogenetic analyses when resolution of the strict consensus is insufficient. Weighted compromise An example is described below based on the data set in Table 2 and the five most-parsimonious (MP) trees that were recovered (Fig. 2a e) from a heuristic search using TNT (Goloboff et al., 2008) (50 replications, 2 trees saved per replication, TBR, rseed = 0). Weighted compromise can be described in terms of conditional probabilities. The probabilities below are based on the source trees in Fig. 2 and the logic may be followed in the graph in Fig. 3. Given that one of the five trees in Fig. 2 is correct, the probability at node A in the graph of Fig. 4 is 1, because it is found in all five trees: The probability at node B on the graph is P(B) = P(A) P(B A) = 1 0.5 = 0.5. The probability at node C on the graph is P(C) = P(A) P(C A) = 1 0.5 = 0.5. The probability at node D on the graph is P(D) = P (A) P (C A) P(D A,C) = 1 0.5 0.5 = 0.25. The probability of Tree a is P(Tree a) = P(A) P(B A) P(Tree a A,B) = 1 0.5 0.5 = 0.25. The probability of Tree b is, P(Tree b) = P(A) P(B A) P(Tree b A,B) = 1 0.5 0.5 = 0.25. The probability of Tree c is, P(Tree c) = P(A) P(C A) P(Tree c A,C) = 1 0.5 0.5 = 0.25. Table 2 Character matrix for the trees in Fig. 2 11111111112222222222 Taxon 12345678901234567890123456789 P 01111111111111011000111111111 Q 00010000000000010000000000000 R 11000001110100100111000001122 S 11000001110001101011000001100 T 00001001110100100101001001122 U 00001000110100001011001000111 V 00000100101010100102000100111 W 00000100101010101011000110111 X 00110010101010001111110010111 Y 00110010101011100102110010111 Z 00000000010011100102000000011 The probability of Tree d is, P(Tree d) = P(A) P(C A) P(D A,C) P(Tree d A,C,D) = 1 0.5 0.5 0.5 = 0.125. The probability of Tree e is, P(Tree e) = P(A) P(C A) P(D A,C) P(Tree e A,C,D) = 1 0.5 0.5 0.5 = 0.125. The probabilities of trees a e are entered into Table 3 to arrive at the summed for each component (clade). Those components with a value > 0.5 are included in the ed compromise tree (Fig. 4c). Note that the resolution of the ed compromise tree is intermediate between those of the strict and MRC trees. We are hesitant to describe this as a probabilistic approach because these probabilities are conditional on the assumption that one of the five source trees is correct, and of course this is usually unknown. Thus we prefer to refer to the resulting compromise as ed rather than probabilistic. Discussion MRC trees had an intuitive appeal; under the assumptions that all source trees are independent of each other and all are equally supported, i.e. the set of minimum-length trees, then those components that are present in more than 50% of the source trees are better supported than those that are not. Sharkey and Leathers (2001) and Sumrall et al. (2001) demonstrated that MRC source trees are not independent due to equivocal apical topologies causing basal splits to be overed. Hence, components that are present in more than 50% of source trees are not necessarily better supported than those that are not. Here we have corrected the dependence issue by down-ing interdependent source trees. Given that the source data are now independent we maintain that ed compromise trees represent an accurate measure of the degree of support for each component found in the source trees. Consensus and compromise trees would never have been fathomed if phylogenetic analyses always resulted in a unique hypothesis; neither would most of the various character ing methods that abound in the literature. The trees produced employing character s may also be considered compromise trees, as they compromise the principal of parsimony. We here suggest that, rather than selecting a preferred tree from a set of source trees or ing characters employing any of the myriad of methods available, ed consensus trees might first be investigated to see if they supply the necessary resolution to answer the questions being addressed. The resolution of a ed compromise tree is bounded by those of the respective strict (Figs 1b and 4a) and MRC trees (Figs 1c and 4b). The upper limit

312 M.J. Sharkey et al. / Cladistics 29 (2013) 309 314 (a) (b) (c) (d) (e) Fig. 2. The five minimum-length trees from the data set of Table 2. Numbers in the boxes surrounding the trees are the s of the trees. Fig. 3. Probability graph showing derivation of the s of the five trees in Fig. 2. Nodes are identified by capital letters. on resolution is set by the majority-rule tree and the lower limit by the strict consensus. The most basal components with conflicts in the source trees will never be resolved in a ed compromise tree as there will always be two or more ties at this point. Given only two source trees, the strict consensus, the MRC and the ed compromise will have the same topology. Another approach to ed compromise is to include all clades in the ed compromise tree with the highest s even if they are not over 0.5. For example, the highest for resolution within a clade might only be 0.33, but if this is higher than all other alternatives it may be included in the ed compromise tree. Like strict consensus trees and compromise methods, ed compromise trees may not represent an optimal tree from which one can directly draw all phylogenetic conclusions, but may do so in many cases.

M.J. Sharkey et al. / Cladistics 29 (2013) 309 314 313 (a) (b) (c) Fig. 4. (a) Strict consensus of the five source trees in Fig. 2. (b) Majority-rule compromise of the five source trees in Fig. 2. (c) Weighted compromise of the five source trees in Fig. 2. Table 3 Weights for the five trees in Fig. 2 and s for all components in the five trees Component Tree a Tree b Tree c Tree d Tree e 1 (UTRSZVWYPX) 0.25 0.25 0.125 0.125 0.25 1.0 2 (TRSZVWYPX) 0.25 0.25 N A N A N A 0.5 3 (TRS) 0.25 0.25 0.125 0.125 0.25 1.0 4 (RS) 0.25 0.25 0.125 0.125 0.25 1.0 5 (ZVWYPX) 0.25 0.25 N A N A N A 0.5 6 (VWYPX) 0.25 0.25 0.125 0.125 N A 0.75 7 (VW) 0.25 N A 0.125 N A N A 0.375 8 (YPX) 0.25 0.25 0.125 0.125 0.25 1.0 9 (PX) 0.25 N A 0.125 N A N A 0.375 10 (WPXY) N A 0.25 N A 0.125 0.25 0.625 11 (XY) N A 0.25 N A 0.125 N A 0.375 12 (UTRSVWYPX) N A N A 0.125 0.125 0.25 0.5 13 (UTRS) N A N A 0.125 0.125 0.25 0.5 14 (UTRSWXPY) N A N A N A N A 0.25 0.25 15 (PY) N A N A N A N A 0.25 0.25 Total 9.00 N A, not applicable. Component Program To calculate the ed compromise, D.R.M.-E. developed the program wconsensus, using the programing language R (R Development Core Team, 2011). It can be implemented using the R package APE (Paradis et al., 2004). The function calculates the ed compromise and presents either the MRC or the ed compromise with the nodes collapsed according to a cut value (ed compromise values at the nodes). The default cut value is 0.5, but this can be modified by the user. The input data (the initial trees in Newick format only) are entered as a multiphylo object, and a collapsed ed compromise tree is output according to the calculated s. This same output can be calculated by using the R-code (wconsensus-test.r) in the console command line or R command line. The wconsensus function and R code are available at http://code.google.com/p/wconsensus/, under a GPL 3.x licence, and instructions for their use are included. Acknowledgements We thank Mark Wilkinson, Graham Jones, Tom Shearin and members of the Hymenoptera Institute for comments on the manuscript; financial support was received via NSF grants EF-0337220 and DEB-0542864. D.R.M.-E. is indebted to Divisio n de Investigacio n y Extensio n, Facultad de salud, Universidad Industrial de Santander (project 5658) and Divisio n de Investigacio n y Extensio n, Facultad de Ciencias, Universidad Industrial de Santander (project 5132). References Adams, E.N. III, 1972. Consensus techniques and the comparison of taxonomic trees. Syst. Zool. 21, 390 397. Anderberg, A.A., Tehler, A., 1990. Consensus trees, a necessity in taxonomic practice. Cladistics 6, 399 402. Barrett, M., Donoghue, M.J., Sober, E., 1991. Against consensus. Syst. Zool. 40, 486 493.

314 M.J. Sharkey et al. / Cladistics 29 (2013) 309 314 Bremer, K., 1990. Combinable component consensus. Cladistics 6, 369 372. Bryant, D., 2003. A classification of consensus methods for phylogenetics. In: Janowitz, M., Lapointe, F.J., McMorris, F.R., Mirkin, B., Roberts, F.S. (Eds.), Bioconsensus. DIMACS-AMS, pp. 163 184. Cadotte, M.W., Cardinale, B.J., Oakley, T.H., 2008. Evolutionary history and the effect of biodiversity on plant productivity. Proc. Natl. Acad. Sci. U.S.A. 105, 17012 17017. Carpenter, J.M., 1988. Choosing among multiple equally parsimonious cladograms. Cladistics 4, 291 296. Efron, B., 1982. The Jackknife, the Bootstrap and Other Resampling Plans. CBMS-NSF Regional Conference Series in Applied Mathematics, Monograph 38, SIAM, Philadelphia. Felsenstein, J., 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39, 783 791. Goloboff, P.A., Farris, J.S., Nixon, K.C., 2008. TNT, a free program for phylogenetic analysis. Cladistics, 24, 774 786. Grande, C., Templado, J., Zardoya, R., 2008. Evolution of gastropod mitochondrial genome arrangements. BMC Evol. Biol. 8, 61. Highton, R., Hedges, S.B., Hass, C.A., Dowling, H.G., 2002. Snake relationships revealed by slowly-evolving proteins: further analysis and a reply. Herpetologica 58, 270 275. Huelsenbeck, J.P., Ronquist, F., 2001. MRBAYES: Bayesian inference of phylogeny. Bioinformatics 17, 754 755. Huelsenbeck, J.P., Larget, B., Miller, R.E., Ronquist, F., 2002. Potential applications and pitfalls of Bayesian inference of phylogeny. Syst. Biol. 51, 673 688. Margush, T., McMorris, F.R., 1981. Consensus n-trees. Bull. Math. Biol. 43, 239 244. Martin, F.N., Tooley, P.W., 2003. Phylogenetic relationships among Phytophthora species inferred from sequence analysis of mitochondrially encoded cytochrome oxidase I and II genes. Mycologia 95, 269 284. Miyamoto, M.M., 1985. Consensus cladograms and general classifications. Cladistics 1, 186 189. Nelson, G.J., 1979. Cladistic analysis and synthesis: principles and definitions, with a historical note on AdansonÕs Families des Plantes (1763-1764). Syst. Zool. 28, 1 21. Nixon, K.C., Carpenter, J.M., 1996. On consensus, collapsibility, and clade concordance. Cladistics 12, 305 321. Nixon, K.C., Wheeler, Q.D. 1991. Extinction and the origin of species. In: Wheeler, Q.D., Novacek, M. (Eds.), Extinction and Phylogeny. Columbia University Press, New York, pp. 119 143. Page, R.D.M., 1989. Comments on component-compatibility in historical biogeography. Cladistics 5, 167 182. Paradis, E., Claude, J., Strimmer, K., 2004. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289 290. R Development Core Team. 2011. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, http://www.r-project.org. Ronquist, F., Huelsenbeck, J.P., 2003. MRBAYES 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572 1574. Sharkey, M.J., Leathers, J.W., 2001. Majority does not rule: the trouble with majority-rule consensus trees. Cladistics 17, 282 284. Steel, M., Dress, A.W.M., Bocker, S., 2000. Simple but fundamental limitations on supertree and consensus tree methods. Syst. Biol. 49, 363 368. Sumrall, C.D., Brochu, C.A., Merck, J.W. Jr, 2001. Global lability, regional resolution, and majority-rule consensus bias. Paleobiology 27, 254 261. Swofford, D.L. 1990. PAUP: phylogenetic analysis using parsimony, version 3.0q. Illinois Natural History Survey, Champaign, IL. Swofford, D.L., 1991. When are phylogeny estimates from molecular and morphological data incongruent? In: Miyamoto, M.M., Cracraft, J. (Eds.), Phylogenetic Analysis of DNA Sequences. Oxford University Press, New York, pp. 295 333. Wilkinson, M., 1994. Common cladistic information and its consensus representation: reduced Adams and reduced cladistic trees and profiles. Syst. Biol. 43, 343 368. Wilkinson, M., Thorley, J.L., 2001. Efficiency of strict consensus trees. Syst. Biol. 50, 610 612.