Pasan Samarasin-Dissanayake

Size: px
Start display at page:

Download "Pasan Samarasin-Dissanayake"

Transcription

1 Population Differentiation, Historical Demography and Evolutionary Relationships Among Widespread Common Chaffinch Populations (Fringilla coelebs ssp.) by Pasan Samarasin-Dissanayake A thesis submitted in conformity with the requirements for the degree of Master of Science Ecology and Evolutionary Biology University of Toronto Copyright by Pasan Samarasin-Dissanayake 2010

2 Population Differentiation, Historical Demography and Evolutionary Relationships Among Widespread Common Chaffinch Populations (Fringilla coelebs ssp.) Pasan Samarasin-Dissanayake Master of Science Department of Ecology and Evolutionary Biology University of Toronto 2010 Abstract Widespread species that occupy continents and oceanic islands provide an excellent opportunity to study evolutionary forces responsible for population divergence. Here, I use multilocus coalescent based population genetic and phylogenetic methods to infer the evolutionary history of the common chaffinch (Fringilla coelebs), a widespread Palearctic passerine species. My results showed strong population structure between Atlantic islands. However, the two European subspecies can be considered one panmictic population based on gene flow estimates. My investigation of effects of sampling on concatenated and Bayesian estimation of species tree (BEST) methods demonstrated that concatenation is more sensitive to sampling than BEST. Furthermore, concatenation can provide incorrect evolutionary relationships with high confidence when sample size is small. In conclusion, my results suggest European ancestry for the common chaffinch and Atlantic islands appear to have been colonized sequentially from north to south via Azores. ii

3 Acknowledgements First, I would like to thank Dr. Allan Baker for giving me an opportunity to work in his lab, giving me the independence to explore and learn, and providing guidance when I needed it. Thanks to Dr. Asher Cutter for helpful advice and allowing me to use his cluster for analysis. I am in debt of Oliver Haddrath for training me in molecular laboratory techniques, designing an informative molecular marker (ANON OH) and tremendous support in and out of the lab. Thank you for everyone in the lab including Dr. Erika Tavares, the most helpful postdoc one can imagine; Rosemary Gibson for supporting me in many ways, Alison Cloutier for fun and very educational times in the lab, and Yvonne Verkuil for assistance with analysis and some interesting discussions. Thank you to past and present postdocs Dr. Sergio Pereira and Dr. Debbie Buehler for their assistance, Kristen Choffe for DNA sequencing help, Cathy Dutton and Sue Chopra for administrative support. Thank you to Alivia Dey for friendship. I am very grateful to my parents for their constant support in whatever I chose to do and my sister for running some of my analysis on her computer and annoying me to write this thesis. iii

4 Table of Contents Abstract.. ii Acknowledgements.. iii Table of contents.. iv List of Figures.. vi List of Tables.. viii List of Appendices ix Chapter 1: General Introduction Molecular population genetics and molecular phylogenetics The common chaffinch (Fringilla coelebs) Thesis objectives References 9 Chapter 2: Population genetic structure and historical demography of the common chaffinch (Fringilla coelebs) Abstract Introduction Methods Sampling DNA extraction, amplification and sequencing Data analysis Results Summary statistics and neutrality tests Haplotype networks Genetic clusters and migration rates Effective population sizes and population expansion Discussion Population structure & subspecies status Effective population size and population expansion Conclusions References.. 47 iv

5 Chapter 3: Comparison of multilocus phylogenetic methods for inferring evolutionary relationships among recently diverged common chaffinch subspecies (Fringilla coelebs ssp.) Abstract Introduction Methods Sampling, DNA extraction, amplification and sequencing Data analysis Results Discussion Effects of sampling on phylogenetic inference Species tree estimates with some gene flow Conclusion References.. 83 Chapter 4: General conclusions References.. 91 v

6 List of Figures Chapter 1 Figure 1: Evolutionary relationships among Fringilla sp Figure 2: Distribution map of the common chaffinch (Fringilla coelebs) showing part of its total range... 5 Chapter 2 Figure 1: Map of sampled common chaffinch populations. 17 Figure 2: Median- joining haplotype network of the control region of mtdna. 29 Figure 3: Median- joining haplotype network of nuclear loci EF1α and PTPN. 30 Figure 4: Median- joining haplotype network of nuclear loci TROP and UBIQ 31 Figure 5: Plot of likelihood of data for assumed number of populations (K) in Structure Figure 6: Probabilistic assignment of individual genotypes to populations (K=5) in Structure. 33 Chapter 3 Figure 1: Discordance between gene trees and the species tree.. 57 Figure 2: Geographic locations of sampled common chaffinch subspecies (Fringilla coelebs ssp.). 60 Figure 3: Expected phylogenetic relationships among Atlantic island subspecies under different colonization hypothesis Figure 4: Evolutionary relationships among common chaffinch subspecies from concatenated Bayesian inference.. 70 Figure 5: Evolutionary relationships among common chaffinch subspecies from estimation of species tree (BEST) method 71 vi

7 Figure 6: The large data set molecular clock tree from Bayesian estimation of species tree (BEST) method Figure 7: Effect of sampling when multiple gene lineages persist in recently diverged populations Figure 8: Most probable colonization pattern of Atlantic Islands and Africa. 79 vii

8 List of Tables Chapter 2: Table 1: Sampled loci for population genetic analysis 20 Table 2: Population genetic summary statistics Table 3: Migration rate estimates between common chaffinch populations Table 4: Effective population size and population expansion. 37 Chapter 3: Table 1: Details of 9 nuclear loci sampled for phylogenetic analysis. 65 viii

9 List of Appendices Appendix 1: Details of the Chaffinch samples used in the study 92 ix

10 Chapter 1 General Introduction The study of evolutionary processes responsible for population divergence and speciation is a very important area of research in evolutionary biology. Investigation of structured populations is a key component in understanding how population divergence eventually leads to speciation. According to a simple allopatric model of speciation, an ancestral population splits into two daughter populations and a barrier restricts gene flow between the two daughter populations. In absence of gene flow, the two daughter populations go on different evolutionary trajectories as each population experience random mutations, drift and/or selection. As genetic differences between the two daughter populations accumulate, reproductive isolation may result due to genetic incompatibilities (Coyne and Orr 2004). Understanding details of this process from initial divergence to complete reproductive isolation is an ongoing research goal in evolutionary biology. But it generally takes a long time to go from initial divergence to reproductive isolation; hence study of earth s biodiversity is historical in nature. The fields of population genetics and phylogenetics attempt to understand historical evolutionary and demographic process responsible for current species distribution, and infer past population divergence events using current molecular data. 1.1 Molecular population genetics and molecular phylogenetics Historically, the fields of population genetics and phylogenetics have asked different questions and have employed different methods. While population genetics is concerned with evolutionary forces acting within a species, the primary goal of 1

11 phylogenetics is to infer how species relate to each other. However, these two fields can be thought as being part of a continuum that goes from initial divergence of populations to accumulation of genetic differences between those populations to reach reproductive isolation, hence crossing the species boundary according to the biological definition of species. Early on, both fields have extensively used mtdna for evolutionary inference. While population genetics moved to microsatellites, phylogenetics required sequence based markers and moved to nuclear markers (e.g. introns). However, recent advances in DNA sequencing technology have slowly shifted population genetic studies to use more sequence based markers than microsatellites because sequences are much more information-rich than length polymorphisms. In addition, these two fields have come together in terms of methodology as coalescent theory has come to the forefront of phylogenetic inference methods. The utility of coalescent theory, the most active area of research in theoretical population genetics in the last few decades, provides a promising avenue to infer population genetic parameters and test evolutionary hypotheses. The development primarily attributed to Kingman (1982), the coalescent theory traces a sample of sequences from current generation back to its common ancestor. In the coalescent framework, gene genealogies are used to estimate population genetic parameters by treating genealogies as a random variable (Rosenberg and Nordborg 2002). The coalescent genealogy samplers estimate parameters of interest by making large collections of probable genealogies from data and integrating all sampled genealogies to estimate population genetic parameters. Herein lies the fundamental difference between traditional phylogenetics and theoretical population genetics approaches. Although both use gene genealogies, the inferred tree from the data is the primary interest in 2

12 phylogenetics, while trees are considered random variables in population genetics to estimate important population genetic parameters. The population genetic parameters estimated using the coalescent model provide more biologically realistic estimates with appropriate confidence intervals than estimates from classical summary statistics (Kuhner 2009). Also, coalescent simulations allow us to test demographic hypotheses, such as conformity to a certain population divergence pattern, and test alternative timing for divergence or population bottlenecks. The recognition that coalescent models can be extended from the population level to the species level has given rise to new phylogenetic methods for estimating species trees based on the coalescent such as Bayesian estimation of species trees (BEST) (Liu 2008), Minimize deep coalescence (MDC) (Maddison 1997), ESP-COAL (Degnan and Salter 2005; Carstens and Knowles 2007). These new methods are best suited for inferring evolutionary relationships between recently diverged groups because any single gene tree may not represent the actual species tree in these situations. Therefore, the methods incorporate uncertainty in gene genealogies in the estimate of the phylogeny. As a result, the fields of phylogenetics and population genetics are moving towards multilocus sequence-based coalescent approaches from single locus approaches that were predominant in most of its history (Brito and Edwards 2008). Use of multiple loci provides independent parameter estimates from each locus and gives far better estimates with narrow confidence intervals (Pluzhnikov and Donnelly 1996; Baker 2007). Therefore, there has been an emphasis on using multilocus data for historical evolutionary inference. 3

13 1.2 The common chaffinch (Fringilla coelebs) The genus Fringilla contains three species of finches that feed their young on insects, rather than seeds. They are seed-eating passerines, restricted to the Old World. The three members of this genus are the Brambling (Fringilla montifringilla), Blue chaffinch (Fringilla teydea) and the Common chaffinch (Fringilla coelebs). Figure 1 shows the evolutionary relationships of species in this genus. The common chaffinch and the blue chaffinch are sister species, and the brambling is sister to the two chaffinches. The blue chaffinch is endemic to high elevation pine forests in Canaries (Grant 1979), while the brambling occupies forests of Northern Europe and Asia. Figure 1: Evolutionary relationships among Fringilla sp. The common chaffinch is a widespread Palearctic species that is classified into many subspecies (>19), based on marked phenotypic, behavioural and geographic differences. There are two described subspecies from North Africa (F. c. africana and F. c. spodiogenys), two from Europe (F. c. coelebs and F. c. gengleri) and five from the Atlantic islands (F. c. moreletti, F. c. maderensis, F. c. canariensis, F. c. ombriosa, and F. c. palmae). The Atlantic islands are volcanic islands off the coast of Northwest Africa 4

14 and thought to have formed within the last 20 million years (Carracedo 1999). The Canary Islands are estimated to have formed 1-20 million years ago. The youngest island in the Canaries archipelago is El Hierro (home to F. c. ombriosa), which is estimated to be 1.1 million years old. La Palma (home to F. c. palmae) is estimated to be 2 million years old, while La Gomera (home to F. c. canariensis) is about 12 million years old (Carracedo 1999). The Azores and Madeira islands are thought to have formed about 5 million years ago. The Canary Islands are located about 100 km off the coast of Northwest Africa. Madeira is located between the Azores and Canaries, about 800 km from the Azores and 400 km from the Canaries. The Azores archipelago extends approximately 600 km in Northwest-southeast direction and is about 1,500 km from Portugal (Figure 2). Figure 2: Distribution map of the common chaffinch (Fringilla coelebs) showing part of its total range. The populations are named corresponding to their subspecies designations. 5

15 Based on the estimated origins of Atlantic islands, the common chaffinch populations are believed to have diverged within the last million years. There are substantial morphological and behavioural differences between island and continental types (Grant 1979; Dennison and Baker 1991; Lynch and Baker 1994), with island populations having evolved shorter wings, longer legs, longer bills and larger bodies than continental conspecifics (Grant 1979). In addition, island populations are characterized by extensive dark blue dorsal plumage, wings and tails in males (Grant 1979). Grant (1979) also found that beak depth and width have increased substantially in Azores birds, compared to the Canaries; and birds from the Canaries appear to be less variable than from the Azores for bill, wing and leg characters. However, Dennison and Baker (1991) used 12 morphometric measurements taken from larger samples and did not find statistically significant differences between the Canary and Azores populations in their total variance. They did noted that total variance in Canaries was lower than Azores and continental populations. Lynch and Baker (1994) investigated the cultural evolution of chaffinch populations by analyzing song memes. Their analysis revealed that the African and European populations had the lowest levels of meme differentiation between populations. The Azores population had the highest level of meme diversity and Canaries showed the highest level of among population differentiation. The high levels of memetic differentiation between Atlantic islands shown in this study suggest limited migration between islands (Lynch and Baker 1994). A previous phylogenetic analysis using mtdna suggested that Atlantic islands were sequentially colonized in a single wave via Azores (Marshall and Baker 1999). 6

16 However, the bootstrap support for some of the internal nodes in this phylogeny was low and incomplete lineage sorting may have swamped the phylogenetic signal, yielding incorrect evolutionary relationships by traditional methods. A population genetic study to infer the location of Pleistocene refugia using the same locus estimated moderate gene flow between some African and European populations, but parameter estimates had large confidence intervals since they were inferred from a single locus. However, the analysis suggested that Iberia, Greece and North Africa may have served as refugia during the Pleistocene glacial maximum (Griswold and Baker 2002). This study however did not sample any of the Atlantic island populations. 1.3 Thesis objectives A new method of phylogenetic inference based on coalescence was proposed by Liu and Pearl (2007), which is designed to rectify the problem of inferring evolutionary relationships when incomplete lineage sorting is present. This method has performed well in simulations and in a few empirical studies, showing potential for inferring correct evolutionary relationships for recently diverged species. Furthermore, coalescent theory allows us to estimate important population genetic parameters such as the scaled population mutation rate (θ), migration rates and historical population size changes (i.e. population growth/ bottlenecks). I employ these latest phylogenetic and population genetic tools to determine the evolutionary history of the common chaffinch, which can further our understanding of the speciation process in oceanic islands, which is the primary focus of my thesis. In chapter 2, I use molecular population genetic tools to examine population structuring and historical demography of common chaffinch populations. I use the 7

17 Bayesian genetic clustering program Structure to examine whether described populations based on morphology correspond to genetic populations. In a coalescent framework, I estimate migration rates among populations and investigate the possibility of recent population growth. The primary objective was to determine the level of population differentiation among putative populations (specifically Atlantic island populations) and whether morphological subspecies correspond to genetic clusters. In addition, I investigate the demographic history of these populations by modeling population size changes in the recent past. One possibility is that these island populations have been isolated for thousands of generations, with very limited gene flow between them, essentially functioning as allopatric incipient species. Another possibility is that these populations have diverged very recently, experience ongoing gene flow, and function as demes. In between these two extremes is the possibility that these populations have diverged relatively recently, have limited ongoing gene flow, and possibly are in the process of acquiring reproductive isolation, in which case, we may be seeing speciation in progress. In chapter 3, I investigate evolutionary relationships between common chaffinch populations by constructing multilocus intraspecific phylogenies. I use the popular phylogenetic package MrBayes to construct trees from concatenated sequence data and also by using the new coalescent based phylogenetic inference method implemented in the program BEST. First, I investigate effects of sampling on concatenated phylogenetic inference by comparing phylogenies constructed using few (4) sequences per population versus many (16) sequences per population. It is a common practice in phylogenetics to use a single sequence to represent the species because interspecific sequence variation is 8

18 expected to be significantly higher than intraspecific variation. But for recently diverged groups, this may result in incorrect evolutionary inferences due to the stochasticity of the lineage sorting process. Therefore, I use a new coalescent-based phylogenetic method (BEST) (Liu and Pearl 2007; Liu 2008), which accounts for incomplete lineage sorting to infer evolutionary relationships. I compare and contrast estimated phylogenies from each method to determine the most likely evolutionary relationships among chaffinch populations. I summarize my findings from the population genetic and phylogenetic analyses in chapter 4 to elucidate the evolutionary history of the common chaffinch. I also discuss future research directions in phylogenetic and population genetic inference methods to improve accuracy of historical inferences. Chapter 2 and 3 are written as two primary research articles to be published in separate scientific journals. 1.4 References Baker, A. J Molecular advances in the study of geographic variation and speciation in birds. Auk 124: Brito, P. H., and S. V. Edwards Multilocus phylogeography and phylogenetics using sequence-based markers. Genetica:1-17. Carracedo, J. C Growth, structure, instability and collapse of Canarian volcanoes and comparisons with Hawaiian volcanoes. J Volcanol Geotherm Res 94:

19 Carstens, B. C., and L. L. Knowles Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting: An example from melanoplus grasshoppers. Syst Biol 56: Coyne, J. A., and H. A. Orr Speciation. Sinauer Associates, Sunderland, MA. Degnan, J. H., and L. A. Salter Gene tree distributions under the coalescent process. Evolution 59: Dennison, M. D., and A. J. Baker Morphometric variability in continental and Atlantic island populations of chaffinches (Fringilla coelebs). Evolution 45: Grant, P. R Evolution of the Chaffinch, Fringilla coelebs, on the Atlantic islands. Biol J Linn Soc 11: Griswold, C. K., and A. J. Baker Time to the most recent common ancestor and divergence times of populations of common chaffinches (Fringilla coelebs) in Europe and North Africa: Insights into Pleistocene refugia and current levels of migration. Evolution 56: Kingman, J. F. C On the genealogy of large populations. J Appl Probab 19A: Kuhner, M. K Coalescent genealogy samplers: windows into population history. Trends Ecol Evol 24: Liu, L BEST: Bayesian estimation of species trees under the coalescent model. Bioinformatics 24: Liu, L., and D. K. Pearl Species trees from gene trees: Reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions. Syst Biol 56:

20 Lynch, A., and A. J. Baker A population memetics approach to cultural evolution in chaffinch song: Differentiation among populations. Evolution 48: Maddison, W. P Gene trees in species trees. Syst Biol 46: Marshall, H. D., and A. J. Baker Colonization history of Atlantic island common chaffinches (Fringilla coelebs) revealed by mitochondrial DNA. Mol Phylogenet Evol 11: Pluzhnikov, A., and P. Donnelly Optimal sequencing strategies for surveying molecular genetic diversity. Genetics 144: Rosenberg, N. A., and M. Nordborg Genealogical trees, coalescent theory and the analysis of genetic polymorphisms. Nat Rev Genet 3:

21 Chapter 2 Population genetic structure and historical demography of the common chaffinch (Fringilla coelebs) 2.1 Abstract The widespread common chaffinch (Fringilla coelebs) is divided into over 19 subspecies based on morphological differences. Here, I investigated the genetic structure and demographic history of the common chaffinch populations in Western Europe, Northern Africa and the Atlantic islands using 10 sequence-based genetic markers (mtdna and 9 nuclear markers). The migration rate estimates from the multilocus coalescent analysis showed significant genetic differentiation among Atlantic island populations (4Nm<1). High levels of gene flow were detected between the British chaffinch (F. c. gengleri) and the Western European chaffinch (F. c. coelebs) (4Nm = 7.86). The mismatch distribution analysis of mtdna and multilocus analysis suggests that all populations (except F. c. ombriosa) have been growing at relatively equal rates. Our results suggest that the British chaffinch is part of the more widespread Western European chaffinch population. 2.2 Introduction The question of what constitutes a species is not without its share of controversy; and species delimitation continues to be a source of debate among biologists (Mallet 1995; Sites Jr and Marshall 2003; Hey 2006). Although the most commonly used definition of species today is the biological species concept (Mayr 1963), species are 12

22 frequently described based only on morphology. In addition to morphological and biological species concepts, other species concepts such as genealogical species (Baum and Shaw, 1995) rely on a phylogenetic framework, and define a species as a basal, exclusive group of organisms, whose members are more closely related to each other than they are to any organism outside the group. Furthermore, some have argued that species is a human classification constructed to divide the biodiversity continuum and not a real separate entity (Mallet 2008). Regardless, species is the best recognized unit of biodiversity in evolutionary biology and conservation. Even though the large majority of described species are morphological species, genetics has played an increasingly important role in species delimitation in the past few decades (Hebert et al. 2003; Sites Jr and Marshall 2003). In the continuum of biological organization from genes to populations to higher levels of organization such as phyla, subspecies represents an intermediate step between populations and species. In general, subspecies are populations that have sufficiently diverged in morphological traits to warrant a distinction, but these trait differences are not pronounced enough to classify them as distinct species. The common chaffinch (Fringilla coelebs), a widespread Palearctic passerine species, provides a great opportunity to study this continuum from structured populations to recognized species. Common chaffinches occupy much of Eurasia, Northern Africa and the Atlantic islands in the Canaries, Madeira and Azores, and these populations have undergone marked phenotypic differentiation to warrant designation of multiple subspecies (>19) (Grant 1979; Baker et al. 1990; Marshall and Baker 1999; Suárez et al. 2009). 13

23 The study of geographic variation in a species is often the first step in the quest to understand how a single ancestral species gives rise to two or more daughter species. Since the advent of phylogeography, mtdna has played a crucial role in furthering our understanding about genetic variation within species and it continues to play a critical role. mtdna is the molecule of choice for phylogeographic and population genetic studies because it is easy to amplify, does not recombine (but see Rokas et al. 2003; White et al. 2008) and mutates faster than the average nuclear locus. These properties make mtdna genes ideal for building gene genealogies and to estimate population genetic parameters. However, insight from coalescent theory predicts that gene trees from different loci can differ substantially in topology, and different genes can have different histories even though they come from the same populations (Maddison 1997; Wakeley 2008; Nielsen and Beaumont 2009). Therefore, population genetic parameters estimated from a single locus may not accurately reflect the true evolutionary history of the population (Hey and Machado 2003; Nielsen and Beaumont 2009). Additionally, population genetic parameters estimated from a single locus have very large 95% credible intervals due to stochastic variance in the coalescent process (Edwards and Beerli 2000; Baker 2007). Due to these reasons, over-interpretation of phylogeographies and population genetic parameter estimates from a single locus has been criticized (Knowles and Maddison 2002; Edwards et al. 2005). Furthermore, concerns have been raised about historical demography based solely on mtdna because mtdna genes could be affected by selective sweeps (Bensch et al. 2006) and sex biases in fitness or dispersal (Hare 2001). With rapidly advancing DNA sequencing technology and development of new 14

24 analytical tools based on coalescent theory, population genetic parameter estimates and interpretations from multiple loci can avoid problems inherent in single locus interpretations. Baker et al. (1990) conducted a population genetic survey of the common chaffinch using 22 polymorphic allozyme loci and showed that the highest level of genetic differentiation was between continental (Europe and Africa populations pooled together) and Atlantic island populations (F ST = 0.386). There was a high level of genetic differentiation among Atlantic islands (F ST = 0.321) but the European population was weakly differentiated from the African population (F ST = 0.092). Furthermore, populations in different islands of the Canaries have undergone significantly more differentiation compared to populations among Azores Islands. The mean F ST among Canary populations was while it was in Azores Islands (Baker et al. 1990). This suggests that even though distances between Azores islands are greater than Canary Islands, Azores populations are essentially panmictic. They also reported that the Azores Islands support much larger populations than the Canary Islands because of more agricultural land and a much more humid climate in the Azores. The widespread distribution of common chaffinches in continents and oceanic islands provide an excellent opportunity to investigate the effects of geographic isolation and genetic drift on population divergence and speciation. In this study, I focus on common chaffinch populations in Western Europe, Northern Africa and the Atlantic islands to investigate population structure and demographic history using coalescent- 15

25 based multilocus population genetic tools. For simplicity, I use current subspecies designations to refer to the putative populations because subspecies names correspond very closely to geographic locations (Figure 1). However, the validity of certain subspecies designations has been questioned. One such point of contention is the validity of the British chaffinch (F. c. gengleri) as a separate subspecies from the more widespread F. c. coelebs in rest of Western Europe. Others have questioned the validity of three subspecies from the Canary Islands. For example, Mayr (1968) assigns the El Hierro population as a separate subspecies (F. c. ombriosa) from the La Palma population (F. c. palmae), while Baker et al. (1990) claimed that these two populations were phenotypically indistinguishable. Here, I investigate genetic structure of the common chaffinch populations using haplotype networks, Bayesian genetic clustering and by estimating migration rates between populations. I also estimate the effective population size and investigate the possibility of historical population size changes. Population genetic parameters are estimated in a multilocus coalescent framework to avoid problems inherent to single locus methods, thus providing more reliable inferences about the history of the common chaffinch. 16

26 Figure 1: Distribution map of the common chaffinch (Fringilla coelebs) showing part of its total range. The populations are named corresponding to the subspecies designation. There are three putative populations in the Canaries, each occupying different islands. 2.3 Methods Sampling A total of 121 chaffinches were sampled from nine populations corresponding to their subspecies designation (Figure 1). Appendix I provide the details of the sampled individuals. Nineteen individuals were sampled from continental Europe (Denmark and Greece) that were classified as F. c. coelebs, eight individuals from the United Kingdom that were classified as F. c. gengleri, and fourteen individuals from Morocco and twelve from Tunisia, corresponding to F. c. africana and F. c. spodiogenys. From the Atlantic islands, fourteen individuals were sampled from the Azores islands population of F. c. moreletti, fourteen from the Madeira population of F. c. maderensis, and thirty-nine 17

27 individuals from the Canary Islands. Twelve of the Canary Island individuals were from the island of El Hierro, classified as the F. c. ombriosa subspecies; fourteen from La Palma, classified as the F. c. palmae subspecies and fourteen from La Gomera, classified as F. c. canariensis subspecies DNA extraction, amplification and sequencing Genomic DNA was extracted from frozen muscle tissue using the standard proteinase K-phenol-chloroform method (Sambrook and Russell 2001) or rapid alkaline extraction (Rudbeck and Dissing 1998). Briefly, rapid alkaline extraction was carried out by first adding a minute amount of muscle tissue (2-3 ul equivalent of blood) to a 96 well PCR plate containing 20 ul of 0.2 M NaOH. The plate was covered and heated to 75 C for 20 minutes in a thermocycler. Finally, the solution was neutralized by adding 180 ul of a 0.04 M Tris-HCl ph 7.5 solution and frozen. The control region of the mtdna and 9 nuclear loci were amplified using locus specific primers (Table 1). The amplified loci are: Aconitase II (ACON: Backström et al. 2008); B-Actin (B-ACT: Waltari and Edwards 2002); Elongation factor 1 alpha (EF1α: Backström et al. 2008); Glyceraldehyde-3-phosphate dehydrogenase (GAPD: Friesen et al. 1999); Locus L27331 (L27331: Backström et al. 2008); Protein tyrosine phosphatase non-receptor 12 (PTPN12; Townsend et al. 2008), Tropomyosin (TROP: Friesen et al. 1999); and Ubiquitin carboxyl-terminal esterase (UBIQ: Backström et al. 2008), Anonymous locus OH (ANON OH), and the Control region of mtdna (Marshall and 18

28 Baker 1997). All the nuclear loci with the exception of PTPN12 were non-coding. The anonymous locus OH was developed by the following method. First, Chaffinch genomic DNA was isolated, purified and digested using five different restriction enzymes which were selected to leave single stranded overhangs at each cut site. The enzymes used were Hind III, Eco RI, Xba I, Xho I and Nhe I. The digested DNA was then size selected and ligated to double stranded linkers containing specific single strand overhangs complementary to the restriction enzyme cut sites. The library that resulted represented the complete Chaffinch genome with the average fragment size being approximately 1,500 bp and each fragment having amplifiable ends. Using a primer specific for the retrotransposon CR1 and a primer specific for the linker, fragments were amplified, size selected and cloned. A number of clones were sequenced to identify flanking regions to the repetitive element. The flanking regions were blasted against the chicken genome to ensure they were present as a single copy, and from the candidate sequences primers were designed to amplify this region but to exclude the repetitive element. 19

29 Table 1: Sampled loci for population genetic analysis Locus Primer sequence Fragment size (bp) Chromosomal location in zebra finch Model of sequence evolution Reference Aconitase II (ACON) Anonymous locus OH (ANON OH) B-Actin (B- ACT) Elongation factor 1 alpha (EF1α) Glyceraldehyde 3 phosphase dehydrogenase (GAPD) Locus (L27331) Protein tyrosine phosphatase non-receptor 12 (PTPN12) Tropomyosin (TROP) Ubiquitin carboxyl-terminal esterase (UBIQ) mtdna Control region (CR) F- CCAATGCTTGTGGGCCATG R- ATTGCGACCTGTGAAATTCC 750 1A GTR+I Backström et al F- TCCCATTGCAACAACCTGTTCAC R- GGGCACTTCAGTCACTCTGAC GTR+I NA F- CCTGATGGTCAGGTCATCA R- CAGCAATGCCAGGGTACAT GTR+I+G Waltari and Edwards 2002 F- ATTGGCTACAACCCAGACAC R- CAGGATGCAGTCCAAGGCT HKY+G Backström et al F- ACCTTTAATGCGGGTGCTGGCATTGC R- CATCAAGTCCACAACACGGTTGCTGTA HKY+G Friesen et al F- CCTAGCTAAATATGTTCTGGC R- TAGGCTTCCTGATGATGGCT GTR+I+G Backström et al F- AGTTGCCTTGTWGAAGGRGATGC R- CTRGCAATKGACATYGGYAATAC 830 1A HKY+I Townsend et al F- GAGTTGGATCGGGCTCAGGAGCG R- CGGTCAGCCTCTTCAGCAATGTGCTT GTR+I Friesen et al F- GCTTGTGGGACAATTGGG R- TATTTGGCCCTCTCTTCAGG HKY+I Backström et al F- TCAGGGTATGTATAATATGC R- CACTTGCTGTGAAGAGC 480 NA GTR+I+G Baker and Marshall

30 Polymerase chain reactions (PCR) were carried out using 1.5 µl of DNA in a 12.5 µl of total reaction volume, with 1.25 µl of PCR buffer (10mM TrisHCl ph8.3, 2.5 mm MgCl, 50mM KCl and 0.01% gelatin), 0.28 µl of 1X dntp s, 0.5 µl of each primer, and 0.05 µl of Platinum Taq (5units/ µl) (Invitrogen Inc). The thermocycling profile was as follows: an initial 94 C denaturation step for 4 min, followed by a total of 35 cycles consisting of a 30 sec at 94 C denaturation step, a 30 sec annealing step starting at 65 C and decreasing by a degree per cycle until the annealing temperature reached 55 C, and 30 sec 72 C extension step, and a final extension of 5 min at 72 C. Amplified product was isolated by separation in a 2% agarose gel, the DNA band was cut out of the gel and purified for sequencing. Cycle sequencing reactions with forward and reverse PCR primers were carried out using BigDye Terminator 3.1 (Applied Biosystems) and visualized on an ABI 3100 DNA Sequencer Data analysis For each locus, all sequences were edited using Chromas Pro 1.42 (Technelysium Pty. Ltd., Australia). The sequences were aligned initially with the program ClustalW (Thompson et al. 1994) in the BioEdit Sequence Alignment Editor (Hall 1999) and adjusted manually. For nuclear loci, heterozygous sites were identified from the presence of two equal height peaks in the chromatograms. The program PHASE 2.1 (Stephens and Scheet 2005) was used to resolve the haplotypes from the unphased genotype data when a sequence contained multiple heterozygous sites. A small subset of individuals that did not sequence well directly or that were needed for phasing were cloned into the PCR

31 TOPO TA cloning vector (Invitrogen) and sequenced in both directions using M13 primers. This was done to eliminate the possibility of using paralogous genes for analysis and to verify that statistically inferred haplotypes were correct. For each locus, the number of segregating sites (S), nucleotide diversity (π), haplotype diversity (H d ), Tajima's D statistic (Tajima 1989), Fu and Li s F* statistic (Fu and Li 1993) and minimum number of recombination events (R M, (Hudson and Kaplan 1985) were estimated using the DnaSP 5.0 software package (Librado and Rozas 2009). Deviation from the standard neutral model was tested using Tajima s D and Fu and Li s F* statistic, and significance was assessed by conducting 1000 coalescent simulations. The Hudson-Kreitman-Aguade (HKA) test (Hudson et al. 1987) implemented in DnaSP was conducted on loci that deviated from neutral predictions according to Tajima s D and Fu and Li s F* test. The HKA test compares observed and expected number of segregating sites within a species and the number of pairwise differences between species at two or more loci, and is more sensitive in detecting natural selection than neutrality tests based on site frequency spectrum (Zhai et al. 2009). I also conducted the fourgamete test (Hudson and Kaplan 1985) implemented in DnaSP to determine whether recombination needs to be incorporated into the coalescent model for population genetic parameter estimates. I constructed median-joining haplotype networks for each locus using NETWORK (available at to explore relationships among sequences at each locus. The goal of constructing haplotype 22

32 networks was to determine whether a) haplotypes segregate according to geographic locations and b) if there are signatures of recent population expansion. Therefore, haplotype networks function as a qualitative first pass test for exploring population structure and expansion. To determine historical population expansion events, mismatch distributions were calculated using the program Arlequin 3.1 (Excoffier et al. 2005). The expected distribution under a sudden demographic expansion model (Rogers and Harpending 1992) was generated using 1000 parametric bootstrap replicates. The sum-of-squared deviations (SSD) between observed and expected mismatch distribution were computed and significant deviation from the demographic expansion model was assessed by calculating the proportion of simulations producing a larger SSD than the observed SSD (α = 0.05). To determine the genetic structure of the common chaffinch, I used the Bayesian genetic clustering program Structure (Pritchard et al. 2000), which utilizes allele frequencies from multiple loci to assign sampled individuals to genetic clusters. Data were coded as haplotypes. Analysis was conducted with a burn-in of 400,000 and two million MCMC iterations after the burn-in under admixture model with correlated allele frequencies (Falush et al. 2003). Ten independent runs were conducted at each K with different seed numbers to calculate Ln [Pr(X K)]. Simulations were run from K=1 to K=10. I used the coalescent genealogy sampler program LAMARC (Kuhner and Smith 2007) to estimate population genetic parameters theta (θ), migration rates (M) and population growth parameter (g), and incorporated recombination into the model. Two 23

33 Bayesian replicates with four simultaneous searches with adaptive heating were conducted. The first 10,000 trees were discarded as burn-in, and then 50,000 trees were collected for parameter estimates by sampling every 50 th tree. The best model of nucleotide sequence evolution for each locus was identified using Akaike Information Criterion (AIC) in MrModeltest 2.3 (Nylander 2004) and specified for the analysis in LAMARC. Overall curve files for each parameter were inspected to ensure that parameter space was searched adequately and estimates were reliable. 2.4 Results Summary statistics and neutrality tests In general, the number of segregating sites (S) and the average pair-wise number of nucleotide differences per site between sequences (π) was higher in continental populations than island populations (Table 2). The Canary island populations harbored the least amount of genetic diversity, while the continental European F. c. coelebs population was the most diverse. The diversity statistics (S, π, H d ) among the three Canaries populations were very similar. In Atlantic islands, the Azores appears to be the most genetically diverse, but the Madeira population is close to the Azores population in the level of genetic diversity. The F. c. coelebs and F. c. gengleri populations had comparable numbers of segregating sites and nucleotide diversity at most loci. In the African continent, the F. c. africana population appears to be slightly more genetically diverse than the F. c. spodiogenys population. Recombination was detected for all examined loci except for the EF1a locus. The B-Actin locus contained the highest 24

34 number of recombination events as detected by the four gamete test. Only one recombination event was detected for the GAPD, PTPN 12, TROP and UBIQ loci. The results from Tajima s D and Fu and Li s F* neutrality tests were not significant (α =0.05) except in a few cases; suggesting conformity to the neutral coalescent model. In general, Tajima s D and Fu and Li s F* statistic for most populations were negative but not significant (Table 2). Fu and Li s F* statistic was negative for at least seven of the 10 loci in F. c. africana, F. c. spodiogenys, F. c. coelebs, F. c. gengleri, F. c. maderensis and F. c. palmae populations. Fu and Li s F* was negative across all loci for the F. c. coelebs population. Both Tajima s D and Fu and Li s F* were significantly negative for the F. c. palmae population at the EF1α locus (D = , F* = ), and the F. c. coelebs population at the L27331 locus (D= , F* = ). At the TROP locus, Tajima s D was significantly negative for the F. c. maderensis population (D = ) and Fu and Li s F* was significantly negative for the F. c. ombriosa population (F* = ). Since both these tests are sensitive to demographic history, the HKA test was conducted with Blue chaffinch as the outgroup. The HKA test did not detect significant deviation from neutrality (χ 2 = 0.215, p = 0.64). Both Tajima s D and Fu and Li s F* were significantly positive for the control region of mtdna in the F. c. moreletti population in the Azores (D = 2.186, F* =1.780). 25

35 Table 2: Population genetic summary statistics Locus Population N S π H d Tajima's D Fu and Li's F* ACON F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti ANON OH F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti B- ACT F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti EF1α F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae * * 0 F. c. moreletti GAPD F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs R M 26

36 F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti L27331 F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs * * 1 F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti PTPN12 F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti TROP F. c. canariensis F. c. maderensis * F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa * 0 F. c. palmae F. c. moreletti UBIQ F. c. canariensis F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs F. c. gengleri F. c. ombriosa F. c. palmae F. c. moreletti CR F. c. canariensis (1) F. c. maderensis F. c. africana F. c. spodiogenys F. c. coelebs (1) F. c. gengleri

37 F. c. ombriosa F. c. palmae F. c. moreletti * 1.78* (1) N: Sample size S: Number of segregating sites π: Average number of nucleotide differences per site between two sequences H d : Haplotype (gene) diversity Tajima's D: Tajima s D statistic (Tajima 1989) calculated in DnaSP. * p<0.05 Fu and Li's F*: Fu and Li's F* statistic (Fu and Li 1993) calculated in DnaSP. * p<0.05 R M : Minimum number of recombinant events estimated using four-gamete test (Hudson and Kaplan 1985) Haplotype networks The mtdna haplotype network (Figure 2) shows a clear distinction between Atlantic island and continental populations. There is no haplotype sharing between the islands and the continents, and five mutational steps separate the two groups. In the Atlantic islands, the birds from the Canaries, Madeira and Azores islands are separated into different haplo-groups with little haplotype sharing. Haplotype 4 is shared between the Azores and Madeira populations while haplotype 2 is shared between Madeira and Canary Islands. Within the Canary Islands, the three populations appear to segregate into distinct haplotypes with no haplotype sharing. The continental haplotype relationships are slightly more complex than islands. Haplotype 9 is shared between all four putative continental populations. Haplotypes 15 and 21 are shared between F. c. gengleri and F. c. coelebs populations. There are many low frequency haplotypes, mostly belonging to the F. c. coelebs population. 28

Colonization History of Atlantic Island Common Chaffinches (Fringilla coelebs) Revealed by Mitochondrial DNA

Colonization History of Atlantic Island Common Chaffinches (Fringilla coelebs) Revealed by Mitochondrial DNA Molecular Phylogenetics and Evolution Vol. 11, No. 2, March, pp. 201 212, 1999 Article ID mpev.1998.0552, available online at http://www.idealibrary.com on Colonization History of Atlantic Island Common

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Molecular Markers, Natural History, and Evolution

Molecular Markers, Natural History, and Evolution Molecular Markers, Natural History, and Evolution Second Edition JOHN C. AVISE University of Georgia Sinauer Associates, Inc. Publishers Sunderland, Massachusetts Contents PART I Background CHAPTER 1:

More information

Intraspecific gene genealogies: trees grafting into networks

Intraspecific gene genealogies: trees grafting into networks Intraspecific gene genealogies: trees grafting into networks by David Posada & Keith A. Crandall Kessy Abarenkov Tartu, 2004 Article describes: Population genetics principles Intraspecific genetic variation

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

7. Tests for selection

7. Tests for selection Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the

More information

Estimating Evolutionary Trees. Phylogenetic Methods

Estimating Evolutionary Trees. Phylogenetic Methods Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009 Gene Genealogies Coalescence Theory Annabelle Haudry Glasgow, July 2009 What could tell a gene genealogy? How much diversity in the population? Has the demographic size of the population changed? How?

More information

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information # Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either

More information

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences Indian Academy of Sciences A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences ANALABHA BASU and PARTHA P. MAJUMDER*

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic

More information

Lecture 11 Friday, October 21, 2011

Lecture 11 Friday, October 21, 2011 Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system

More information

Evolution Problem Drill 10: Human Evolution

Evolution Problem Drill 10: Human Evolution Evolution Problem Drill 10: Human Evolution Question No. 1 of 10 Question 1. Which of the following statements is true regarding the human phylogenetic relationship with the African great apes? Question

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

Curriculum Links. AQA GCE Biology. AS level

Curriculum Links. AQA GCE Biology. AS level Curriculum Links AQA GCE Biology Unit 2 BIOL2 The variety of living organisms 3.2.1 Living organisms vary and this variation is influenced by genetic and environmental factors Causes of variation 3.2.2

More information

Applications of Genetics to Conservation Biology

Applications of Genetics to Conservation Biology Applications of Genetics to Conservation Biology Molecular Taxonomy Populations, Gene Flow, Phylogeography Relatedness - Kinship, Paternity, Individual ID Conservation Biology Population biology Physiology

More information

Anatomy of a species tree

Anatomy of a species tree Anatomy of a species tree T 1 Size of current and ancestral Populations (N) N Confidence in branches of species tree t/2n = 1 coalescent unit T 2 Branch lengths and divergence times of species & populations

More information

Genetic Drift in Human Evolution

Genetic Drift in Human Evolution Genetic Drift in Human Evolution (Part 2 of 2) 1 Ecology and Evolutionary Biology Center for Computational Molecular Biology Brown University Outline Introduction to genetic drift Modeling genetic drift

More information

Chapter 16: Reconstructing and Using Phylogenies

Chapter 16: Reconstructing and Using Phylogenies Chapter Review 1. Use the phylogenetic tree shown at the right to complete the following. a. Explain how many clades are indicated: Three: (1) chimpanzee/human, (2) chimpanzee/ human/gorilla, and (3)chimpanzee/human/

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci

Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1-2010 Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci Elchanan Mossel University of

More information

Modern Evolutionary Classification. Section 18-2 pgs

Modern Evolutionary Classification. Section 18-2 pgs Modern Evolutionary Classification Section 18-2 pgs 451-455 Modern Evolutionary Classification In a sense, organisms determine who belongs to their species by choosing with whom they will mate. Taxonomic

More information

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall

More information

STEM-hy: Species Tree Estimation using Maximum likelihood (with hybridization)

STEM-hy: Species Tree Estimation using Maximum likelihood (with hybridization) STEM-hy: Species Tree Estimation using Maximum likelihood (with hybridization) Laura Salter Kubatko Departments of Statistics and Evolution, Ecology, and Organismal Biology The Ohio State University kubatko.2@osu.edu

More information

Evolutionary Significant Units (ESUs) & Management Units (MUs)

Evolutionary Significant Units (ESUs) & Management Units (MUs) Evolutionary Significant Units (ESUs) & Management Units (MUs) Diversity is Diverse and Complex Defining Management Units Within Species Genetic Distinctiveness & ESU s definition Measuring & Managing

More information

PHYLOGENY AND SYSTEMATICS

PHYLOGENY AND SYSTEMATICS AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study

More information

Evolutionary Patterns, Rates, and Trends

Evolutionary Patterns, Rates, and Trends Evolutionary Patterns, Rates, and Trends Macroevolution Major patterns and trends among lineages Rates of change in geologic time Comparative Morphology Comparing body forms and structures of major lineages

More information

Evolution Test Review

Evolution Test Review Name Evolution Test Review Period 1) A group of interbreeding organisms (a species) living in a given area is called population 2) Give an example of a species. Ex. One wolf Give an example of a population.

More information

Supporting Information

Supporting Information Supporting Information Hammer et al. 10.1073/pnas.1109300108 SI Materials and Methods Two-Population Model. Estimating demographic parameters. For each pair of sub-saharan African populations we consider

More information

Ph ylogeography. A guide to the study of the spatial distribution of Seahorses. By Leila Mougoui Bakhtiari

Ph ylogeography. A guide to the study of the spatial distribution of Seahorses. By Leila Mougoui Bakhtiari Ph ylogeography A guide to the study of the spatial distribution of Seahorses By Leila Mougoui Bakhtiari Contents An Introduction to Phylogeography JT Bohem s Resarch Map of erectu s migration Conservation

More information

I. Short Answer Questions DO ALL QUESTIONS

I. Short Answer Questions DO ALL QUESTIONS EVOLUTION 313 FINAL EXAM Part 1 Saturday, 7 May 2005 page 1 I. Short Answer Questions DO ALL QUESTIONS SAQ #1. Please state and BRIEFLY explain the major objectives of this course in evolution. Recall

More information

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17 Chapter 5 Evolution of Biodiversity CHAPTER INTRO: The Dung of the Devil Read and Answer Questions Provided Module 14 The Biodiversity of Earth After reading this module you should be able to understand

More information

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012 Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012 Last Time Sequence data and quantification of variation Infinite sites model Nucleotide diversity (π) Sequence-based

More information

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation Speciation Today s OUTLINE: (1) Geographic Mechanisms of Speciation (What circumstances lead to the formation of new species?) (2) Species Concepts (How are Species Defined?) Mechanisms of Speciation Last

More information

Estimating Species Phylogeny from Gene-Tree Probabilities Despite Incomplete Lineage Sorting: An Example from Melanoplus Grasshoppers

Estimating Species Phylogeny from Gene-Tree Probabilities Despite Incomplete Lineage Sorting: An Example from Melanoplus Grasshoppers Syst. Biol. 56(3):400-411, 2007 Copyright Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150701405560 Estimating Species Phylogeny from Gene-Tree Probabilities

More information

Coalescent based demographic inference. Daniel Wegmann University of Fribourg

Coalescent based demographic inference. Daniel Wegmann University of Fribourg Coalescent based demographic inference Daniel Wegmann University of Fribourg Introduction The current genetic diversity is the outcome of past evolutionary processes. Hence, we can use genetic diversity

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

Phylogeography and genetic differentiation between Loxigilla noctis and L. barbadensis in the Lesser Antilles

Phylogeography and genetic differentiation between Loxigilla noctis and L. barbadensis in the Lesser Antilles Phylogeography and genetic differentiation between Loxigilla noctis and L. barbadensis in the Lesser Antilles Sophie Arnaud-Haond 1, Carla Daniel 2, Sébastien Motreuil 3, Julia Horrocks 2 & Frank Cézilly

More information

ESTIMATING A GEOGRAPHICALLY EXPLICIT MODEL OF POPULATION DIVERGENCE

ESTIMATING A GEOGRAPHICALLY EXPLICIT MODEL OF POPULATION DIVERGENCE ORIGINAL ARTICLE doi:10.1111/j.1558-5646.2007.00043.x ESTIMATING A GEOGRAPHICALLY EXPLICIT MODEL OF POPULATION DIVERGENCE L. Lacey Knowles 1,2 and Bryan C. Carstens 1,3 1 Department of Ecology and Evolutionary

More information

Neutral Theory of Molecular Evolution

Neutral Theory of Molecular Evolution Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation

More information

Gene Pool The combined genetic material for all the members of a population. (all the genes in a population)

Gene Pool The combined genetic material for all the members of a population. (all the genes in a population) POPULATION GENETICS NOTES Gene Pool The combined genetic material for all the members of a population. (all the genes in a population) Allele Frequency The number of times a specific allele occurs in a

More information

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale evolutionary changes such as speciation events, origin of

More information

Review of molecular biology

Review of molecular biology Review of molecular biology DNA is into RNA, which is into protein. What mrna sequence would be transcribed from the DNA template CTA? What sequence of trna would be attracted by the above mrna sequence?

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

CONSERVATION AND THE GENETICS OF POPULATIONS

CONSERVATION AND THE GENETICS OF POPULATIONS CONSERVATION AND THE GENETICS OF POPULATIONS FredW.Allendorf University of Montana and Victoria University of Wellington and Gordon Luikart Universite Joseph Fourier, CNRS and University of Montana With

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

Statistical phylogeography

Statistical phylogeography Molecular Ecology (2002) 11, 2623 2635 Statistical phylogeography Blackwell Science, Ltd L. LACEY KNOWLES and WAYNE P. MADDISON Department of Ecology and Evolutionary Biology, University of Arizona, Tucson,

More information

Genetic Variation in Finite Populations

Genetic Variation in Finite Populations Genetic Variation in Finite Populations The amount of genetic variation found in a population is influenced by two opposing forces: mutation and genetic drift. 1 Mutation tends to increase variation. 2

More information

Population Genetics of Modern Human Evolution

Population Genetics of Modern Human Evolution Population Genetics of Modern Human Evolution John H Relethford, State University of New York College at Oneonta, Oneonta, New York, USA Rosalind M Harding, University of Oxford, Oxford, UK The aim of

More information

NOTES CH 24: The Origin of Species

NOTES CH 24: The Origin of Species NOTES CH 24: The Origin of Species Species Hummingbirds of Costa Rica SPECIES: a group of individuals that mate with one another and produce fertile offspring; typically members of a species appear similar

More information

Understanding How Stochasticity Impacts Reconstructions of Recent Species Divergent History. Huateng Huang

Understanding How Stochasticity Impacts Reconstructions of Recent Species Divergent History. Huateng Huang Understanding How Stochasticity Impacts Reconstructions of Recent Species Divergent History by Huateng Huang A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor

More information

e.g. population: 500, two alleles: Red (R) and White (r). Total: 1000 genes for flower color in the population

e.g. population: 500, two alleles: Red (R) and White (r). Total: 1000 genes for flower color in the population The Evolution of Populations What is Evolution? A change over time in the genetic composition of a population Human evolution The gene pool Is the total aggregate of genes for a particular trait in a population

More information

Microevolution (Ch 16) Test Bank

Microevolution (Ch 16) Test Bank Microevolution (Ch 16) Test Bank Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. Which of the following statements describes what all members

More information

Evolutionary change. Evolution and Diversity. Two British naturalists, one revolutionary idea. Darwin observed organisms in many environments

Evolutionary change. Evolution and Diversity. Two British naturalists, one revolutionary idea. Darwin observed organisms in many environments Evolutionary change Evolution and Diversity Ch 13 How populations evolve Organisms change over time In baby steps Species (including humans) are descended from other species Two British naturalists, one

More information

Evolution Problem Drill 09: The Tree of Life

Evolution Problem Drill 09: The Tree of Life Evolution Problem Drill 09: The Tree of Life Question No. 1 of 10 Question 1. The age of the Earth is estimated to be about 4.0 to 4.5 billion years old. All of the following methods may be used to estimate

More information

Classical Selection, Balancing Selection, and Neutral Mutations

Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected

More information

GIS Applications to Museum Specimens

GIS Applications to Museum Specimens GIS Applications to Museum Specimens Joseph Grinnell (1877 1939) At this point I wish to emphasize what I believe will ultimately prove to be the greatest value of our museum. This value will not, however,

More information

Chapter 7: Models of discrete character evolution

Chapter 7: Models of discrete character evolution Chapter 7: Models of discrete character evolution pdf version R markdown to recreate analyses Biological motivation: Limblessness as a discrete trait Squamates, the clade that includes all living species

More information

Name: Hour: Teacher: ROZEMA. Inheritance & Mutations Connected to Speciation

Name: Hour: Teacher: ROZEMA. Inheritance & Mutations Connected to Speciation Name: Hour: Teacher: ROZEMA Inheritance & Mutations Connected to Speciation Let s Review What We Already Know: What Have We Learned? Lesson 26: PI 1 (Projected Image) - Human Karyotype (image from https://en.wikipedia.org/wiki/karyotype#/media/file:nhgri_human_male_karyotype.png)

More information

Evolution and Natural Selection (16-18)

Evolution and Natural Selection (16-18) Evolution and Natural Selection (16-18) 3 Key Observations of Life: 1) Shared Characteristics of Life (Unity) 2) Rich Diversity of Life 3) Organisms are Adapted to their Environment These observations

More information

Theory a well supported testable explanation of phenomenon occurring in the natural world.

Theory a well supported testable explanation of phenomenon occurring in the natural world. Evolution Theory of Evolution Theory a well supported testable explanation of phenomenon occurring in the natural world. Evolution the process by which modern organisms changed over time from ancient common

More information

Organizing Life s Diversity

Organizing Life s Diversity 17 Organizing Life s Diversity section 2 Modern Classification Classification systems have changed over time as information has increased. What You ll Learn species concepts methods to reveal phylogeny

More information

Fields connected to Phylogeography Microevolutionary disciplines Ethology Demography Population genetics

Fields connected to Phylogeography Microevolutionary disciplines Ethology Demography Population genetics Stephen A. Roussos Fields connected to Phylogeography Microevolutionary disciplines Ethology Demography Population genetics Macrevolutionary disciplines Historical geography Paleontology Phylogenetic biology

More information

Quartet Inference from SNP Data Under the Coalescent Model

Quartet Inference from SNP Data Under the Coalescent Model Bioinformatics Advance Access published August 7, 2014 Quartet Inference from SNP Data Under the Coalescent Model Julia Chifman 1 and Laura Kubatko 2,3 1 Department of Cancer Biology, Wake Forest School

More information

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution 15.2 Intro In biology, evolution refers specifically to changes in the genetic makeup of populations over time.

More information

5/31/17. Week 10; Monday MEMORIAL DAY NO CLASS. Page 88

5/31/17. Week 10; Monday MEMORIAL DAY NO CLASS. Page 88 Week 10; Monday MEMORIAL DAY NO CLASS Page 88 Week 10; Wednesday Announcements: Family ID final in lab Today Final exam next Tuesday at 8:30 am here Lecture: Species concepts & Speciation. What are species?

More information

A. Incorrect! Form is a characteristic used in the morphological species concept.

A. Incorrect! Form is a characteristic used in the morphological species concept. CLEP Biology - Problem Drill 23: Evolutionary Processes No. 1 of 10 The biological-species concept is based on. (A) Form. (B) Similar size. (C) Similar appearance to all other individuals in the population.

More information

4/4/2017. Extrinsic Isolating Barriers. 1. Biological species concept: 2. Phylogenetic species concept:

4/4/2017. Extrinsic Isolating Barriers. 1. Biological species concept: 2. Phylogenetic species concept: Chapter 13 The origin of species 13.1 What Is a Species? p. 414 Ways to identify species 1. Biological species concept: 1. There are many different concepts of species 2. Species are important taxonomic

More information

Origin of an idea about origins

Origin of an idea about origins Origin of an idea about origins Biological evolution is the process of change during the course of time because of the alteration of the genotype and the transfer of these altered genes to the next generation.

More information

The Mechanisms of Evolution

The Mechanisms of Evolution The Mechanisms of Evolution Figure.1 Darwin and the Voyage of the Beagle (Part 1) 2/8/2006 Dr. Michod Intro Biology 182 (PP 3) 4 The Mechanisms of Evolution Charles Darwin s Theory of Evolution Genetic

More information

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility SWEEPFINDER2: Increased sensitivity, robustness, and flexibility Michael DeGiorgio 1,*, Christian D. Huber 2, Melissa J. Hubisz 3, Ines Hellmann 4, and Rasmus Nielsen 5 1 Department of Biology, Pennsylvania

More information

Unit 7: Evolution Guided Reading Questions (80 pts total)

Unit 7: Evolution Guided Reading Questions (80 pts total) AP Biology Biology, Campbell and Reece, 10th Edition Adapted from chapter reading guides originally created by Lynn Miriello Name: Unit 7: Evolution Guided Reading Questions (80 pts total) Chapter 22 Descent

More information

Evolution of Populations

Evolution of Populations Evolution of Populations Gene Pools 1. All of the genes in a population - Contains 2 or more alleles (forms of a gene) for each trait 2. Relative frequencies - # of times an allele occurs in a gene pool

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

Past climate change on Sky Islands drives novelty in a core developmental gene network and its phenotype

Past climate change on Sky Islands drives novelty in a core developmental gene network and its phenotype Supplementary tables and figures Past climate change on Sky Islands drives novelty in a core developmental gene network and its phenotype Marie-Julie Favé 1, Robert A. Johnson 2, Stefan Cover 3 Stephan

More information

Biology Chapter 15 Evolution Notes

Biology Chapter 15 Evolution Notes Biology Chapter 15 Evolution Notes Section 1: Darwin's Theory of Evolution by Natural Selection Charles Darwin- English naturalist that studied animals over a number of years before developing the theory

More information

To link to this article: DOI: / URL:

To link to this article: DOI: / URL: This article was downloaded by:[ohio State University Libraries] [Ohio State University Libraries] On: 22 February 2007 Access Details: [subscription number 731699053] Publisher: Taylor & Francis Informa

More information

Biology 110 Survey of Biology. Quizzam

Biology 110 Survey of Biology. Quizzam 1. Mendel conducted his most memorable experiments on A) peas. B) roses. C) guinea pigs. D) fruit flies. E) clones. 2. Varieties of plants in which self-fertilization produces offspring that are identical

More information

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9 Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic

More information

CoalFace: a graphical user interface program for the simulation of coalescence

CoalFace: a graphical user interface program for the simulation of coalescence 114 Chapter 6 CoalFace: a graphical user interface program for the simulation of coalescence I ve never had a conflict between teaching and research as some people do because when I m teaching, I m doing

More information

Molecular Phylogenetics and Evolution

Molecular Phylogenetics and Evolution Molecular Phylogenetics and Evolution 49 (2008) 277 291 Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev In situ genetic differentiation

More information

Conservation Genetics. Outline

Conservation Genetics. Outline Conservation Genetics The basis for an evolutionary conservation Outline Introduction to conservation genetics Genetic diversity and measurement Genetic consequences of small population size and extinction.

More information

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation Speciation Today s OUTLINE: (1) Geographic Mechanisms of Speciation (What circumstances lead to the formation of new species?) (2) Species Concepts (How are Species Defined?) Mechanisms of Speciation Last

More information

Chapter 22: Descent with Modification 1. BRIEFLY summarize the main points that Darwin made in The Origin of Species.

Chapter 22: Descent with Modification 1. BRIEFLY summarize the main points that Darwin made in The Origin of Species. AP Biology Chapter Packet 7- Evolution Name Chapter 22: Descent with Modification 1. BRIEFLY summarize the main points that Darwin made in The Origin of Species. 2. Define the following terms: a. Natural

More information

Taming the Beast Workshop

Taming the Beast Workshop Workshop and Chi Zhang June 28, 2016 1 / 19 Species tree Species tree the phylogeny representing the relationships among a group of species Figure adapted from [Rogers and Gibbs, 2014] Gene tree the phylogeny

More information

The Origin of Species

The Origin of Species The Origin of Species Introduction A species can be defined as a group of organisms whose members can breed and produce fertile offspring, but who do not produce fertile offspring with members of other

More information

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Tiziana Vonlanthen, Laurin Müller 27.10.15 1 Second paper: Origin and Domestication of the Fungal Wheat Pathogen

More information

A A A A B B1

A A A A B B1 LEARNING OBJECTIVES FOR EACH BIG IDEA WITH ASSOCIATED SCIENCE PRACTICES AND ESSENTIAL KNOWLEDGE Learning Objectives will be the target for AP Biology exam questions Learning Objectives Sci Prac Es Knowl

More information

Chapter 14 The Origin of Species

Chapter 14 The Origin of Species Chapter 14 The Origin of Species PowerPoint Lectures Campbell Biology: Concepts & Connections, Eighth Edition REECE TAYLOR SIMON DICKEY HOGAN Lecture by Edward J. Zalisko Adaptations Biological Adaptation

More information

that of Phylotree.org, mtdna tree Build 1756 (Supplementary TableS2). is resulted in 78 individuals allocated to the hg B4a1a1 and three individuals to hg Q. e control region (nps 57372 and nps 1602416526)

More information

Microsatellite data analysis. Tomáš Fér & Filip Kolář

Microsatellite data analysis. Tomáš Fér & Filip Kolář Microsatellite data analysis Tomáš Fér & Filip Kolář Multilocus data dominant heterozygotes and homozygotes cannot be distinguished binary biallelic data (fragments) presence (dominant allele/heterozygote)

More information

Conceptually, we define species as evolutionary units :

Conceptually, we define species as evolutionary units : Bio 1M: Speciation 1 How are species defined? S24.1 (2ndEd S26.1) Conceptually, we define species as evolutionary units : Individuals within a species are evolving together Individuals of different species

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information