Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication

Size: px
Start display at page:

Download "Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication"

Transcription

1 Buffering of crucial functions by paleologous duplicated genes may contribute cyclicality to angiosperm genome duplication Brad A. Chapman*, John E. Bowers*, Frank A. Feltus*, and Andrew H. Paterson* *Plant Genome Mapping Laboratory and Departments of Plant Biology, Genetics, and Crop and Soil Science, University of Georgia, Athens, GA Edited by Tomoko Ohta, National Institute of Genetics, Mishima, Japan, and approved December 8, 2005 (received for review September 13, 2005) Genome duplication followed by massive gene loss has permanently shaped the genomes of many higher eukaryotes, particularly angiosperms. It has long been believed that a primary advantage of genome duplication is the opportunity for the evolution of genes with new functions by modification of duplicated genes. If so, then patterns of genetic diversity among strains within taxa might reveal footprints of selection that are consistent with this advantage. Contrary to classical predictions that duplicated genes may be relatively free to acquire unique functionality, we find among both Arabidopsis ecotypes and Oryza subspecies that SNPs encode less radical amino acid changes in genes for which there exists a duplicated copy at a paleologous locus than in singleton genes. Preferential retention of duplicated genes encoding long complex proteins and their unexpectedly slow divergence (perhaps because of homogenization) suggest that a primary advantage of retaining duplicated paleologs may be the buffering of crucial functions. Functional buffering and functional divergence may represent extremes in the spectrum of duplicated gene fates. Functional buffering may be especially important during genomic turmoil immediately after genome duplication but continues to act 60 million years later, and its gradual deterioration may contribute cyclicality to genome duplication in some lineages. amino acid substitution Arabidopsis Oryza protein functional domain single-nucleotide polymorphism Recent sequencing efforts have revealed that genome duplication, a punctuational event in the evolution of a lineage, is more common than previously suspected. For example, detailed analyses of the sequences of the dicotyledonous angiosperm (dicot) Arabidopsis thaliana (1, 2) and monocotyledonous Oryza sativa (rice) (3, 4) show duplicated genome structure. Hypotheses about the timing and extent of individual events contributing to this structure range from our preference of a few discrete events (1, 3) to many segmental events (5), but all agree that much segmental duplication has occurred in both monocots and dicots since their divergence from a common ancestor, variously estimated at (6) to million years ago (7). Additional, more recent, duplications in specific angiosperm lineages (8) lead one to question whether this process may be not just episodic but truly cyclical in this group of taxa. Under classical models, gene duplication is proposed to be a primary source of genetic material available for the evolution of genes with new functions (9 11); one member of a duplicated gene pair may mutate and acquire unique functionality (12, 13), with the fitness of the organism insulated by the homoeolog. Such models would predict that, in natural populations, higher levels of polymorphism would occur in duplicated genes than in singletons [a prediction supported by recent theory (14)] and that the ability of duplicates to provide functional compensation for one another would erode as their functions diverged (15). However, recent findings raise perplexing questions about this classical functional divergence model. Analysis of 17 nonallelic duplicates in Xenopus laevis shows evidence of purifying selection on each duplicate gene (16). For three recently duplicated ( million years ago) Arabidopsis genes, both progenitor and derived copies show significantly reduced species-wide polymorphism (17). Duplicated yeast genes provide a discernible degree of functional compensation for a remarkably long period (18) and appear to undergo gene conversion (19). Realization of the potential benefits that may result from functional divergence of newly duplicated genes would naturally require a new polyploid to persist long enough for adaptive evolution to occur. Most higher organisms are thought to continuously produce aberrant unreduced gametes at low rates, but the rarity of genome duplication shows that the overwhelming majority do not survive. Study of both natural polyploids and synthetic polyploids formed by colchicine-based manipulation of interspecific hybrids reveals immediate consequences of polyploidization that ostensibly seem maladaptive, including loss and restructuring of low-copy DNA sequences (20 25), activation of genes and retrotransposons (26, 27), and gene silencing (28 31). The angiosperms are an outstanding higher-eukaryote model in which to elucidate consequences of genome duplication in view of the strong signal that remains from multiple genome duplications, naturally occurring replication afforded by independent duplications, and a host of genetic and molecular tools. The fates of duplicated genes may be closely associated with effective population size (N e ) for a taxon (32), meaning that recent insights from microbes (18, 33 35) may not extend well to crown eukaryotes. For example, recent empirical data (35) validate the prediction that subfunctionalization should be rare in organisms with large effective population size, such as yeast (12), and highlight the need for complementary investigations of larger-bodied (smaller N e ) eukaryotes, such as angiosperms. We explore for footprints of selection associated with genome duplication, investigating whether the evolution of genes in modern populations or recently diverged taxa are influenced by the presence of an ancient duplicated copy at an unlinked locus. Specifically, we distinguish strictly singleton genes from those that have retained duplicated copies as a result of ancient duplication and compare levels and patterns of polymorphism in the coding sequences of these two gene classes. Our results suggest that functional conservation, particularly of complex genes and functional domains, may occur in many paleologs that are still recognizable as such. Thus, functional divergence suggested by classical models and functional buffering suggested herein may be alternative outcomes in the spectrum of possible fates for paleologs. Several avenues may offer some reconciliation between these respective models. Conflict of interest statement: No conflicts declared. This paper was submitted directly (Track II) to the PNAS office. To whom correspondence should be addressed at: Plant Genome Mapping Laboratory, University of Georgia, 111 Riverbend Road, Athens, GA paterson@uga.edu by The National Academy of Sciences of the USA PNAS February 21, 2006 vol. 103 no. 8 cgi doi pnas

2 Fig. 1. Preferential retention of ancient duplicated genes. (a) Wholegenome duplications and SNP study systems. Colinear regions of genes resulting from ancient duplications (three in Arabidopsis and one in Oryza) are recognized based on the subset of cases in which both homoeologous copies of an ancestral gene are retained (X); however, in the majority of cases one homoeolog has been lost (O). For each gene, whether a duplicated copy is present or not, SNPs between recently diverged landraces (Arabidopsis) or subspecies (Oryza) are used to characterize alleles that have persisted in modern populations. (b) Arabidopsis genes present in duplicate are preferentially maintained through subsequent duplication events. Individual singleton and duplicated genes at corresponding locations in collinear regions resulting from the oldest ( ) duplication event were traced through subsequent and events (see text). Bold numbers indicate the total copy numbers of a single event-derived gene in the Arabidopsis genome sequence. Numbers in parentheses are the total gene counts within each class. The differences in final copy numbers between singleton and duplicated genes are statistically significant (P 0.01, 2 ). Results and Discussion Gene Fates Are Correlated Across Multiple Duplication Events. From the oldest ( ) duplicated chromosomal segments that we could discern (1), we first determined whether individual genes did or did not retain a paleologous copy in subsequent duplications (Fig. 1b). Indeed, -duplicated genes are more often retained in duplicate for both the (36.3% of the time versus 20.6% for singletons; binomial proportion, P ) and (30.2% versus 22.8%; P ) events. -Event-derived duplicated and singleton genes show a similar pattern; 41.8% of -duplicated genes are retained in duplicate during the event compared with only 27.0% of singleton genes (P ). The tendency for singletons to be repeatedly restored to singleton status after new duplications, together with the unexpected excess of duplicated genes that not only persist for long time periods but continue to spawn additional copies, suggest that gene retention after duplication is selective. Others have recently supported these results (36) and suggested differential reduplication of particular gene functional classes (37, 38). SNP Alleles That Persist in Populations Are More Conservative in Genes with Extant Paleologs Than in Singleton Genes. To investigate whether the evolution of genes in modern populations or recently diverged taxa are influenced by the presence of ancient duplicated copies at paleologous loci, we surveyed naturally occurring polymorphic alleles in large intraspecific SNP collections for Arabidopsis [37,344 (39)] and rice [384,341 (40)] (Fig. 1a). A total of 2,022 and 7,971 SNPs could be plotted to codon locations within genes in ancient duplicated regions of Arabidopsis and rice, respectively. This finding permitted investigation of whether SNP alleles show differential patterns of abundance or severity in duplicated genes versus singletons, the latter defined stringently to exclude genes with recent nonhomoelogous duplicated copies. The overall SNP rate is lower in Arabidopsis (in which landraces within a species were compared) than in rice (in which subspecies were compared) (Table 1). Because our experimental unit is the gene rather than the organism, identification of common trends associated with independent duplications in these divergent lineages offers a broader inference space than study of SNP frequencies in additional members of each taxon, although the latter would be a valuable future investigation. By placing SNPs in their context within codons, striking differences between singleton and duplicated genes became evident. In genes for which an ancient duplicated copy was present at a paleologous location, a larger proportion of SNPs were in the third codon position than in singletons. This observation held true across all duplication events in both Arabidopsis and rice (Fig. 2a). Indeed, SNPs caused an inferred amino acid change in 52%, 51%, and 50% of cases for,, and Arabidopsis singleton genes, respectively, but only 39%, 44%, and 47% of cases for corresponding duplicated genes. In rice, SNPs caused inferred amino acid changes in 61% of singletons versus 48% in duplicated genes. These differences are statistically significant for the Arabidopsis and rice duplications (P 0.001, 2 ) but not for the older Arabidopsis duplications. Consistent with the finding that a higher fraction of SNPs in duplicated genes were in the third codon position, the severity of amino acid changes resulting from SNPs in duplicated genes was less than in singletons. We used the Blosum80 substitution matrix to compare amino acids inferred to be encoded by alternative SNP alleles. For all cases, duplicated genes have a higher mean substitution value (Fig. 2b), indicating that the changes caused are less radical. This result is statistically significant for the Arabidopsis and rice duplications but not for the older Arabidopsis duplications. To preclude the possibility that erroneous gene predictions influence this result, the analyses were repeated with only the subsets of singleton and duplicated genes that matched an EST (E ) with nearly identical results. EVOLUTION Table 1. SNPs located within coding regions of duplicate and singleton genes Gene type No. of genes Mean gene size, bp No. of SNPs SNPs per kb Arabidopsis singleton 3,586 1, Arabidopsis duplicate 4,872 1, Arabidopsis singleton 1,953 1, Arabidopsis duplicate 502 1, Arabidopsis singleton 749 1, Arabidopsis duplicate 64 1, Oryza singleton 7, , Oryza duplicate 3,884 1, , SEM is included for each gene size, and 95% binomial confidence intervals for SNP frequencies are indicated. Chapman et al. PNAS February 21, 2006 vol. 103 no

3 Fig. 2. SNPs that have persisted in natural populations cause less severe inferred amino acid changes in ancient duplicated genes than in singletons. (a) Codon position of SNP substitutions. For all duplication events, a higher proportion of changes are concentrated in the third wobble position for duplicated genes than for singletons. (b) Severity of SNP-derived amino acid changes. Blosum80 substitution values were used as a measure of the severity of a change, with higher values indicating less severe changes. For all cases, mean changes in duplicated genes are less radical than in singletons. For the Arabidopsis and rice duplication events, these differences are statistically significant. Error bars indicate SEM. The conservative evolution of duplicated genes was evident not only qualitatively but also quantitatively. Gene pairs that were relatively similar tended to retain less severe SNPs (in terms of inferred amino acid changes) than more divergent gene pairs from the same event (i.e., of similar age) (Fig. 3). The conservative evolution of paleologs is all the more striking in view of the theoretical expectation that variation within species in duplicated genes should be greater than in single-copy genes (14). This finding further contradicts the expectations of traditional functional divergence models for polyploidy (9 11), in which radically altered alleles should persist more frequently in similar duplicated genes because of the presence of a redundant copy but rarely in members of more divergent pairs that may no longer have the same functions. Genes Encoding Long and Complex Proteins Are Preferentially Preserved in Duplicate and Evolve Conservatively. Genes for which an ancient duplicated copy has been preserved are % longer, on average, than singletons (Table 1). To investigate whether Fig. 3. SNPs that have persisted in natural populations cause less severe inferred amino acid changes in genes with a similar duplicated homoeolog than those with a more divergent duplicated homoeolog. For each duplication event, the distance between duplicated proteins was calculated by PROTDIST and plotted against Blosum80 substitution values measuring the severity of SNP changes. Error bars indicate the SEM for each Blosum80 substitution class. (Upper) and * (Lower) designate statistically different groups using Tukey Kramer multiple comparisons corrected for unequal sample sizes (P 0.05). (Data lacking the two indicated symbols are not statistically distinguishable from either of the other categories.) The statistically significant negative slope for both Arabidopsis and rice plots (both P ) indicate that, as the distance between duplicated genes decreases, SNPs are less severe. greater structural complexity could confer a selective advantage to preservation of their sequence and function, we first determined the percentage of each gene (coding region) covered by discernible Pfam database-defined protein domains (widely characterized functional units) and then plotted that percentage against Blosum80-based severity of inferred amino acid substitutions resulting from SNPs that have persisted in natural populations. In both singleton and duplicated proteins, higher domain coverage is associated with lower severity of amino acid changes encoded by SNPs (Fig. 4). Not only are duplicated genes generally longer than singletons, but a much higher fraction of the coding regions comprise identifiable domains, thus making duplicated genes doubly sensitive to nonsynonymous mutations. For the Arabidopsis event, an average of 20.5% of the coding regions of singletons and 44% of duplicates are covered by characterized domains. Similarly, rice singletons average 7.7% coverage by domains versus 41.3% for duplicated genes. The greater retention in duplicate of longer and more complex proteins, together with their lower tolerance of nonsynonymous mutations, suggest that the potential advantage conferred by the possibility of future neofunctionalization may be outweighed by the immediate benefits of buffering crucial functionality, i.e., ensuring that the functions of essential genes and domains are cgi doi pnas Chapman et al.

4 Fig. 4. Retained SNPs cause less radical amino acid changes in domain-rich proteins. The mean percentage of proteins contained within defined Pfam domains was plotted against the severity of SNP-caused changes, as measured by Blosum80 substitution values. Error bars indicate the SEM. * designates statistically different groups (P 0.05, Tukey Kramer). Although statistically significant groupings of data points could be resolved only for rice duplicated genes, significant positive slopes (P ) for three of the four lines (Arabidopsis duplicate and rice singleton and duplicate), with the fourth marginally missing significance (Arabidopsis singleton, P 0.074), strongly support the hypothesis that less radical changes are associated with proteins containing a higher percentage of domains. This finding appears true regardless of duplication status. met even after mutation of one copy by the presence of a second copy. The tendency for duplicated gene copies to tolerate less severe amino acid substitutions than singletons in natural populations is in contrast to a primary advantage of polyploidy being the freedom for duplicated genes to acquire new functions (9 11). However, this finding is consistent with recent results about the evolution of recently evolved duplicates (16, 17) and long-term functional compensation (18, 41, 42). Further testing of this hypothesis might derive from obtaining data for additional taxa that parallel the fitness data available for individual gene knockouts in yeast (18). Such genetic buffering may be especially important in survival of lineages experiencing the early genomic turmoil that immediately follows polyploid formation (20 25). However, our data show that such buffering still markedly affects diversity among ecotypes of Arabidopsis thaliana and subspecies of Oryza sativa, species that each trace to genome duplications that occurred 60 million years or more before speciation (1, 3). Do Homogenization Processes Act on Paleologs? What mechanisms might preserve the sequence of thousands of pairs of genes at paleologous sites across a genome for 60 million years or more? Occasional nonhomologous associations between chromosomes are observed during mitosis in many taxa, including rice (43), and Fig. 5. Domain regions of duplicated homoeologs show longer-thanexpected stretches of identical codons. The length distributions of the longest regions of identical codons found from alignments of duplicated pairs was compared with singletons aligned against pre- and postduplication homologs for both domain-containing and non-domain-containing regions; phylogenetic trees show the relative timing of Arabidopsis and rice duplication events. Box plots of the first quartile, median, and third quartile show the distribution of these regions, with the number of regions contributing to each plot in parentheses. Letters indicate statistically different groups (P 0.05, Tukey Kramer multiple comparisons) for each class of gene type and domain or nondomain region. Based on calculated substitution rates (Supporting Results and Table 2), identity stretches between duplicated genes would be expected to fall between the two flanking comparisons of singletons to homologs. This pattern is observed for nondomain regions in Arabidopsis and rice. In contrast, identity stretches between duplicated genes in domain regions are statistically indistinguishable from comparisons of singletons to taxa that diverged much more recently than the duplication event. might periodically permit homogenization processes to act between paleologs. Mechanisms such as gene repair (44) and gene conversion (45) have been suggested to occur between ancient duplicated genes (19) but can be difficult to quantify (46). Indeed, genome-wide characterization of duplicates via established methods provided ambiguous results. To attempt to circumvent problems with accurately detecting homogenization across an entire genome, we explored pairwise alignments of duplicated genes for unexpectedly long stretches of sequence identity. Paleologs were compared with one another, whereas corresponding singletons from the same polyploidization event were compared with their best homolog from species that diverged from the lineage before or after duplication, respectively, as detailed in Materials and Methods.In the absence of homogenization, the lengths of identical regions for duplicate pairs should fall between those of such pairs of singleton-to-homolog comparisons flanking the duplication event (Fig. 5; see also Supporting Results and Table 2, which are published as supporting information on the PNAS web site). Within this framework, the largest stretch of identical codons for each gene, representing the best evidence for potential homogenization, was compared in both domain and nondomain regions of singleton and duplicated genes. Longer-than-expected stretches of identity are found in domain regions of duplicated genes but not singleton genes, suggesting that homogenization processes may act between paleologs. For nondomain regions, the distribution of the largest identical regions for EVOLUTION Chapman et al. PNAS February 21, 2006 vol. 103 no

5 duplicated genes falls between that of the pre- and postduplication homolog versus singleton comparisons, as expected based on K S and K A values (Fig. 5; see also Supporting Results and Table 2). In contrast, domain regions of duplicates show a length distribution of identity stretches that is statistically equivalent to those in postduplication comparisons (Fig. 5). For example, the distribution of identity stretches between duplicated genes in domain regions of rice are slightly longer than those found between singletons and Sorghum, which is estimated to have diverged 20 million years more recently (3). Although the possibility of interaction between paleologous genes needs further study, the preservation of key domains by homogenization processes could offer a selective advantage to the retention of duplicated copies of genes that serve crucial functions. Loss of duplicated copies may permit the remaining singleton genes to evolve more rapidly, thus accelerating mutation rates in an adaptive manner by a mechanism that is explicable without projecting a specialized purpose into evolution (47). Homogenization may also cause molecular clocks based on substitution rates between duplicated genes to chronically underestimate the age of duplications, as suggested recently (23). This potential bias would affect many dating results, including some from our own laboratory (1, 3). Reconciliation Between the Models. Several avenues may offer some reconciliation between the classical functional divergence model for genome duplication (9 11) and an emerging functional buffering model. Our hypothesis that a primary advantage of retaining genes in duplicate may be the immediate buffering of crucial functionality in no way precludes the occasional evolution of unique functionality as one outcome in a spectrum of possibilities that is generally weighted in favor of buffering. Indeed, genes for which divergence was advantageous may have diverged long ago and are no longer recognizable as paleologs. Finally, the present results focus on changes in protein sequence and do not address changes in noncoding regulatory elements that can cause expression divergence (48). Polyploid buffering, along with recombination, may contribute to reduced accumulation of degenerative mutations via Müller s ratchet (49). Although proximal duplication has been suggested to primarily amplify genes with secondary functionality, whole-genome duplication amplifies, and may buffer, genes involved in essential processes, such as primary metabolism (50). Many predominantly clonally propagated angiosperms (such as banana and sugarcane) and apomicts (51) are recently formed polyploids; protection of critical functionality by duplication and preservation of essential genes would explain how, in the absence of recombination, they avoid degeneration via Müller s ratchet (49). Similarly, repair processes fostered by whole-genome duplication are also key to the extraordinary radiation tolerance of Deinococcus radiodurans (52). Might Erosion of Buffering Impart Cyclicality to Genome Duplication? Reciprocal buffering presumably erodes with time as ancient duplicated genes eventually diverge, potentially imparting cyclicality to genome duplication. For example, in Arabidopsis, we could only assert that SNP frequencies in duplicated genes were lower than those in singletons (39% versus 52%) for the most recently ( ) duplicated segments; the frequencies of SNPs encoding nonsynonymous changes in older Arabidopsis and duplicate genes are virtually indistinguishable from those for singletons (49% versus 51% and 47% versus 50%, respectively). In organisms such as angiosperms, in which there are few roadblocks to polyploidization (such as sex chromosomes), the selective advantage for newly formed polyploids to survive may increase as the buffering from previous genome duplication erodes. The emerging functional buffering model for polyploid genome evolution accommodates the ability of duplicated genes to innovate despite conservative rates of protein evolution. Genomes appear to have retained duplicated genes with slower evolutionary rates (53), homogenized protein domains, and higher connectedness in cellular networks (33). Duplicate gene evolution is thus a balance between maintaining critical gene components and diversification of the use of these components for evolutionary innovation. The identification of these critical components and characterization of their buffering capacity, mechanisms, and longevity may be important steps toward understanding the conditions under which large-scale duplications lead to successful taxa. Materials and Methods Identifying Duplicated and Singleton Genes. Duplicated genes were defined as reciprocal best BLAST hits at homoeologous locations within colinear regions of genes previously defined for the Arabidopsis (1) and Oryza duplicated segments (3). Singletons had no best BLAST hit at an E value of anywhere in the genome. Proximal duplications were removed based on having additional nonhomoeologous duplicate pairs. Duplicated pairs corresponding to a recent segmental duplication between rice chromosomes 11 and 12 were removed. Arabidopsis coding, genomic, and protein sequences were downloaded from TAIR (The Arabidopsis Information Resource, release TIGR (The Institute for Genomic Research) Rice Pseudomolecules and Genome Annotation Version 1.0 ( tdb e2k1 osa1) was used for rice analyses. SNPs. Arabidopsis SNPs were selected as described from 37,344 polymorphisms derived from whole-genome shotgun data for the Landsberg ecotype compared with the finished sequence of ecotype Columbia (39). Rice SNPs were from a whole-genome comparison between the japonica and indica subspecies (40). To locate SNPs within coding regions of duplicated and singleton genes, 20-bp sequences flanking the SNP were located in the genomic sequences from BLAST to identify potential genes and fuzznuc from the EMBOSS package to find exact positions. Genomic locations were translated into positions in the coding sequence by comparing alignments of genomic and coding sequence created with the EMBOSS program EST2GENOME. Positions of SNPs within codons and Blosum80 substitution values were determined by using the predicted coding sequence and previously determined positions. The entire process was automated with PYTHON scripts. Characterization of Gene Features. Distances between duplicate gene pairs were calculated with the PROTDIST program from PHYLIP 3.6A3, using the default Jones Taylor Thornton substitution matrix and a -distribution of rates across positions ( 0.9). Locations of protein domains within singleton and duplicated genes were determined with HMMPFAM from the HMMER package version 2.2g. Searches were conducted against the Pfam 11.0 database (7,255 models), retaining global matches with an E value of Analysis of Homogenization. For both Arabidopsis and Oryza, we used gene databases from taxa selected to flank the duplication event as determined by phylogenetic dating techniques (1, 3). Arabidopsis singletons used clustered EST databases from Plant- GDB (Plant Genome Database, for Brassica napus (which diverged from Arabidopsis after the duplication event) and Gossypium arboreum (which diverged from Arabidopsis before the duplication event). For Oryza singletons, gene databases were used from Sorghum bicolor (which diverged from Oryza after the duplication event) from PlantGDB and Musa (which diverged from Oryza before the duplication event) from cgi doi pnas Chapman et al.

6 PROMUSA ( Homologs from these databases were selected based on having a TBLASTN E value of and minimum alignment of 40 aa with the singleton protein sequences used as the query. For duplicate pairs and singletons with detected homologs, amino acid sequences were aligned using CLUSTALW version 1.83 and used to align the corresponding coding sequences with TRANALIGN from EMBOSS. Aligned sequences were then examined to identify stretches of identical codons without a substitution or gap. The longest region of identity within domains, if present, or in nondomain regions was chosen from each singleton and duplicate gene. The distributions were examined for statistical differences by ANOVA of Box-Cox-transformed data ( 1 3). Subsequently, Tukey Kramer comparisons corrected for unequal sample sizes were used to define statistically different groups. This work was supported by a predoctoral fellowship from the Howard Hughes Medical Institute (to B.A.C.) and by the Rockefeller Foundation (F.A.F. and A.H.P.) and the U.S. National Science Foundation (J.E.B. and A.H.P.). 1. Bowers, J. E., Chapman, B. A., Rong, J. K. & Paterson, A. H. (2003) Nature 422, Simillion, C., Vandepoele, K., Van Montagu, M. C. E., Zabeau, M. & Van de Peer, Y. (2002) Proc. Natl. Acad. Sci. USA 99, Paterson, A. H., Bowers, J. E. & Chapman, B. A. (2004) Proc. Natl. Acad. Sci. USA 101, Paterson, A., Bowers, J., Peterson, D., Estill, J. & Chapman, B. (2003) Curr. Opin. Genet. Dev. 13, Hughes, A. L. & Friedman, R. (2003) Genome Res. 13, Davies, T. J., Barraclough, T. G., Chase, M. W., Soltis, P. S., Soltis, D. E. & Savolainen, V. (2004) Proc. Natl. Acad. Sci. USA 101, Yang, Y. W., Lai, K. N., Tai, P. Y. & Li, W. H. (1999) J. Mol. Evol. 48, Blanc, G. & Wolfe, K. H. (2004) Plant Cell 16, Stephens, S. (1951) Adv. Genet. 4, Ohno, S. (1970) Evolution by Gene Duplication (Springer, Berlin). 11. Taylor, J. S. & Raes, J. (2004) Annu. Rev. Genet. 38, Lynch, M., O Hely, M., Walsh, B. & Force, A. (2001) Genetics 159, Tocchini-Valentini, G. D., Fruscoloni, P. & Tocchini-Valentini, G. P. (2005) Proc. Natl. Acad. Sci. USA 102, Innan, H. (2003) Genetics 163, Wagner, A. (2000) Nat. Genet. 24, Hughes, M. K. & Hughes, A. L. (1993) Mol. Biol. Evol. 10, Moore, R. C. & Purugganan, M. D. (2003) Proc. Natl. Acad. Sci. USA 100, Gu, Z. L., Steinmetz, L. M., Gu, X., Scharfe, C., Davis, R. W. & Li, W. H. (2003) Nature 421, Gao, L. Z. & Innan, H. (2004) Science 306, Kashkush, K., Feldman, M. & Levy, A. A. (2002) Genetics 160, Ozkan, H., Levy, A. A. & Feldman, M. (2001) Plant Cell 13, Shaked, H., Kashkush, K., Ozkan, H., Feldman, M. & Levy, A. A. (2001) Plant Cell 13, Feldman, M., Liu, B., Segal, G., Abbo, S., Levy, A. A. & Vega, J. M. (1997) Genetics 147, Song, K. M., Lu, P., Tang, K. L. & Osborn, T. C. (1995) Proc. Natl. Acad. Sci. USA 92, Ozkan, H., Levy, A. A. & Feldman, M. (2002) Isr. J. Plant Sci. 50, S65 S Kashkush, K., Feldman, M. & Levy, A. A. (2003) Nat. Genet. 33, O Neill, R. J. W., O Neill, M. J. & Graves, J. A. M. (2002) Nature 420, Chen, Z. J. & Pikaard, C. S. (1997) Genes Dev. 11, Chen, Z. J. & Pikaard, C. S. (1997) Proc. Natl. Acad. Sci. USA 94, Comai, L., Tyagi, A. P., Winter, K., Holmes-Davis, R., Reynolds, S. H., Stevens, Y. & Byers, B. (2000) Plant Cell 12, Lee, H. S. & Chen, Z. J. (2001) Proc. Natl. Acad. Sci. USA 98, Lynch, M. & Conery, J. S. (2003) Science 302, Wagner, A. (2001) Mol. Biol. Evol. 18, Conant, G. C. & Wagner, A. (2003) Genome Res. 13, Kellis, M., Birren, B. W. & Lander, E. S. (2004) Nature 428, Seoighe, C. & Gehring, C. (2004) Trends Genet. 20, Maere, S., De Bodt, S., Raes, J., Casneuf, T., Van Montagu, M., Kuiper, M. & Van de Peer, Y. (2005) Proc. Natl. Acad. Sci. USA 102, Blanc, G. & Wolfe, K. H. (2004) Plant Cell 16, Jander, G., Norris, S., Rounsley, S., Bush, D., Levin, I. & Last, R. (2002) Plant Physiol. 129, Feltus, F. A., Wan, J., Schulze, S. R., Estill, J. C., Jiang, N. & Paterson, A. H. (2004) Genome Res. 14, Conant, G. C. & Wagner, A. (2004) Proc. R. Soc. London Ser. B 271, Teichmann, S. A. & Babu, M. M. (2004) Nat. Genet. 36, Lawrence, W. J. C. (1931) Cytologia 2, Liu, L., Parekh-Olmedo, H. & Kmiec, E. B. (2003) Nat. Rev. Genet. 4, Newman, T. & Trask, B. J. (2003) Genome Res. 13, Zhang, L. Q., Vision, T. J. & Gaut, B. S. (2002) Mol. Biol. Evol. 19, Dickinson, W. J. & Seger, J. (1999) Nature 399, Makova, K. D. & Li, W. H. (2003) Genome Res. 13, Muller, H. J. (1932) Am. Nat. 66, Hooper, S. D. & Berg, O. G. (2003) Mol. Biol. Evol. 20, Bayer, R. J. & Stebbins, G. L. (1987) Syst. Bot. 12, Levin-Zaidman, S., Englander, J., Shimoni, E., Sharma, A. K., Minton, K. W. & Minsky, A. (2003) Science 299, Davis, J. C. & Petrov, D. A. (2004) PLoS Biol. 2, e55. EVOLUTION Chapman et al. PNAS February 21, 2006 vol. 103 no

Lecture Notes: BIOL2007 Molecular Evolution

Lecture Notes: BIOL2007 Molecular Evolution Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits

More information

Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics

Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics A. H. Paterson*, J. E. Bowers, and B. A. Chapman Plant Genome Mapping Laboratory, University

More information

Small RNA in rice genome

Small RNA in rice genome Vol. 45 No. 5 SCIENCE IN CHINA (Series C) October 2002 Small RNA in rice genome WANG Kai ( 1, ZHU Xiaopeng ( 2, ZHONG Lan ( 1,3 & CHEN Runsheng ( 1,2 1. Beijing Genomics Institute/Center of Genomics and

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

In 1996, when the research plant community decided to determine

In 1996, when the research plant community decided to determine The hidden duplication past of Arabidopsis thaliana Cedric Simillion, Klaas Vandepoele, Marc C. E. Van Montagu, Marc Zabeau, and Yves Van de Peer* Department of Plant Systems Biology, Flanders Interuniversity

More information

How to detect paleoploidy?

How to detect paleoploidy? Genome duplications (polyploidy) / ancient genome duplications (paleopolyploidy) How to detect paleoploidy? e.g. a diploid cell undergoes failed meiosis, producing diploid gametes, which selffertilize

More information

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information # Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either

More information

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly Comparative Genomics: Human versus chimpanzee 1. Introduction The chimpanzee is the closest living relative to humans. The two species are nearly identical in DNA sequence (>98% identity), yet vastly different

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law Ze Zhang,* Z. W. Luo,* Hirohisa Kishino,à and Mike J. Kearsey *School of Biosciences, University of Birmingham,

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

Big Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events?

Big Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events? Plant of the Day Cyperus esculentus - Cyperaceae Chufa (tigernut) 8,000 kg/ha, 720 kcal/sq m per month Top Crop for kcal productivity! One of the world s worst weeds Big Questions Is polyploidy an evolutionary

More information

The Origin of Species

The Origin of Species The Origin of Species Introduction A species can be defined as a group of organisms whose members can breed and produce fertile offspring, but who do not produce fertile offspring with members of other

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)

More information

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:

More information

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups

More information

Sex accelerates adaptation

Sex accelerates adaptation Molecular Evolution Sex accelerates adaptation A study confirms the classic theory that sex increases the rate of adaptive evolution by accelerating the speed at which beneficial mutations sweep through

More information

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection CHAPTER 23 THE EVOLUTIONS OF POPULATIONS Section C: Genetic Variation, the Substrate for Natural Selection 1. Genetic variation occurs within and between populations 2. Mutation and sexual recombination

More information

Model plants and their Role in genetic manipulation. Mitesh Shrestha

Model plants and their Role in genetic manipulation. Mitesh Shrestha Model plants and their Role in genetic manipulation Mitesh Shrestha Definition of Model Organism Specific species or organism Extensively studied in research laboratories Advance our understanding of Cellular

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern

More information

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility SWEEPFINDER2: Increased sensitivity, robustness, and flexibility Michael DeGiorgio 1,*, Christian D. Huber 2, Melissa J. Hubisz 3, Ines Hellmann 4, and Rasmus Nielsen 5 1 Department of Biology, Pennsylvania

More information

An Introduction to Sequence Similarity ( Homology ) Searching

An Introduction to Sequence Similarity ( Homology ) Searching An Introduction to Sequence Similarity ( Homology ) Searching Gary D. Stormo 1 UNIT 3.1 1 Washington University, School of Medicine, St. Louis, Missouri ABSTRACT Homologous sequences usually have the same,

More information

Classical Selection, Balancing Selection, and Neutral Mutations

Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected

More information

Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy

Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy Article Comparative genomics of Gossypium and Arabidopsis: Unraveling the consequences of both ancient and recent polyploidy Junkang Rong, 1 John E. Bowers, 1 Stefan R. Schulze, 1 Vijay N. Waghmare, 1

More information

Understanding relationship between homologous sequences

Understanding relationship between homologous sequences Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective

More information

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale evolutionary changes such as speciation events, origin of

More information

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

CHAPTERS 24-25: Evidence for Evolution and Phylogeny CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology

More information

Molecular evolution - Part 1. Pawan Dhar BII

Molecular evolution - Part 1. Pawan Dhar BII Molecular evolution - Part 1 Pawan Dhar BII Theodosius Dobzhansky Nothing in biology makes sense except in the light of evolution Age of life on earth: 3.85 billion years Formation of planet: 4.5 billion

More information

EVOLUTION. Evolution - changes in allele frequency in populations over generations.

EVOLUTION. Evolution - changes in allele frequency in populations over generations. EVOLUTION Evolution - changes in allele frequency in populations over generations. Sources of genetic variation: genetic recombination by sexual reproduction (produces new combinations of genes) mutation

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Fischer et al. BMC Plant Biology 2014, 14:151 RESEARCH ARTICLE Open Access Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer 1,2*, Jacques Dainat 3,6, Vincent Ranwez 3, Sylvain

More information

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab Date: Agenda Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab Ask questions based on 5.1 and 5.2 Quiz on 5.1 and 5.2 How

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

Impact of recurrent gene duplication on adaptation of plant genomes

Impact of recurrent gene duplication on adaptation of plant genomes Impact of recurrent gene duplication on adaptation of plant genomes Iris Fischer, Jacques Dainat, Vincent Ranwez, Sylvain Glémin, Jacques David, Jean-François Dufayard, Nathalie Chantret Plant Genomes

More information

Genomics and bioinformatics summary. Finding genes -- computer searches

Genomics and bioinformatics summary. Finding genes -- computer searches Genomics and bioinformatics summary 1. Gene finding: computer searches, cdnas, ESTs, 2. Microarrays 3. Use BLAST to find homologous sequences 4. Multiple sequence alignments (MSAs) 5. Trees quantify sequence

More information

Evolution Problem Drill 09: The Tree of Life

Evolution Problem Drill 09: The Tree of Life Evolution Problem Drill 09: The Tree of Life Question No. 1 of 10 Question 1. The age of the Earth is estimated to be about 4.0 to 4.5 billion years old. All of the following methods may be used to estimate

More information

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall

More information

1 Introduction. Abstract

1 Introduction. Abstract CBS 530 Assignment No 2 SHUBHRA GUPTA shubhg@asu.edu 993755974 Review of the papers: Construction and Analysis of a Human-Chimpanzee Comparative Clone Map and Intra- and Interspecific Variation in Primate

More information

Genomes and Their Evolution

Genomes and Their Evolution Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic

More information

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION Annu. Rev. Genomics Hum. Genet. 2003. 4:213 35 doi: 10.1146/annurev.genom.4.020303.162528 Copyright c 2003 by Annual Reviews. All rights reserved First published online as a Review in Advance on June 4,

More information

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION Using Anatomy, Embryology, Biochemistry, and Paleontology Scientific Fields Different fields of science have contributed evidence for the theory of

More information

Gene duplication and loss

Gene duplication and loss Gene duplication and loss Matthew Hahn Indiana University mwh@indiana.edu How many genes does a human have: a. 100,000 How many genes does a human have: a.

More information

WHERE DOES THE VARIATION COME FROM IN THE FIRST PLACE?

WHERE DOES THE VARIATION COME FROM IN THE FIRST PLACE? What factors contribute to phenotypic variation? The world s tallest man, Sultan Kosen (8 feet 1 inch) towers over the world s smallest, He Ping (2 feet 5 inches). WHERE DOES THE VARIATION COME FROM IN

More information

Lineage specific conserved noncoding sequences in plants

Lineage specific conserved noncoding sequences in plants Lineage specific conserved noncoding sequences in plants Nilmini Hettiarachchi Department of Genetics, SOKENDAI National Institute of Genetics, Mishima, Japan 20 th June 2014 Conserved Noncoding Sequences

More information

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the

More information

BIOLOGY STANDARDS BASED RUBRIC

BIOLOGY STANDARDS BASED RUBRIC BIOLOGY STANDARDS BASED RUBRIC STUDENTS WILL UNDERSTAND THAT THE FUNDAMENTAL PROCESSES OF ALL LIVING THINGS DEPEND ON A VARIETY OF SPECIALIZED CELL STRUCTURES AND CHEMICAL PROCESSES. First Semester Benchmarks:

More information

Gene and genome duplications: the impact of dosage-sensitivity on the fate of nuclear genes

Gene and genome duplications: the impact of dosage-sensitivity on the fate of nuclear genes Chromosome Research (2009) 17:699 717 DOI 10.1007/s10577-009-9055-9 Gene and genome duplications: the impact of dosage-sensitivity on the fate of nuclear genes Patrick P. Edger & J. Chris Pires # Springer

More information

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17. Genetic Variation: The genetic substrate for natural selection What about organisms that do not have sexual reproduction? Horizontal Gene Transfer Dr. Carol E. Lee, University of Wisconsin In prokaryotes:

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants

More information

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants

More information

Microbial Taxonomy and the Evolution of Diversity

Microbial Taxonomy and the Evolution of Diversity 19 Microbial Taxonomy and the Evolution of Diversity Copyright McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or display. 1 Taxonomy Introduction to Microbial Taxonomy

More information

Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss

Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss Methods Identification of orthologues, alignment and evolutionary distances A preliminary set of orthologues was

More information

Science Unit Learning Summary

Science Unit Learning Summary Learning Summary Inheritance, variation and evolution Content Sexual and asexual reproduction. Meiosis leads to non-identical cells being formed while mitosis leads to identical cells being formed. In

More information

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Homology Modeling. Roberto Lins EPFL - summer semester 2005 Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,

More information

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega BLAST Multiple Sequence Alignments: Clustal Omega What does basic BLAST do (e.g. what is input sequence and how does BLAST look for matches?) Susan Parrish McDaniel College Multiple Sequence Alignments

More information

The Origin of Species

The Origin of Species The Origin of Species A. Macroevolution: Up to this point we have discussed changes in alleles or microevolution, with evolution this is the evolution of new. is the origin of a new species. There are

More information

7. Tests for selection

7. Tests for selection Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info

More information

MiGA: The Microbial Genome Atlas

MiGA: The Microbial Genome Atlas December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From

More information

Computational methods for predicting protein-protein interactions

Computational methods for predicting protein-protein interactions Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational

More information

GENE duplication is believed to be the primary NF-II. In recent years, an alternative to the NF hypothesis

GENE duplication is believed to be the primary NF-II. In recent years, an alternative to the NF hypothesis Copyright 2005 by the Genetics Society of America DOI: 10.1534/genetics.104.037051 Rapid Subfunctionalization Accompanied by Prolonged and Substantial Neofunctionalization in Duplicate Gene Evolution Xionglei

More information

Sequence Alignment: A General Overview. COMP Fall 2010 Luay Nakhleh, Rice University

Sequence Alignment: A General Overview. COMP Fall 2010 Luay Nakhleh, Rice University Sequence Alignment: A General Overview COMP 571 - Fall 2010 Luay Nakhleh, Rice University Life through Evolution All living organisms are related to each other through evolution This means: any pair of

More information

J. MITCHELL MCGRATH, LESLIE G. HICKOK, and ERAN PICHERSKY

J. MITCHELL MCGRATH, LESLIE G. HICKOK, and ERAN PICHERSKY P1. Syst. Evol. 189:203-210 (1994) --Plant Systematics and Evolution Springer-Verlag 1994 Printed in Austria Assessment of gene copy number in the homosporous ferns Ceratopteris thalictroides and C. richardii

More information

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types Exp 11- THEORY Sequence Alignment is a process of aligning two sequences to achieve maximum levels of identity between them. This help to derive functional, structural and evolutionary relationships between

More information

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes. February 8, 2005 Bio 107/207 Winter 2005 Lecture 11 Mutation and transposable elements - the term mutation has an interesting history. - as far back as the 17th century, it was used to describe any drastic

More information

Whole-genome duplications and paleopolyploidy

Whole-genome duplications and paleopolyploidy Whole-genome duplications and paleopolyploidy Whole-genome duplications in protozoa Aury et al. (2006) analyzed the unicellular eukaryote Paramecium tetraurelia most of 40,000 genes arose through at least

More information

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.

GENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS. !! www.clutchprep.com CONCEPT: HISTORY OF GENETICS The earliest use of genetics was through of plants and animals (8000-1000 B.C.) Selective breeding (artificial selection) is the process of breeding organisms

More information

Unit 7: Evolution Guided Reading Questions (80 pts total)

Unit 7: Evolution Guided Reading Questions (80 pts total) AP Biology Biology, Campbell and Reece, 10th Edition Adapted from chapter reading guides originally created by Lynn Miriello Name: Unit 7: Evolution Guided Reading Questions (80 pts total) Chapter 22 Descent

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

6 Introduction to Population Genetics

6 Introduction to Population Genetics 70 Grundlagen der Bioinformatik, SoSe 11, D. Huson, May 19, 2011 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

NUCLEOTIDE SUBSTITUTIONS AND THE EVOLUTION OF DUPLICATE GENES

NUCLEOTIDE SUBSTITUTIONS AND THE EVOLUTION OF DUPLICATE GENES Conery, J.S. and Lynch, M. Nucleotide substitutions and evolution of duplicate genes. Pacific Symposium on Biocomputing 6:167-178 (2001). NUCLEOTIDE SUBSTITUTIONS AND THE EVOLUTION OF DUPLICATE GENES JOHN

More information

Modeling gene and genome duplications in eukaryotes

Modeling gene and genome duplications in eukaryotes Modeling gene and genome duplications in eukaryotes Steven Maere*, Stefanie De Bodt*, Jeroen Raes, Tineke Casneuf, Marc Van Montagu, Martin Kuiper, and Yves Van de Peer Department of Plant Systems Biology,

More information

Single alignment: Substitution Matrix. 16 march 2017

Single alignment: Substitution Matrix. 16 march 2017 Single alignment: Substitution Matrix 16 march 2017 BLOSUM Matrix BLOSUM Matrix [2] (Blocks Amino Acid Substitution Matrices ) It is based on the amino acids substitutions observed in ~2000 conserved block

More information

A Phylogenetic Network Construction due to Constrained Recombination

A Phylogenetic Network Construction due to Constrained Recombination A Phylogenetic Network Construction due to Constrained Recombination Mohd. Abdul Hai Zahid Research Scholar Research Supervisors: Dr. R.C. Joshi Dr. Ankush Mittal Department of Electronics and Computer

More information

Computational analyses of ancient polyploidy

Computational analyses of ancient polyploidy Computational analyses of ancient polyploidy Kevin P. Byrne 1 and Guillaume Blanc 2* 1 Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland. 2 Laboratoire

More information

Neutral behavior of shared polymorphism

Neutral behavior of shared polymorphism Proc. Natl. Acad. Sci. USA Vol. 94, pp. 7730 7734, July 1997 Colloquium Paper This paper was presented at a colloquium entitled Genetics and the Origin of Species, organized by Francisco J. Ayala (Co-chair)

More information

TE content correlates positively with genome size

TE content correlates positively with genome size TE content correlates positively with genome size Mb 3000 Genomic DNA 2500 2000 1500 1000 TE DNA Protein-coding DNA 500 0 Feschotte & Pritham 2006 Transposable elements. Variation in gene numbers cannot

More information

Principles of Genetics

Principles of Genetics Principles of Genetics Snustad, D ISBN-13: 9780470903599 Table of Contents C H A P T E R 1 The Science of Genetics 1 An Invitation 2 Three Great Milestones in Genetics 2 DNA as the Genetic Material 6 Genetics

More information

6 Introduction to Population Genetics

6 Introduction to Population Genetics Grundlagen der Bioinformatik, SoSe 14, D. Huson, May 18, 2014 67 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

Genetically Engineering Yeast to Understand Molecular Modes of Speciation

Genetically Engineering Yeast to Understand Molecular Modes of Speciation Genetically Engineering Yeast to Understand Molecular Modes of Speciation Mark Umbarger Biophysics 242 May 6, 2004 Abstract: An understanding of the molecular mechanisms of speciation (reproductive isolation)

More information

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew

Case Study. Who s the daddy? TEACHER S GUIDE. James Clarkson. Dean Madden [Ed.] Polyploidy in plant evolution. Version 1.1. Royal Botanic Gardens, Kew TEACHER S GUIDE Case Study Who s the daddy? Polyploidy in plant evolution James Clarkson Royal Botanic Gardens, Kew Dean Madden [Ed.] NCBE, University of Reading Version 1.1 Polypoidy in plant evolution

More information

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting. Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction

More information

Gene duplication as a major force in evolution

Gene duplication as a major force in evolution c Indian Academy of Sciences REVIEW ARTICLE Gene duplication as a major force in evolution SANTOSHKUMAR MAGADUM 1, URBI BANERJEE 2, PRIYADHARSHINI MURUGAN 3, DODDABHIMAPPA GANGAPUR 4 and RAJASEKAR RAVIKESAVAN

More information

Evidence That Mutation Is Universally Biased towards AT in Bacteria

Evidence That Mutation Is Universally Biased towards AT in Bacteria Evidence That Mutation Is Universally Biased towards AT in Bacteria Ruth Hershberg*, Dmitri A. Petrov Department of Biology, Stanford University, Stanford, California, United States of America Abstract

More information

Sara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject)

Sara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject) Bioinformática Sequence Alignment Pairwise Sequence Alignment Universidade da Beira Interior (Thanks to Ana Teresa Freitas, IST for useful resources on this subject) 1 16/3/29 & 23/3/29 27/4/29 Outline

More information

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei"

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS Masatoshi Nei" Abstract: Phylogenetic trees: Recent advances in statistical methods for phylogenetic reconstruction and genetic diversity analysis were

More information

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building How Molecules Evolve Guest Lecture: Principles and Methods of Systematic Biology 11 November 2013 Chris Simon Approaching phylogenetics from the point of view of the data Understanding how sequences evolve

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

Modes of Macroevolution

Modes of Macroevolution Modes of Macroevolution Macroevolution is used to refer to any evolutionary change at or above the level of species. Darwin illustrated the combined action of descent with modification, the principle of

More information