Comparative Sequence Analysis of Mycobacterium leprae and the New Leprosy-Causing Mycobacterium lepromatosis

Size: px
Start display at page:

Download "Comparative Sequence Analysis of Mycobacterium leprae and the New Leprosy-Causing Mycobacterium lepromatosis"

Transcription

1 JOURNAL OF BACTERIOLOGY, Oct. 2009, p Vol. 191, No /09/$ doi: /jb Copyright 2009, American Society for Microbiology. All Rights Reserved. Comparative Sequence Analysis of Mycobacterium leprae and the New Leprosy-Causing Mycobacterium lepromatosis Xiang Y. Han, 1 Kurt C. Sizer, 1 Erika J. Thompson, 2 Juma Kabanja, 3 Jun Li, 3 Peter Hu, 3 Laura Gómez-Valero, 4 and Francisco J. Silva 5,6 * Department of Laboratory Medicine, 1 DNA Analysis Core Facility, 2 and School of Health Sciences, 3 The University of Texas M. D. Anderson Cancer Center, Houston, Texas; Institut Pasteur, Unité de Biologie des Bactéries Intracellulaires, Paris, France 4 ; Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, València, Spain 5 ; and CIBER en Epidemiología y Salud Pública (CIBERESP), Madrid, Spain 6 Received 12 June 2009/Accepted 20 July 2009 Mycobacterium lepromatosis is a newly discovered leprosy-causing organism. Preliminary phylogenetic analysis of its 16S rrna gene and a few other gene segments revealed significant divergence from Mycobacterium leprae, a well-known cause of leprosy, that justifies the status of M. lepromatosis as a new species. In this study we analyzed the sequences of 20 genes and pseudogenes (22,814 nucleotides). Overall, the level of matching of these sequences with M. leprae sequences was 90.9%, which substantiated the species-level difference; the levels of matching for the 16S rrna genes and 14 protein-encoding genes were 98.0% and 93.1%, respectively, but the level of matching for five pseudogenes was only 79.1%. Five conserved protein-encoding genes were selected to construct phylogenetic trees and to calculate the numbers of synonymous substitutions (ds values) and nonsynonymous substitutions (dn values) in the two species. Robust phylogenetic trees constructed using concatenated alignment of these genes placed M. lepromatosis and M. leprae in a tight cluster with long terminal branches, implying that the divergence occurred long ago. The ds and dn values were also much higher than those for other closest pairs of mycobacteria. The ds values were 14 to 28% of the ds values for M. leprae and Mycobacterium tuberculosis, a more divergent pair of species. These results thus indicate that M. lepromatosis and M. leprae diverged 10 million years ago. The M. lepromatosis pseudogenes analyzed that were also pseudogenes in M. leprae showed nearly neutral evolution, and their relative ages were similar to those of M. leprae pseudogenes, suggesting that they were pseudogenes before divergence. Taken together, the results described above indicate that M. lepromatosis and M. leprae diverged from a common ancestor after the massive gene inactivation event described previously for M. leprae. Downloaded from * Corresponding author. Mailing address: Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Apartat 22085, València, Spain. Phone: Fax: francisco.silva@uv.es. Published ahead of print on 24 July Leprosy, one of the oldest human diseases, remains a significant public health problem in many developing countries (8). Mycobacterium leprae was the only known cause of leprosy until recently, when a new mycobacterium, Mycobacterium lepromatosis, was found to be the cause of diffuse lepromatous leprosy (DLL), a unique form of leprosy endemic in Mexico and the Caribbean (17). The discovery of this new species may provide an explanation for the clinical and geographic variability of leprosy. The initial phylogenetic analysis of M. lepromatosis was carried out using the sequences of the 16S rrna gene and segments of groel, rpob, and other genes (total, 4.99 kb) (17). This study revealed significant sequence differences between M. lepromatosis and all known Mycobacterium species and placed M. lepromatosis closest to M. leprae. However, the sequence variation justified assigning a new species for the new organism instead of classifying it as a variant of M. leprae. All M. leprae strains collected worldwide have been found to be clonal and to differ by only single-nucleotide polymorphism or variable numbers of tandem repeats (24). Also, the genomes of two M. leprae strains, strain TN from India (GenBank accession numbers AL to AL583926) (4) and strain Br4923 from Brazil (GenBank accession number FM211192) (N. Honore et al., unpublished data) share 99.98% identity. Like M. leprae, M. lepromatosis has not been cultivated on artificial media. In addition, our previous study also showed other similarities between these organisms, such as degeneration of mmaa3 into a pseudogene, the presence of unique AT-rich inserted sequences in the 16S rrna gene, identical six-base tandem repeats in rpot, similar G C contents, and great evolutionary distance from other mycobacteria (17). The M. leprae genome (3.3 Mb) is much smaller than the Mycobacterium tuberculosis genome (4.4 Mb) (3, 4). More intriguingly, the M. leprae genome has undergone reductive evolution; 40% of the genes are inactivated (4), and 50% of the genes of the last common ancestor of M. leprae and M. tuberculosis have been lost (13). On the other hand, the M. leprae genome has been far more stable than the M. tuberculosis genome, and the worldwide clonality of the M. leprae strains paralleled the global spread of M. leprae strains that occurred via human activity and migration during the last 100,000 years (24). Recently, by comparing the genomes of M. leprae and M. tuberculosis and by analyzing the ages of the M. leprae pseudogenes, Gomez-Valero et al. (13) estimated that a massive gene inactivation event took place in the M. leprae genome in the last 20 million years. The discovery of M. lepromatosis and its differences from M. leprae make it relevant for further study for diagnosis, treat- on April 15, 2019 by guest 6067

2 6068 HAN ET AL. J. BACTERIOL. ment, and prevention of DLL. Likewise, the many similarities between these two organisms prompted questions about their evolutionary histories and about how M. lepromatosis became endemic mainly in Mexico, while M. leprae occurs worldwide. In this study, we extended and refined our previous phylogenetic study by determining and analyzing the sequences of 20 genes and pseudogenes of M. lepromatosis. Our findings solidified the phylogeny of this new organism and provided new insights into the history of pseudogenes. MATERIALS AND METHODS DNA isolation and sequencing. DNA was extracted from the autopsy liver of a patient who died of DLL (17). Briefly, approximately 200 mg of freshly frozen infected liver tissue was homogenized and processed with N-acetyl-L-cysteine and NaOH to enrich the mycobacterium, decontaminate the preparation, and disrupt the host cells, the usual procedure for mycobacterial culture. After essentially pure mycobacterial cells were obtained, a commercial DNA extraction solution (PrepMan Ultra; Applied Biosystems, CA) was added, and the mixture was boiled to disrupt the bacterial cells. The supernatant contained the released DNA and was used for PCRs. Many sets of PCR primers were designed to target various genes and pseudogenes of M. lepromatosis. Without knowledge of the specific sequences of this organism, the primers were designed using the conserved sequences of the corresponding genes of M. leprae and M. tuberculosis after alignment. For a long gene, multiple sets of forward and reverse primers were designed, with each set amplifying 500 to 600 bp, with overlaps. The PCR amplicons were sequenced directly using an ABI sequencer. All sequences were matched with GenBank data and inspected visually to ensure accuracy. The full-length rpob gene of Mycobacterium haemophilum ATCC T was also sequenced for phylogenetic analysis. Retrieval of other sequences. Gene and pseudogene sequences of M. lepromatosis were used to extract putative orthologous genes from completely sequenced mycobacterial genomes. The extraction was performed using the Microbial Genome Database for Comparative Analysis (30). The genomes used included those of Mycobacterium abscessus ATCC T, Mycobacterium avium strain 104, M. avium subsp. paratuberculosis, Mycobacterium bovis BCG Pasteur 1173P2, Mycobacterium gilvum, M. leprae, Mycobacterium marinum M, Mycobacterium smegmatis, Mycobacterium sp. strain JLS, Mycobacterium sp. strain KMS, Mycobacterium sp. strain MCS, M. tuberculosis CDC1551, Mycobacterium ulcerans, and Mycobacterium vanbaalenii. Because there were six genomes of M. tuberculosis complex organisms in the database and all of them have essentially identical sequences, only two organisms, M. bovis BCG Pasteur 1173P2 and M. tuberculosis CDC1551, were used. Sequence alignment. The MAFFT program (18) was used to align nucleotide sequences of genes and pseudogenes. The ClustalW program implemented in MEGA 3.1 (19) was used to align the amino acid sequences of protein-encoding genes for the corresponding nucleotide alignments. Each alignment was visually inspected to detect short misalignments or poorly aligned regions. Doubtful regions, usually at both ends, were trimmed. To calculate the numbers of synonymous (ds) and nonsynonymous (dn) nucleotide substitutions per site in five M. lepromatosis pseudogenes (cita, gnd2, icd1, lld1, and mmaa3), alignment with corresponding M. leprae pseudogenes was performed using the following parameters: (i) the alignment length corresponded to the available length of each M. lepromatosis pseudogene; (ii) to maintain continuous reading frames, like those in protein-encoding genes, the nucleotides at triplet codon positions covering any indel in either organism were not counted; and (iii) nucleotide positions at which there were stop codons in either organism were removed. The final lengths of the aligned regions, with 15% reduction of the original alignments, were 918, 852, 552, 483, and 540 nucleotides for cita, gnd2, icd1, lld1, and mmaa3, respectively. Phylogenetics and relative rate tests. To determine the relationship of M. lepromatosis to other mycobacteria, phylogenetic trees were constructed using a concatenated alignment of five conserved genes. These genes, rpob, liga, groel, gnd1, and bfra, were chosen from 14 protein-encoding genes based on the following criteria: functional importance or conservation, the presence of orthologous genes in the 17 completely sequenced mycobacterial genomes, full length or nearly full length, and alignment accuracy. In particular, rpob and groel have frequently been used in phylogenetic studies (6). The alignments were also visually inspected to trim more variably aligned ends (on average, 9% of the alignments). The final lengths of the rpob, liga, groel, gnd1, and bfra genes were 3,429, 2,013, 1,575, 1,125, and 405 nucleotides, respectively, and there was total concatenation of 8,547 nucleotide sites. A maximum likelihood phylogenetic analysis was performed with the concatenated alignment using the program PHYML (16) with 100 bootstrap replicates and the HKY model. An additional neighbor-joining phylogeny was also obtained with the program MEGA 3.1 (19) with the Tamura-Nei model and 1,000 bootstrap replicates. To include M. haemophilum (without genome data) in the analysis, additional phylogenies with the same conditions were constructed using the full-length rpob gene of this organism. Tajima s relative rate tests (28) were implemented in MEGA (19). Both M. leprae and M. lepromatosis branches were compared with M. haemophilum using M. tuberculosis CDC1551 as the outgroup. The analyses included the first and second codon positions (usually associated with nonsynonymous substitutions) and the third codon positions (usually resulting in synonymous substitutions due to codon wobbling). Estimation of dn and ds values. The ds and dn values, along with standard errors, were estimated for gene-gene, gene-pseudogene, or pseudogene-pseudogene pairs using yn00 from PAML (32). When a pseudogene was involved, ds and dn values were used only for comparison, and no protein-encoding function of the pseudogene was indicated. To estimate the relative age of an M. lepromatosis pseudogene, the formula for the p parameter developed for M. leprae (13) was used: p dn ilps f dn it / ds ilps f dn it where dn ilps is the dn value for the ancestor (i) ofm. tuberculosis and M. leprae and the present pseudogene (lps)ofm. leprae (or M. lepromatosis), dn it is the dn value for the same ancestor (i) and the gene of M. tuberculosis, and f(dn it )isa function that makes a small correction for the expected number of substitutions in the lineage as if the pseudogene had not formed and is estimated from the complete set of pairs of orthologous genes for M. tuberculosis and M. leprae. This formula is based on the fact that the dn or ds value for a gene-pseudogene pair (for example, an M. tuberculosis gene and an M. leprae pseudogene) is the sum for the period of evolution as a gene (for the complete lineage of M. tuberculosis and part of the lineage of M. leprae) and the period of evolution as a pseudogene (which depends on the moment of inactivation in the M. leprae lineage). It also takes into consideration the fact that the rates of evolution for nonsynonymous changes are gene specific and that to determine these rates in M. leprae (or M. lepromatosis), the dn ilps and d Nit for each gene had to be compared. Because of the lack of a data set for M. lepromatosis, the function estimated for M. leprae was used. For the substitution values, M. avium subsp. paratuberculosis was used as an outgroup. Nucleotide sequence accession numbers. The M. lepromatosis nucleotide sequences have been deposited in the GenBank database under the accession numbers shown in Table 1. The GenBank accession number for the nucleotide sequence of the full-length rpob gene of M. haemophilum ATCC T is GQ RESULTS M. lepromatosis sequences and matches with M. leprae. The 20 genes and pseudogenes (22,814 nucleotides) from M. lepromatosis and their matches with M. leprae genes and pseudogenes are shown in Table 1. These genes and pseudogenes were chosen from the eight initial sequences (17) with extension of the 16S rrna, rpob, groel, and rpot genes and 16 other metabolic genes or pseudogenes in the M. leprae and M. tuberculosis genomes as previously described (4). Of the 16 new genes or pseudogenes, 12 were successfully amplified and sequenced, but 4 pseudogenes (ligb, umma, bfrb, and pfkb) were not amplified and sequenced despite many attempts and redesign of the primers, implying that there is too much variation or these pseudogenes are not present in M. lepromatosis. The G C content of the nucleotide sequences was 58.6%, similar to the 58.8% for the corresponding M. leprae sequences or the M. leprae genome (57.8%) (4). The average levels of nucleotide identity between M. lepromatosis and M. leprae were 93.1% and 79.1% for the coding genes and pseudogenes, respectively, and the overall level of

3 VOL. 191, 2009 SEQUENCE ANALYSIS OF M. LEPROMATOSIS 6069 TABLE 1. Sequences for 22,814 bp of M. lepromatosis and matches with M. leprae Gene or pseudogene (description) GenBank accession no. Length (bp) No. aligned Nucleotide matches No. of matches % No. of gaps (%) No. aligned Amino acid matches No. of matches All 14 protein-encoding genes 17,318 17,250 16, (0.27) 5,527 5, rpot (RNA polymerase sigma EU (5.3) factor) rpso (ribosomal protein O) EU a liga (DNA ligase) EU ,138 2,130 1, ribf (riboflavin kinase) EU bfra (bacterioferritin) EU icd2 (isocitrate dehydrogenase) EU ,255 2,240 2, lld2 (L-lactate dehydrogenase 2) EU ,292 1,290 1, mmaa4 (mycolic acid synthase 4) EU gnd1 (6-phosphogluconate EU ,545 1,543 1, dehydrogenase) pfka (6-phosphofructokinase) EU groel (heat shock protein 65) EU ,615 1,614 1, umaa2 (mycolic acid synthase 2) EU glta2 (citrate synthase 1) EU ,266 1,265 1, rpob (RNA polymerase beta EU ,596 3,590 3, ,178 1, subunit) All five pseudogenes 3,958 4,077 3, (5.1) gnd2 (6-phosphogluconate EU ,147 1, (6.7) dehydrogenase 2) lld1 (L-lactate dehydrogenase 1) EU (6.9) icd1 (isocitrate dehydrogenase 1) EU b (6.8) (2.8) cita (citrate synthase A) EU (2.1) mmaa3 (mycolic acid synthase 3) EU (5.7) 16S rrna gene EU ,538 1,542 1, (0.65) Total 22,814 22,869 20, (1.2) a There is a 3-bp gap before the start codon. b There is a 128-bp deletion between the better matched regions. % Downloaded from nucleotide identity was 90.9% (Table 1). This level is well below the 95% genome-level cutoff used to distinguish species (15). In addition, the level of nucleotide identity for the rpob gene was 94.5%, which is also less than the 98% cutoff proposed recently (1). Therefore, M. lepromatosis meets the definition of a species. The levels of identity for the protein-encoding genes were 88.0 to 94.6% at the nucleotide level and 79.0 to 98.0% at the amino acid level (overall level of identity, 95.2%). Thirteen of the 14 genes studied (16,369 bp) were aligned without gaps; the exception was rpot, which had 47 gap sites (5.3% of 881 bp). The rpot gene of M. lepromatosis contained a unique 74-bp tandem repeat, (CGAGCCACCAATACAGCATCT) 3 CGAGCCA CCAA, which corresponded to amino acids (RATNTAS) 3 RAT. A 42-bp portion of this sequence, (AATACAGCATCTCGA GCCACC) 2, which corresponded to (NTASRAT) 2, was not present in the M. leprae rpot gene. Slightly downstream of this sequence was a 3-bp insertion in M. lepromatosis. Upstream, a 2-bp insert in M. lepromatosis shifted the reading frame and likely led to a different start codon. Thus, rpot is the most variable of the 14 genes. The levels of identity for the pseudogenes were much lower, ranging 74.7 to 82.4%. Due to indels, the sequence of each of the five pseudogenes had many gaps in the alignment. Altogether, there was 207 bp of gaps (5.1% of 4,077 bp), accounting for 24.3% of the 853 mismatches. In addition, the M. lepromatosis icd1 gene had a 128-bp deletion in the middle. Therefore, in these pseudogenes, the lack of constraints of reading frames allowed much faster accumulation of indels. Finally, the level of identity for the 16S rrna gene was 98.0%, which was the highest level of identity, but there were 10 gap sites (0.65% of 1,542 bp). In all mycobacteria, 16S rrna genes are highly conserved, and there are relatively few indels or indels are not present in closely related mycobacteria, such as M. marinum and M. ulcerans, M. avium and Mycobacterium intracellulare, and other pairs of species (data not shown). Thus, indel mutations are also a feature of 16S rrna gene evolution in the two leprosy organisms. Other 16S rrna gene features were analyzed previously (17). Phylogenetic position of M. lepromatosis. As shown in Fig. 1A, phylogenetic trees constructed by using the maximum likelihood and neighbor-joining methods produced the same topology with bootstrap values of 100 at all nodes except one. M. leprae and M. lepromatosis were the most closely related species, yet in both trees the terminal branches leading to these species were much longer than those leading to other tight groups, such as the M. tuberculosis complex and the M. avium group. These long branches could not be explained by faster evolution or high substitution rates. On average, a gene evolving in the lineage leading to M. leprae had a slightly higher ds value than a gene in the M. tuberculosis lineage (from the last common ancestor) (0.82 and 0.72, respectively) (10), while the average dn values were and 0.045, respectively (13). Because the ds values are similar and their contribution to on April 15, 2019 by guest

4 6070 HAN ET AL. J. BACTERIOL. Downloaded from FIG. 1. Maximum likelihood phylogenies of selected mycobacteria based on (A) concatenated alignment of the rpob, liga, groel, gnd1, and bfra genes, (B) alignment of the rpob gene, and (C) alignment of the 16S rrna gene. Bootstrap values are indicated at the nodes. At some nodes there are two values, a value obtained by the neighbor-joining method (left value) and a value obtained by the maximum likelihood method (right value). The bars indicate the numbers of nucleotide substitutions per site. The trees were rooted to separate the slow-growing mycobacterial clade (A and B) and the M. avium branch (C). branch lengths is much more important than that of dn values, we believe that a much earlier divergence time is the best explanation. Because M. haemophilum was recently reported to be a close relative of M. leprae (22) and M. lepromatosis (17), phylogenetic trees based on rpob (3,429 bp) were also reconstructed (Fig. 1B). The same topology was obtained despite some lower bootstrap values. It was observed that M. haemophilum was much closer to the leprosy organisms than to other mycobacteria. Yet the branch lengths indicated that there was faster evolution of the leprosy mycobacteria than of M. haemophilum. Using Tajima s relative rate tests for this gene, the branches leading to M. leprae and M. lepromatosis were found to be significantly longer than the branch leading to M. haemophilum for the third codon position in both leprosy organisms (P for both) and for the first and second codon positions in M. lepromatosis (P 0.042). The main reason for this difference was the decrease in the G C content, mainly at the third codon positions (Table 2). A phylogeny was also obtained using the 16S rrna gene sequences, which resulted in a different topology (Fig. 1C); the Species TABLE 2. G C contents of rpob gene G C content (%) Codon positions 1 and 2 a Codon position 3 a Total M. lepromatosis M. leprae M. haemophilum M. tuberculosis a The first- and/or second-codon substitutions are frequently nonsynonymous, whereas the third-codon substitutions are usually synonymous due to codon wobbling. on April 15, 2019 by guest

5 VOL. 191, 2009 SEQUENCE ANALYSIS OF M. LEPROMATOSIS 6071 FIG. 2. Numbers of nucleotide substitutions per site for five pairs of Mycobacterium species. (Left panel) ds values. (Right panel) dn values. The genes analyzed were rpob, liga, groel, gnd1, and bfra (bars from left to right). Abbreviations: map, M. avium subsp. paratuberculosis; mav, M. avium 104; mbo, M. bovis AF2122/97; mle, M. leprae; mlp, M. lepromatosis; mmi, M. marinum M; mtc, M. tuberculosis CDC1551; mul, M. ulcerans. The dn/ds ratios for the rpob, liga, groel, gnd1, and bfra genes for the M. lepromatosis-m. leprae pair were 0.022, 0.155, 0.031, 0.075, and 0.089, respectively; the average dn/ds ratio was branch between the leprosy organisms was much longer, and the M. marinum-m. ulcerans clade was closer to M. tuberculosis than to the clade formed by M. haemophilum and the leprosy organisms. This tree was less realistic than the tree obtained using the concatenated protein-encoding genes (Fig. 1A). Phylogenomic studies have shown that in a gene phylogeny, when one lineage evolves much faster than the other lineages, an incorrect topology may be obtained, in which the fast-evolving branch has a tendency to move toward the base of the tree. Thus, the true topology may be obtained only with a small subset of genes that evolve slowly (2). The 16S rrna genes of the leprosy organisms evolved much faster than those of other mycobacteria, as shown by more indels, as noted above and previously (17). Evolutionary distance between M. lepromatosis and M. leprae. To estimate the evolutionary distance between M. lepromatosis and M. leprae, ds and dn values were calculated using the five conserved genes described above. These values were compared with the ds and dn values for other pairs of most closely related species or subspecies, including M. avium subspecies, M. marinum and M. ulcerans, and M. tuberculosis and M. bovis. The more divergent pair M. tuberculosis and M. leprae was also included as a gauge because these most medically important mycobacteria diverged 66 million years ago (13). The results are shown in Fig. 2. Since they are nearly unaffected by natural selection, the ds values are similar for many genes and thus more suitable for estimating relative divergence times. The ds values for M. tuberculosis and M. bovis were zero, consistent with the cladelevel divergence of these species and a genome level of identity of 99.95% (7). Similarly, subspecies-level differences for the M. marinum-m. ulcerans and M. avium subspecies pairs were reflected by low ds values. Conversely, for M. lepromatosis and M. leprae the ds values were far higher and for the five genes ranged from 14 to 28% (average, 20%) of the values for the much more divergent organisms M. tuberculosis and M. leprae. Similar results were obtained for the dn values, although in this case natural selection affected each gene or lineage differently, which resulted in some variability. For example, the highest dn value for liga meant faster evolution of this gene in the M. lepromatosis and M. leprae lineages. Together, these results suggest that the divergence of the two leprosy-causing organisms was not recent and that the length of their divergence was around 20% of the age of divergence of M. tuberculosis and M. leprae or 10 million years. Age of M. lepromatosis pseudogenes. For all five pseudogenes (cita, gnd2, icd1, lld1, and mmaa3) in both species there were corresponding functional genes in M. tuberculosis and other mycobacteria. So when were the genes inactivated, in the last common ancestor or after divergence? This question was addressed from a few angles. First, the dn/ds ratio for both species was estimated for each pseudogene. For an ancestral pseudogene, continued evolution after divergence in both lineages would be neutral, resulting in similar dn and ds values with a ratio of 1 or close to 1. Otherwise, selective evolution as a gene in both lineages after divergence with late inactivation would result in a much smaller dn/ds ratio. The longer the selective evolution, the smaller the dn/ds ratio would be. For example, Gomez-Valero et al. (13) showed that in the M. leprae lineage after divergence from M. tuberculosis, the average dn/ds ratio for genes was As shown in Table 3, the dn/ds ratios for M. lepromatosis and M. leprae were close to 1 (range, to 0.789) for the cita, gnd2, lld1, and mmaa3 pseudogenes, indicating that there was likely neutral evolution and thus the sequences were pseudogenes prior to divergence. It is noteworthy that removal of stop codons, which was required for calculation, resulted in underestimation of the dn values, leading to smaller ratios. Mutations from codons to stop codons, which are common in pseudogenes, are essentially dn events. Yet similar dn and ds values were also indicated by the overlapping confidence intervals (mean 1.96 standard errors) for these values. The only exception was icd1, for which the dn value was significantly smaller than the ds value (a ratio much less than 1). The pseudogene dn/ds ratios contrasted with the dn/ds ratios for the five coding genes analyzed, for which the average dn/ds ratio was (Fig. 2). The dn/ds ratios were also estimated for M. lepromatosis and M. tuberculosis, and as expected, they

6 6072 HAN ET AL. J. BACTERIOL. Pseudogene M. lepromatosis- M. leprae dn/ds ratio TABLE 3. Synonymous and nonsynonymous substitutions in pseudogenes a M. lepromatosis- M. tuberculosis dn/ds ratio M. lepromatosis-m. leprae dn ds P value cita NS gnd NS mmaa NS lld NS icd a The confidence intervals for dn and ds (mean 1.96 standard errors) overlap (NS, not statistically significant) or do not overlap (P 0.05). were small (Table 3), reflecting the evolution of the sequences entirely as genes in the M. tuberculosis lineage. Second, the relative ages of M. lepromatosis pseudogenes were estimated by using the formula used for the M. leprae pseudogenes (13). This formula required the presence of corresponding orthologous genes in M. tuberculosis and M. avium. Thus, it could not be used for mmaa3 because this pseudogene is not present in M. avium. The relative ages of the remaining four pseudogenes (Table 4) were found to be similar to the relative ages in M. leprae and also to the average age (0.13) of the 611 M. leprae pseudogenes analyzed previously (13). Thus, the similar ages of pseudogenes in the two species may indicate that they were pseudogenes in the ancestor. Finally, indels that were present in both species and thus likely of ancestral origin were examined using sequence alignments. Several common indels were detected, except in cita, and they included a 2-nucleotide insertion in gnd2, a 5-nucleotide deletion in icd1, a 10-nucleotide deletion in lld1, and a 5-nucleotide insertion and a 8- or 9-nucleotide deletion in mmaa3. These indels were not in triplet nucleotides; as a result, they could alter the codon reading frames and possibly create stop codons. Thus, these findings in essence show the ancestral presence of these pseudogenes, rather than convergent evolution. Many unique indels in each species represented continued evolution of these pseudogenes after divergence. Together, these analyses provided ample evidence showing that (i) all or most of the pseudogenes analyzed were inactivated prior to the divergence of M. lepromatosis and M. leprae and (ii) the massive gene inactivation described for M. leprae (4, 13) before the discovery of M. lepromatosis (17) in fact took place in the last common ancestor of these two species. DISCUSSION The discovery of the new leprosy-causing organism M. lepromatosis and its close relationship to M. leprae provided an TABLE 4. Relative pseudogene age a Pseudogene M. lepromatosis M. leprae icd cita lld gnd Avg a The relative age ranges from 1 (the age equals the time of divergence of M. lepromatosis-m. leprae from M. tuberculosis) to 0 (pseudogene inactivation at the present time). opportunity to discern the evolutionary features of these organisms. Using in-depth analysis of 20 genes and pseudogenes from M. lepromatosis, this study fulfilled some of our goals. First, the species-level divergence between these two organisms was confirmed to be on a larger genetic scale despite the current lack of culture and phenotypic features. Second, divergence 10 million years ago is proposed based on the topology of robust phylogenetic trees and the levels of nucleotide substitutions in the protein-encoding genes. This time is orders of magnitude ( 88 times) greater than the relatively well-established divergence time for M. tuberculosis and M. bovis ( 113,000 years ago), which share 99.95% genome identity (7) and had no resolvable difference in our analyses (Fig. 1 and 2). And third, the age of the pseudogenes of M. lepromatosis, which is nearly identical to the age of M. leprae pseudogenes, their nearly neutral evolution, and the presence of codonaltering common indels suggest these pseudogenes were pseudogenes in the last common ancestor of the two species. Bacterial taxonomy has changed along with technological advances, from morphological classification a century ago to phenotype-based chemotaxonomy to a polyphasic approach involving assessment of genetic variation, such as genomic DNA hybridization (70% cutoff to differentiate species) and divergence of the 16S rrna gene (in general, 97% cutoff) in recent decades (21). Numerous bacterial genome sequences are now readily accessible, making genetic comparisons accurate and comprehensive for the purpose of species identification (15). Yet the phenotype of an organism, which includes complex traits involving many genetic elements, is still essential. For instance, historically as well as practically, several species with essentially identical genome sequences are retained, such as M. tuberculosis for humans and M. bovis for cows. In the case of M. lepromatosis, status as a novel species can be justified because of its remarkable genetic variation, as well as uniqueness of the disease that it causes (DLL). However, to satisfy the bacteriological code for a full description of this new species, phenotypic characterization after cultivation and/or animal passage is needed, and efforts to do this are being made. Practically, the sequences enable us to separate the two leprosy organisms to better define the disease. For instance, we have devised a reliable PCR assay based on the species-specific 16S rrna gene sequences to detect the presence of each species in a skin biopsy (X. Y. Han, K. C. Sizer, F. Aung, H. H. Tan, and B. Werner, unpublished data; X. Y. Han, K. C. Sizer, S. Velarde-Felix, and F. Vargas-Ocampo, unpublished data). Using this assay in a follow-up study of M. lepromatosis infections in Mexico (X. Y. Han, K. C. Sizer, S. Velarde-Felix, and

7 VOL. 191, 2009 SEQUENCE ANALYSIS OF M. LEPROMATOSIS 6073 F. Vargas-Ocampo, unpublished data), we found that far more leprosy cases were caused by M. lepromatosis than by M. leprae, suggesting that M. lepromatosis is the dominant leprosy-causing species in Mexico. We also noted that, in addition to lepromatous leprosy, M. lepromatosis also specifically caused DLL in 30% of the infections. DLL is endemic in Mexico but rare elsewhere. These findings, as well as the lack of DLL reports or description in Europe, more specifically in Spain, suggest that DLL is native to Mexico and was not brought over by the Spaniards over the past 500 years. Native Mongoloid Mexicans initially migrated from Asia 12,000 years ago through the Alaska Bering land bridge and settled in the Pacific coastal areas of Mexico. Some of them migrated further south and settled in the Caribbean, other central American countries, and South America. Our finding of M. lepromatosis in Brazil and Singapore Chinese (X. Y. Han, K. C. Sizer, F. Aung, H. H. Tan, and B. Werner, unpublished data) and the presence of DLL in the Caribbean (8) are also consistent with the possibility that there was Mongoloid spread of this organism. Therefore, the studies of M. lepromatosis have provided new insights into the puzzling clinical and geographic features of leprosy. While the overall features of M. lepromatosis and M. leprae are very similar in terms of culture difficulty, G C content, and the functional status of genes and pseudogenes, M. lepromatosis likely has some unique phenotypic traits that have yet to be discovered that may be responsible for some distinct clinical features of M. lepromatosis infections, such as DLL. DLL is chronic, and the organism is strictly intracellular (17, 20, 26). Vargas-Ocampo (31) recently studied the histopathology of 199 patients with DLL in Mexico and concluded that DLL is essentially a vascular disease caused by mycobacterial invasion, with early vaculitis to late-stage vascular occlusion, which is responsible for the clinical manifestations of the disease that include skin purpura, ulceration, and necrosis. Humans are the only known natural host of M. leprae and have probably been infected for 100,000 years according to genomic evidence based on many M. leprae strains obtained worldwide (24). Consistent with the strictly parasitic lifestyle of this organism, the M. leprae genome is the smallest genome of all of the mycobacterial genomes sequenced to date. This genome has also undergone reductive evolution with massive inactivation of genes (4) estimated to have occurred during the last 20 million years (13). The present study further establishes that this genome decay took place in the last common ancestor of M. leprae and M. lepromatosis. What led to the genome decay in the ancestor 20 million years ago? We speculate that there is some relationship to adaptation from a free-living lifestyle, like that of almost all other mycobacteria ( 100 species) except M. tuberculosis and M. bovis, to an increasingly parasitic lifestyle in a host (an early ancestor of hominids?). During the last decade, sequencing and comparative analysis of many bacterial genomes have shown that changes in lifestyle from free living to host dependence or restriction to a specific host may lead to relaxation of natural selection and make many genes nonessential, initiating the process of genome downsizing. In the early stages, many genes (up to thousands of genes), usually less selectively constrained genes, are inactivated (4, 5, 13, 29). This inactivation occurs through missense or nonsense nucleotide substitutions, indels, or mobilization of insertion elements (9). There is then a tendency to lose the nonfunctional DNA (12, 23, 27). Analysis of some bacterial endosymbionts of insects, such as the aphid endosymbiont Buchnera aphidicola and the ant endosymbiont Blochmannia floridanus, has permitted workers to establish a stepwise scenario for genome reduction involving very slow and gradual degradation with scattered punctual large deletion events (11, 14). This model has been confirmed by complete sequencing of several B. aphidicola genomes (25). In this regard, future genome sequencing of M. lepromatosis and comparative genomic analyses with M. leprae may shed light on the genome decay in these highly specialized pathogens. Why did the ancestor evolve into two species 10 million years ago if it had become parasitic in a primate host? The answer to this question is even more elusive. There are a few possible explanations, including subtle niche differences in the host, eventually leading to separate evolution paths; coevolution with different host populations of the same species that might be affected variably by the availability of nutrients, susceptibility, geographic restriction, and other factors; and a shift from one host species to another. The niche possibility can be tested in future clinical studies by examining tissue infected by each organism. Both organisms are known to be intracellular in macrophages; however, remarkable endothelial invasion by M. lepromatosis in DLL has been documented (17, 26, 31). In our clinical studies, we also found many cases of dual M. lepromatosis-m. leprae infection (X. Y. Han, K. C. Sizer, F. Aung, H. H. Tan, and B. Werner, unpublished data; X. Y. Han, K. C. Sizer, S. Velarde-Felix, and F. Vargas-Ocampo, unpublished data). Further studies of the coexistence of these two organisms may provide insight as well. ACKNOWLEDGMENTS This work was supported in part by a University Cancer Foundation grant from the University of Texas M. D. Anderson Cancer Center (MDACC) (to X.Y.H.), by the School of Health Sciences of MDACC, and by National Institutes of Health grant CA16672 for the MDACC DNA Core Facility. We thank the DNA Core Facility staff for DNA sequencing and John Spencer for helpful discussions. X.Y.H. initiated and orchestrated this study; X.Y.H., K.C.S., E.J.T., J.K., J.L., and P.H. determined the sequences; F.J.S., L.G.V., and X.Y.H. performed data analysis; and F.J.S. and X.Y.H. wrote the paper. REFERENCES 1. Adekambi, T., T. M. Shinnick, D. Raoult, and M. Drancourt Complete rpob gene sequencing as a suitable supplement to DNA-DNA hybridization for bacterial species and genus delineation. Int. J. Syst. Evol. Microbiol. 58: Canback, B., I. Tamas, and S. G. Andersson A phylogenomic study of endosymbiotic bacteria. Mol. Biol. Evol. 21: Cole, S. T., R. Brosch, J. Parkhill, T. Garnier, C. Churcher, D. Harris, S. V. Gordon, K. Eiglmeier, S. Gas, C. E. Barry, F. Tekaia, K. Badcock, D. Basham, D. Brown, T. Chillingworth, R. Connor, R. Davies, K. Devlin, T. Feltwell, S. Gentles, N. Hamlin, S. Holroyd, T. Hornby, K. Jagels, A. Krogh, J. McLean, S. Moule, L. Murphy, K. Oliver, J. Osborne, M. A. Quail, M. A. Rajandream, J. Rogers, S. Rutter, K. Seeger, J. Skelton, R. Squares, S. Squares, J. E. Sulston, K. Taylor, S. Whitehead, and B. G. Barrell Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393: Cole, S. T., K. Eiglmeier, J. Parkhill, K. D. James, N. R. Thomson, P. R. Wheeler, N. Honore, T. Garnier, C. Churcher, D. Harris, K. Mungall, D. Basham, D. Brown, T. Chillingworth, R. Connor, R. M. Davies, K. Devlin, S. Duthoy, T. Feltwell, A. Fraser, N. Hamlin, S. Holroyd, T. Hornsby, K. Jagels, C. Lacroix, J. Maclean, S. Moule, L. Murphy, K. Oliver, M. A. Quail, M. A. Rajandream, K. M. Rutherford, S. Rutter, K. Seeger, S. Simon, M. Simmonds, J. Skelton, R. Squares, S. Squares, K. Stevens, K. Taylor, S. White-

8 6074 HAN ET AL. J. BACTERIOL. head, J. R. Woodward, and B. G. Barrell Massive gene decay in the leprosy bacillus. Nature 409: Delmotte, F., C. Rispe, J. Schaber, F. J. Silva, and A. Moya Tempo and mode of early gene loss in endosymbiotic bacteria from insects. BMC Evol. Biol. 6: Devulder, G., M. P. de Montclos, and J. P. Flandrois A multigene approach to phylogenetic analysis using the genus Mycobacterium as a model. Int. J. Syst. Evol. Microbiol. 55: Garnier, T., K. Eiglmeier, J. C. Camus, N. Medina, H. Mansoor, M. Pryor, S. Duthoy, S. Grondin, C. Lacroix, C. Monsempe, S. Simon, B. Harris, R. Atkin, J. Doggett, R. Mayes, L. Keating, P. R. Wheeler, J. Parkhill, B. G. Barrell, S. T. Cole, S. V. Gordon, and R. G. Hewinson The complete genome sequence of Mycobacterium bovis. Proc. Natl. Acad. Sci. USA 100: Gelber, R. H Leprosy (Hansen s disease), p In D. L. Kasper, E. Braunwald, A. S. Fauci, S. L. Hauser, D. L. Longo, and J. L. Jameson (ed.), Harrison s principles of internal medicine. McGraw-Hill, Inc., New York, NY. 9. Gil, R., E. Belda, M. J. Gosalbes, L. Delaye, A. Vallier, C. Vincent-Monegat, A. Heddi, F. J. Silva, A. Moya, and A. Latorre Massive presence of insertion sequences in the genome of SOPE, the primary endosymbiont of the rice weevil Sitophilus oryzae. Int. Microbiol. 11: Gómez-Valero, L Evolución reductiva del tamaño del genoma en bacterias intracelulares. Ph.D. thesis. Universitat de València, València, Spain. 11. Gomez-Valero, L., A. Latorre, R. Gil, J. Gadau, H. Feldhaar, and F. J. Silva Patterns and rates of nucleotide substitution, insertion and deletion in the endosymbiont of ants Blochmannia floridanus. Mol. Ecol. 17: Gomez-Valero, L., A. Latorre, and F. J. Silva The evolutionary fate of nonfunctional DNA in the bacterial endosymbiont Buchnera aphidicola. Mol. Biol. Evol. 21: Gomez-Valero, L., E. P. C. Rocha, A. Latorre, and F. J. Silva Reconstructing the ancestor of Mycobacterium leprae: the dynamics of gene loss and genome reduction. Genome Res. 17: Gomez-Valero, L., F. J. Silva, J. C. Simon, and A. Latorre Genome reduction of the aphid endosymbiont Buchnera aphidicola in a recent evolutionary time scale. Gene 389: Goris, J., K. T. Konstantinidis, J. A. Klappenbach, T. Coenye, P. Vandamme, and J. M. Tiedje DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int. J. Syst. Evol. Microbiol. 57: Guindon, S., and O. Gascuel A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52: Han, X. Y., Y. H. Seo, K. C. Sizer, T. Schoberle, G. S. May, J. S. Spencer, W. Li, and R. G. Nair A new Mycobacterium species causing diffuse lepromatous leprosy. Am. J. Clin. Pathol. 130: Katoh, K., K. Misawa, K. Kuma, and T. Miyata MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30: Kumar, S., K. Tamura, and M. Nei MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefs Bioinform. 5: Latapi, F., and A. Chevez-Zamora The spotted leprosy of Lucio: an introduction to its clinical and histological study. Int. J. Lepr. 16: Ludwig, W., and H.-P. Klenk Overview: a phylogenetic backbone and taxonomic framework for prokaryotic systematics, p In D. R. Boone, R. W. Castenholz, and G. M. Garrity (ed.), Bergey s manual of systematic bacteriology, 2nd ed., vol. I. The archaea and the deeply branching and phototrophic bacteria. Springer-Verlag, New York, NY. 22. Mignard, S., and J. P. Flandrois A seven-gene, multilocus, genus-wide approach to the phylogeny of mycobacteria using supertrees. Int. J. Syst. Evol. Microbiol. 58: Mira, A., H. Ochman, and N. A. Moran Deletional bias and the evolution of bacterial genomes. Trends Genet. 17: Monot, M., N. Honore, T. Garnier, R. Araoz, J. Y. Coppee, C. Lacroix, S. Sow, J. S. Spencer, R. W. Truman, D. L. Williams, R. Gelber, M. Virmond, B. Flageul, S. N. Cho, B. Ji, A. Paniz-Mondolfi, J. Convit, S. Young, P. E. Fine, V. Rasolofo, P. J. Brennan, and S. T. Cole On the origin of leprosy. Science 308: Moran, N. A., H. J. McLaughlin, and R. Sorek The dynamics and time scale of ongoing genomic erosion in symbiotic bacteria. Science 323: Rea, T. H., and R. S. Jerskey Clinical and histologic variations among thirty patients with Lucio s phenomenon and pure and primitive diffuse lepromatosis (Latapi s lepromatosis). Int. J. Lepr. Other Mycobact. Dis. 73: Silva, F. J., A. Latorre, and A. Moya Genome size reduction through multiple events of gene disintegration in Buchnera APS. Trends Genet. 17: Tajima, F Simple methods for testing the molecular evolutionary clock hypothesis. Genetics 135: Toh, H., B. L. Weiss, S. A. H. Perkin, A. Yamashita, K. Oshima, M. Hattori, and S. Aksoy Massive genome erosion and functional adaptations provide insights into the symbiotic lifestyle of Sodalis glossinidius in the tsetse host. Genome Res. 16: Uchiyama, I MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups. Nucleic Acids Res. 35:D343 D Vargas-Ocampo, F Diffuse leprosy of Lucio and Latapi: a histologic study. Lepr. Rev. 78: Yang, Z. H., and R. Nielsen Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol. Biol. Evol. 17:32 43.

Microbial Taxonomy and the Evolution of Diversity

Microbial Taxonomy and the Evolution of Diversity 19 Microbial Taxonomy and the Evolution of Diversity Copyright McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or display. 1 Taxonomy Introduction to Microbial Taxonomy

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

Genomes and Their Evolution

Genomes and Their Evolution Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B Microbial Diversity and Assessment (II) Spring, 007 Guangyi Wang, Ph.D. POST03B guangyi@hawaii.edu http://www.soest.hawaii.edu/marinefungi/ocn403webpage.htm General introduction and overview Taxonomy [Greek

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Genome reduction in prokaryotic obligatory intracellular parasites of humans: a comparative analysis

Genome reduction in prokaryotic obligatory intracellular parasites of humans: a comparative analysis International Journal of Systematic and Evolutionary Microbiology (2004), 54, 1937 1941 DOI 10.1099/ijs.0.63090-0 Genome reduction in prokaryotic obligatory intracellular parasites of humans: a comparative

More information

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly Comparative Genomics: Human versus chimpanzee 1. Introduction The chimpanzee is the closest living relative to humans. The two species are nearly identical in DNA sequence (>98% identity), yet vastly different

More information

MiGA: The Microbial Genome Atlas

MiGA: The Microbial Genome Atlas December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From

More information

Lecture 11 Friday, October 21, 2011

Lecture 11 Friday, October 21, 2011 Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system

More information

8/23/2014. Phylogeny and the Tree of Life

8/23/2014. Phylogeny and the Tree of Life Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major

More information

PHYLOGENY AND SYSTEMATICS

PHYLOGENY AND SYSTEMATICS AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

7. Tests for selection

7. Tests for selection Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info

More information

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern

More information

Taxonomy. Content. How to determine & classify a species. Phylogeny and evolution

Taxonomy. Content. How to determine & classify a species. Phylogeny and evolution Taxonomy Content Why Taxonomy? How to determine & classify a species Domains versus Kingdoms Phylogeny and evolution Why Taxonomy? Classification Arrangement in groups or taxa (taxon = group) Nomenclature

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.

Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible. Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional

More information

Microbial Taxonomy. Slowly evolving molecules (e.g., rrna) used for large-scale structure; "fast- clock" molecules for fine-structure.

Microbial Taxonomy. Slowly evolving molecules (e.g., rrna) used for large-scale structure; fast- clock molecules for fine-structure. Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional

More information

Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss

Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss Supplementary Information for Hurst et al.: Causes of trends of amino acid gain and loss Methods Identification of orthologues, alignment and evolutionary distances A preliminary set of orthologues was

More information

Leber s Hereditary Optic Neuropathy

Leber s Hereditary Optic Neuropathy Leber s Hereditary Optic Neuropathy Dear Editor: It is well known that the majority of Leber s hereditary optic neuropathy (LHON) cases was caused by 3 mtdna primary mutations (m.3460g A, m.11778g A, and

More information

Chapter 16: Reconstructing and Using Phylogenies

Chapter 16: Reconstructing and Using Phylogenies Chapter Review 1. Use the phylogenetic tree shown at the right to complete the following. a. Explain how many clades are indicated: Three: (1) chimpanzee/human, (2) chimpanzee/ human/gorilla, and (3)chimpanzee/human/

More information

Lecture Notes: BIOL2007 Molecular Evolution

Lecture Notes: BIOL2007 Molecular Evolution Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits

More information

C3020 Molecular Evolution. Exercises #3: Phylogenetics

C3020 Molecular Evolution. Exercises #3: Phylogenetics C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from

More information

Chapter 19. Microbial Taxonomy

Chapter 19. Microbial Taxonomy Chapter 19 Microbial Taxonomy 12-17-2008 Taxonomy science of biological classification consists of three separate but interrelated parts classification arrangement of organisms into groups (taxa; s.,taxon)

More information

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016 Molecular phylogeny - Using molecular sequences to infer evolutionary relationships Tore Samuelsson Feb 2016 Molecular phylogeny is being used in the identification and characterization of new pathogens,

More information

Microbial Taxonomy. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.

Microbial Taxonomy. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible. Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional

More information

Cladistics and Bioinformatics Questions 2013

Cladistics and Bioinformatics Questions 2013 AP Biology Name Cladistics and Bioinformatics Questions 2013 1. The following table shows the percentage similarity in sequences of nucleotides from a homologous gene derived from five different species

More information

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the

More information

Stepping stones towards a new electronic prokaryotic taxonomy. The ultimate goal in taxonomy. Pragmatic towards diagnostics

Stepping stones towards a new electronic prokaryotic taxonomy. The ultimate goal in taxonomy. Pragmatic towards diagnostics Stepping stones towards a new electronic prokaryotic taxonomy - MLSA - Dirk Gevers Different needs for taxonomy Describe bio-diversity Understand evolution of life Epidemiology Diagnostics Biosafety...

More information

SPECIES OF ARCHAEA ARE MORE CLOSELY RELATED TO EUKARYOTES THAN ARE SPECIES OF PROKARYOTES.

SPECIES OF ARCHAEA ARE MORE CLOSELY RELATED TO EUKARYOTES THAN ARE SPECIES OF PROKARYOTES. THE TERMS RUN AND TUMBLE ARE GENERALLY ASSOCIATED WITH A) cell wall fluidity. B) cell membrane structures. C) taxic movements of the cell. D) clustering properties of certain rod-shaped bacteria. A MAJOR

More information

Introduction to polyphasic taxonomy

Introduction to polyphasic taxonomy Introduction to polyphasic taxonomy Peter Vandamme EUROBILOFILMS - Third European Congress on Microbial Biofilms Ghent, Belgium, 9-12 September 2013 http://www.lm.ugent.be/ Content The observation of diversity:

More information

Constructing Evolutionary/Phylogenetic Trees

Constructing Evolutionary/Phylogenetic Trees Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood

More information

A. Incorrect! In the binomial naming convention the Kingdom is not part of the name.

A. Incorrect! In the binomial naming convention the Kingdom is not part of the name. Microbiology Problem Drill 08: Classification of Microorganisms No. 1 of 10 1. In the binomial system of naming which term is always written in lowercase? (A) Kingdom (B) Domain (C) Genus (D) Specific

More information

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

CHAPTERS 24-25: Evidence for Evolution and Phylogeny CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology

More information

Biology 211 (2) Week 1 KEY!

Biology 211 (2) Week 1 KEY! Biology 211 (2) Week 1 KEY Chapter 1 KEY FIGURES: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7 VOCABULARY: Adaptation: a trait that increases the fitness Cells: a developed, system bound with a thin outer layer made of

More information

chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis

chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis chapter 5 Harald G. Wiker, Eric Spierings, Marc A. B. Kolkman, Tom H. M. Ottenhoff, and Morten

More information

This is a repository copy of Microbiology: Mind the gaps in cellular evolution.

This is a repository copy of Microbiology: Mind the gaps in cellular evolution. This is a repository copy of Microbiology: Mind the gaps in cellular evolution. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/114978/ Version: Accepted Version Article:

More information

The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome

The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome Dr. Dirk Gevers 1,2 1 Laboratorium voor Microbiologie 2 Bioinformatics & Evolutionary Genomics The bacterial species in the genomic era CTACCATGAAAGACTTGTGAATCCAGGAAGAGAGACTGACTGGGCAACATGTTATTCAG GTACAAAAAGATTTGGACTGTAACTTAAAAATGATCAAATTATGTTTCCCATGCATCAGG

More information

Estimation of gene insertion/deletion rates with missing data

Estimation of gene insertion/deletion rates with missing data Genetics: Early Online, published on August 26, 2016 as 10.1534/genetics.116.191973 Estimation of gene insertion/deletion rates with missing data Utkarsh J. Dang 1,2, Alison M. Devault 3, Tatum D. Mortimer

More information

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky MOLECULAR PHYLOGENY "Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky EVOLUTION - theory that groups of organisms change over time so that descendeants differ structurally

More information

2 Genome evolution: gene fusion versus gene fission

2 Genome evolution: gene fusion versus gene fission 2 Genome evolution: gene fusion versus gene fission Berend Snel, Peer Bork and Martijn A. Huynen Trends in Genetics 16 (2000) 9-11 13 Chapter 2 Introduction With the advent of complete genome sequencing,

More information

Macroevolution Part I: Phylogenies

Macroevolution Part I: Phylogenies Macroevolution Part I: Phylogenies Taxonomy Classification originated with Carolus Linnaeus in the 18 th century. Based on structural (outward and inward) similarities Hierarchal scheme, the largest most

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

Microbiome: 16S rrna Sequencing 3/30/2018

Microbiome: 16S rrna Sequencing 3/30/2018 Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics

More information

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information - Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes - Supplementary Information - Martin Bartl a, Martin Kötzing a,b, Stefan Schuster c, Pu Li a, Christoph Kaleta b a

More information

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species. Supplementary Figure 1 Icm/Dot secretion system region I in 41 Legionella species. Homologs of the effector-coding gene lega15 (orange) were found within Icm/Dot region I in 13 Legionella species. In four

More information

Evolution Problem Drill 09: The Tree of Life

Evolution Problem Drill 09: The Tree of Life Evolution Problem Drill 09: The Tree of Life Question No. 1 of 10 Question 1. The age of the Earth is estimated to be about 4.0 to 4.5 billion years old. All of the following methods may be used to estimate

More information

Understanding relationship between homologous sequences

Understanding relationship between homologous sequences Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective

More information

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5. Five Sami Khuri Department of Computer Science San José State University San José, California, USA sami.khuri@sjsu.edu v Distance Methods v Character Methods v Molecular Clock v UPGMA v Maximum Parsimony

More information

Primate Diversity & Human Evolution (Outline)

Primate Diversity & Human Evolution (Outline) Primate Diversity & Human Evolution (Outline) 1. Source of evidence for evolutionary relatedness of organisms 2. Primates features and function 3. Classification of primates and representative species

More information

Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution

Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions

More information

Algorithms in Bioinformatics

Algorithms in Bioinformatics Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods

More information

Ch 10. Classification of Microorganisms

Ch 10. Classification of Microorganisms Ch 10 Classification of Microorganisms Student Learning Outcomes Define taxonomy, taxon, and phylogeny. List the characteristics of the Bacteria, Archaea, and Eukarya domains. Differentiate among eukaryotic,

More information

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the

More information

Chapter 19 Organizing Information About Species: Taxonomy and Cladistics

Chapter 19 Organizing Information About Species: Taxonomy and Cladistics Chapter 19 Organizing Information About Species: Taxonomy and Cladistics An unexpected family tree. What are the evolutionary relationships among a human, a mushroom, and a tulip? Molecular systematics

More information

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA

INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA XIUFENG WAN xw6@cs.msstate.edu Department of Computer Science Box 9637 JOHN A. BOYLE jab@ra.msstate.edu Department of Biochemistry and Molecular Biology

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)

More information

Microbiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms

Microbiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms 1 2 Bergey s Manual of Systematic Bacteriology differs from Bergey s Manual of Determinative Bacteriology in that the former a. groups bacteria into species. b. groups bacteria according to phylogenetic

More information

Examples of Phylogenetic Reconstruction

Examples of Phylogenetic Reconstruction Examples of Phylogenetic Reconstruction 1. HIV transmission Recently, an HIV-positive Florida dentist was suspected of having transmitted the HIV virus to his dental patients. Although a number of his

More information

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9 Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,

More information

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparative genomics: Overview & Tools + MUMmer algorithm Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first

More information

A Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems

A Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems A Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems Tao Yuan, Asst/Prof. Stephen Tiong-Lee Tay and Dr Volodymyr Ivanov School of Civil and Environmental

More information

belonging to the Genus Pantoea

belonging to the Genus Pantoea Emerging diseases of maize and onion caused by bacteria belonging to the Genus Pantoea by Teresa Goszczynska Submitted in partial fulfilment of the requirements for the degree Philosophiae Doctoriae in

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

a-dB. Code assigned:

a-dB. Code assigned: This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the

More information

Graduate Funding Information Center

Graduate Funding Information Center Graduate Funding Information Center UNC-Chapel Hill, The Graduate School Graduate Student Proposal Sponsor: Program Title: NESCent Graduate Fellowship Department: Biology Funding Type: Fellowship Year:

More information

Unsupervised Learning in Spectral Genome Analysis

Unsupervised Learning in Spectral Genome Analysis Unsupervised Learning in Spectral Genome Analysis Lutz Hamel 1, Neha Nahar 1, Maria S. Poptsova 2, Olga Zhaxybayeva 3, J. Peter Gogarten 2 1 Department of Computer Sciences and Statistics, University of

More information

Effects of Gap Open and Gap Extension Penalties

Effects of Gap Open and Gap Extension Penalties Brigham Young University BYU ScholarsArchive All Faculty Publications 200-10-01 Effects of Gap Open and Gap Extension Penalties Hyrum Carroll hyrumcarroll@gmail.com Mark J. Clement clement@cs.byu.edu See

More information

This is a postprint of an article published in Khachane, A.N., Timmis, K.N., Martins Dos Santos, V.A.P. Dynamics of reductive genome evolution in

This is a postprint of an article published in Khachane, A.N., Timmis, K.N., Martins Dos Santos, V.A.P. Dynamics of reductive genome evolution in This is a postprint of an article published in Khachane, A.N., Timmis, K.N., Martins Dos Santos, V.A.P. Dynamics of reductive genome evolution in mitochondria and obligate intracellular microbes (2007)

More information

Microbial Taxonomy. C. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.

Microbial Taxonomy. C. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible. Microbial Taxonomy 1. Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eucaryote, is in a mess we are stuck with it for traditional

More information

The Minimal-Gene-Set -Kapil PHY498BIO, HW 3

The Minimal-Gene-Set -Kapil PHY498BIO, HW 3 The Minimal-Gene-Set -Kapil Rajaraman(rajaramn@uiuc.edu) PHY498BIO, HW 3 The number of genes in organisms varies from around 480 (for parasitic bacterium Mycoplasma genitalium) to the order of 100,000

More information

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics Bioinformatics 1 Biology, Sequences, Phylogenetics Part 4 Sepp Hochreiter Klausur Mo. 30.01.2011 Zeit: 15:30 17:00 Raum: HS14 Anmeldung Kusss Contents Methods and Bootstrapping of Maximum Methods Methods

More information

Consensus Methods. * You are only responsible for the first two

Consensus Methods. * You are only responsible for the first two Consensus Trees * consensus trees reconcile clades from different trees * consensus is a conservative estimate of phylogeny that emphasizes points of agreement * philosophy: agreement among data sets is

More information

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species PGA: A Program for Genome Annotation by Comparative Analysis of Maximum Likelihood Phylogenies of Genes and Species Paulo Bandiera-Paiva 1 and Marcelo R.S. Briones 2 1 Departmento de Informática em Saúde

More information

Introduction to Bioinformatics Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Dr. rer. nat. Gong Jing Cancer Research Center Medicine School of Shandong University 2012.11.09 1 Chapter 4 Phylogenetic Tree 2 Phylogeny Evidence from morphological ( 形态学的 ), biochemical, and gene sequence

More information

Chapter 27: Evolutionary Genetics

Chapter 27: Evolutionary Genetics Chapter 27: Evolutionary Genetics Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand what the term species means to biology. 2. Recognize the various patterns

More information

Phylogenetics: Building Phylogenetic Trees

Phylogenetics: Building Phylogenetic Trees 1 Phylogenetics: Building Phylogenetic Trees COMP 571 Luay Nakhleh, Rice University 2 Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary model should

More information

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Molecular phylogeny How to infer phylogenetic trees using molecular sequences Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 2009 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues

More information

Constructing Evolutionary/Phylogenetic Trees

Constructing Evolutionary/Phylogenetic Trees Constructing Evolutionary/Phylogenetic Trees 2 broad categories: Distance-based methods Ultrametric Additive: UPGMA Transformed Distance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood

More information

doi: / _25

doi: / _25 Boc, A., P. Legendre and V. Makarenkov. 2013. An efficient algorithm for the detection and classification of horizontal gene transfer events and identification of mosaic genes. Pp. 253-260 in: B. Lausen,

More information

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Molecular phylogeny How to infer phylogenetic trees using molecular sequences Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 200 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues

More information

Modern Evolutionary Classification. Section 18-2 pgs

Modern Evolutionary Classification. Section 18-2 pgs Modern Evolutionary Classification Section 18-2 pgs 451-455 Modern Evolutionary Classification In a sense, organisms determine who belongs to their species by choosing with whom they will mate. Taxonomic

More information

Fitness constraints on horizontal gene transfer

Fitness constraints on horizontal gene transfer Fitness constraints on horizontal gene transfer Dan I Andersson University of Uppsala, Department of Medical Biochemistry and Microbiology, Uppsala, Sweden GMM 3, 30 Aug--2 Sep, Oslo, Norway Acknowledgements:

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

BINF6201/8201. Molecular phylogenetic methods

BINF6201/8201. Molecular phylogenetic methods BINF60/80 Molecular phylogenetic methods 0-7-06 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics

More information

a-fB. Code assigned:

a-fB. Code assigned: This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

Group activities: Making animal model of human behaviors e.g. Wine preference model in mice

Group activities: Making animal model of human behaviors e.g. Wine preference model in mice Lecture schedule 3/30 Natural selection of genes and behaviors 4/01 Mouse genetic approaches to behavior 4/06 Gene-knockout and Transgenic technology 4/08 Experimental methods for measuring behaviors 4/13

More information

Tree of Life iological Sequence nalysis Chapter http://tolweb.org/tree/ Phylogenetic Prediction ll organisms on Earth have a common ancestor. ll species are related. The relationship is called a phylogeny

More information

Computational Biology: Basics & Interesting Problems

Computational Biology: Basics & Interesting Problems Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information

More information

How to Use This Presentation

How to Use This Presentation How to Use This Presentation To View the presentation as a slideshow with effects select View on the menu bar and click on Slide Show. To advance through the presentation, click the right-arrow key or

More information

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University Phylogenetics: Building Phylogenetic Trees COMP 571 - Fall 2010 Luay Nakhleh, Rice University Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary

More information

CHAPTER 10 Taxonomy and Phylogeny of Animals

CHAPTER 10 Taxonomy and Phylogeny of Animals CHAPTER 10 Taxonomy and Phylogeny of Animals 10-1 10-2 Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Linnaeus and Taxonomy More than 1.5 million species of

More information

Supplementary material to Whitney, K. D., B. Boussau, E. J. Baack, and T. Garland Jr. in press. Drift and genome complexity revisited. PLoS Genetics.

Supplementary material to Whitney, K. D., B. Boussau, E. J. Baack, and T. Garland Jr. in press. Drift and genome complexity revisited. PLoS Genetics. Supplementary material to Whitney, K. D., B. Boussau, E. J. Baack, and T. Garland Jr. in press. Drift and genome complexity revisited. PLoS Genetics. Tree topologies Two topologies were examined, one favoring

More information

1 Introduction. Abstract

1 Introduction. Abstract CBS 530 Assignment No 2 SHUBHRA GUPTA shubhg@asu.edu 993755974 Review of the papers: Construction and Analysis of a Human-Chimpanzee Comparative Clone Map and Intra- and Interspecific Variation in Primate

More information

Introduction to Bioinformatics Integrated Science, 11/9/05

Introduction to Bioinformatics Integrated Science, 11/9/05 1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction

More information

Horizontal transfer and pathogenicity

Horizontal transfer and pathogenicity Horizontal transfer and pathogenicity Victoria Moiseeva Genomics, Master on Advanced Genetics UAB, Barcelona, 2014 INDEX Horizontal Transfer Horizontal gene transfer mechanisms Detection methods of HGT

More information