Methods for Microbiome Analysis
|
|
- Esmond Caldwell
- 5 years ago
- Views:
Transcription
1 Classroom Lecture 4 December 2015 Methods for Microbiome Analysis James R. Cole Director, RDP (Ribosomal Database Project) Center for Microbial Ecology Michigan State University East Lansing, Michigan U.S.A.
2 Michigan State University First land-grant university in USA, est teaching of practical agriculture, science, and engineering 50,000 students (11,000 graduate students) College of Agriculture & Natural Resources Dept. of Plant Soil and Microbial Sciences Center for Microbial Ecology Study microbes in the environment and their interactions
3 Some Recent Projects Databases and tools for microbial systematics and identification - NSF, DOE Standards development for metadata - NSF Effect of global warming on permafrost and soil carbon release - DOE Microbial interactions with priority pollutants -NIEHS Biofuel crops on marginal land - DOE, USDA Microbiome and establishment of ulcerative colitis NIH Antibiotic resistance gene transmission from farm animals to humans - USDA
4 Microbes are Dominant in Biosphere Biomass: Prokaryotes 4-6 x cells, containing: Pg of C (0.6-1 X plants) Pg of N (10 X plants) 9-14 Pg of P (10 X plants) Environments: Open ocean 0.12 x cells Soil 0.26 x cells Oceanic subsurface 3.5 x cells Terrestrial subsurface x cells Biodiversity: 400,000 4,000,000 species Whitman et al. (1998) PNAS 95:
5 Can you name these bacteria? From: Ch Terrestrial Bacteria from Agricultural Soils: By Masoomeh Shams-Ghahfarokhi, Sanaz Kalantari and Mehdi Razzaghi-Abyaneh DOI: /
6 This is how we think about diversity 5
7 Phylogenetic Tree of Life Three domains of life based on the work of Carl Woese and colleagues
8 Elucidation of the three domains Carl Woese ( ) of life Ribosomal RNA sequence as phylogenetic marker Discovered 3 rd kingdom Archaea and Bacteria separate domains 7
9 Ribosomes Universal Marker Ribosomes are the protein synthesis factories Subunits 30S 50S rrna 16S 23S 5S Core function present in all cellular organisms Very little evidence of horizontal gene transfer Historically easy to work with Purify by centrifugation and extract rrna. Now we use PCR to amplify from genomic DNA rrna genes have conserved regions interspersed with highly variable regions. Conserved regions used for both PCR primers and sequencing primers. 8
10 Secondary structure of small-subunit ribosomal RNA 5' end 3' end Image adapted from R. Gutell 9
11 Secondary structure of small-subunit ribosomal RNA 5' end 3' end Image adapted from R. Gutell 10
12 The 530 Loop of E. coli Loop Non-canonical G-U basepair Bulge Stem with canonical Watson-Crick base pairing 11
13 The 530 loop structure of six species 1 12
14 Phylogenetic tree using sequence information to represent evolutionary history 63 EschC MeOVanni ThlJanna PrnFreu AqxPyrop SflSalf
15 Diversity of uncultured organisms explored by rrna sequencing David A. Stahl, David J. Lane, Gary J. Olsen and Norman R. Pace Science, New Series, Vol. 224, No (Apr. 27, 1984), pp Published by: American Association for the Advancement of Science
16 Hydrothermal Vent Black Smoker 15
17 Metagenomics studies the genomes of organisms as communities iology/digestive_sys tem Microscopic algae in surface waters depend on nitrate, only available in deep waters. Credit: Kim Fulton- Bennett, MBARI Reference: K.S. Johnson, S.C. Riser, D.M. Karl. Nitrate supply from deep to nearsurface waters of the North Pacific subtropical gyre. Nature. Vol 465, Issue June, Figure credit: Ed Zaborski, University of Illinois. Adapted from House and Parmelee (1985). Soil Fertility in Organic Farming Systems: Much More than Plant Nutrition, Last Updated: April 04, 2011, View as web page, Author: Michelle Wander, U of Illinois 16
18 Microbes are Important in Human Health 17
19 18
20 Hess et al SCIENCE 331:
21 Explosion in rrna Sequencing By 2008, the majority of all bacterial sequences submitted to GenBank were 16S rrna sequences Less than 2% of these had a Latin name attached (valid or not) (R. Christen, 2008)
22 Most of our knowledge of bacterial diversity comes only from rrna sequences 21
23 Three examples from our work Sequence Nearest-Neighbor Match: (Last Common Ancestor) RDP Classifier: Naïve Bayesian FrameBot Dynamic Programming
24 Gene-Targeted Metagenomic Surveys Illumina myseq Use of key (tag) sequences for multi-sample sequencing Primers target informative region Sequence 10 to 100s of samples per run 23
25 24
26 25
27 26
28 27
29 Gene-Targeted Sequencing ~1x10^5 genome copies / ng DNA ~1x10^3 PCR amplification (~1.6^cycles) ~1x10^8 gene copies To produce 1x10^3 sequence reads 28
30 Phylogenetic Analysis vs. Classification Classification is conceptually easier to interpret. Often preferred when the groups are well understood. Phylogenetic methods are preferred for new groups or when the placement is not clear. 29
31 Machine Learning in Microbiology Machine learning focuses on prediction, based on known properties learned from the training data. Wikipedia 30
32 Taxonomic Classifiers Use concepts from Machine Learning Many algorithms can be applied Sequence Match Nearest-Neighbor (Last Common Ancestor) RDP Classifier Naïve Bayesian 31
33 Can Represent a Sequence as a Set of Overlapping k-mers GAAGCACCGGCUAACUCCGUGCCAGCAGCCGCGGUAAUACGGAGGGUGCAAG GAAGCAC UCCGUGC GCGGUAA AAGCACC CCGUGCC CGGUAAU AGCACCG CGUGCCA GGUAAUA GCACCGG GUGCCAG GUAAUAC CACCGGC UGCCAGC UAAUACG ACCGGCU GCCAGCA AAUACGG CCGGCUA CCAGCAG AUACGGA CGGCUAA CAGCAGC UACGGAG GGCUAAC AGCAGCC ACGGAGG GCUAACU GCAGCCG CGGAGGG CUAACUC CAGCCGC GGAGGGU UAACUCC AGCCGCG GAGGGUG AACUCCG GCCGCGG AGGGUGC ACUCCGU CCGCGGU GGGUGCA CUCCGUG CGCGGUA GGUGCAA 32
34 SeqMatch NN Nearest-neighbor classifier k-nn k nearest-neighbors classifier LCA Last common ancestor (Lowest common ancestor) 33
35 GAAGCAC AAGCACC AGCACCG GCACCGG CACCGGC ACCGGCU CCGGCUA CGGAGGG GGAGGGU AGGGUGC GGGUGCA GGUGCAA GUGCAAG CGGCUAA GGCUAAC GCUAACU CUAACUC UAACUCC AACUCCG ACUCCGU CUCCGUG UCCGUGC CCGUGCC CGUGCCA GUGCCAG UGCCAGC GCCAGCA CCAGCAG CAGCAGC AGCAGCC GCAGCCG CAGCCGC AGCCGCG GCCGCGG CCGCGGU CGCGGUA GCGGUAA CGGUAAU GGUAAUA GUAAUAC UAAUACG AAUACGG AUACGGA UACGGAG ACGGAGG GAAGGGA AAGGGAC AGGGACG GGGACGG GGACGGC GACGGCU ACGGCUA CGGAGGU GGAGGUC GAGGUCC AGGUCCC GGUCCCA GUCCCAA UCCCAAG SeqMatch Comparison E. coli A. pyrophilus 34
36 SeqMatch Math AB 35
37 LCA Last Common Ancestor Bacillaceae Bacillus Bacillus thuringiensis; WS 2625; Z84587 Bacillus cereus; AF Bacillus sp. No.49; AB Bacillus clausii; AJ Bacillus flexus (T); IFO15715; AB Bacillales Geobacillus Geobacillus stearothermophilus; DSM 22T; AJ Geobacillus caldoxylosilyticus; B70; AJ Listeriaceae Brochothrix Brochothrix campestris; NBRC 15547; AB Brochothrix thermosphacta; MF 88; AY Brochothrix campestris; DSMZ 4712; AY Listeria Listeria ivanovii (T); CLIP12229; X98529 Listeria grayi; NCTC10812; X
38 LCA Last Common Ancestor Bacillaceae Bacillus Bacillus thuringiensis; WS 2625; Z84587 Bacillus cereus; AF Bacillus sp. No.49; AB Bacillus clausii; AJ Bacillus flexus (T); IFO15715; AB Bacillales Geobacillus Geobacillus stearothermophilus; DSM 22T; AJ Geobacillus caldoxylosilyticus; B70; AJ Listeriaceae Brochothrix Brochothrix campestris; NBRC 15547; AB Brochothrix thermosphacta; MF 88; AY Brochothrix campestris; DSMZ 4712; AY Listeria Listeria ivanovii (T); CLIP12229; X98529 Listeria grayi; NCTC10812; X
39 LCA Last Common Ancestor Bacillaceae Bacillus Bacillus thuringiensis; WS 2625; Z84587 Bacillus cereus; AF Bacillus sp. No.49; AB Bacillus clausii; AJ Bacillus flexus (T); IFO15715; AB Bacillales Geobacillus Geobacillus stearothermophilus; DSM 22T; AJ Geobacillus caldoxylosilyticus; B70; AJ Listeriaceae Brochothrix Brochothrix campestris; NBRC 15547; AB Brochothrix thermosphacta; MF 88; AY Brochothrix campestris; DSMZ 4712; AY Listeria Listeria ivanovii (T); CLIP12229; X98529 Listeria grayi; NCTC10812; X
40 LCA Last Common Ancestor Bacillaceae Bacillus Bacillus thuringiensis; WS 2625; Z84587 Bacillus cereus; AF Bacillus sp. No.49; AB Bacillus clausii; AJ Bacillus flexus (T); IFO15715; AB Bacillales Geobacillus Geobacillus stearothermophilus; DSM 22T; AJ Geobacillus caldoxylosilyticus; B70; AJ Listeriaceae Brochothrix Brochothrix campestris; NBRC 15547; AB Brochothrix thermosphacta; MF 88; AY Brochothrix campestris; DSMZ 4712; AY Listeria Listeria ivanovii (T); CLIP12229; X98529 Listeria grayi; NCTC10812; X
41 RDP Classifier Naïve Bayesian classifier Uses training data to estimate k-mer likelihood Assignment based on genus for which k-mers are most common 40
42 41
43 Bayesian Theory Incorporates a priori probability to calculate that, after seeing the evidence E, the a priori hypothesis H is true Likelihood evidence P(H E) = P(E H)P(H) P(E) A priori probability inferred before evidence was observed A priori probability of observing E 42
44 Find the genus that maximizes Genus G P(G V) P(G V) = P(V G)P(G) P(V) V = Set of 8-mer words in the query sequence S 43
45 Find the genus that maximizes P(G V) Assume all genera equally likely [P(G i )=P(G j )] P(G V) = P(V G)P(G) P(V) P(V) is not dependent on G. To maximize P(G V) Select the Genus that maximizes P(V G) 44
46 Estimating P(V G) CGGCUAA GUAAUAC ACGGAGG GCCGCGG B. subtilis + + B. clausii + + B. smithii + + B. flexus B. ruris + + Probability: 99% 80% 40% 01% P(V Bacillus) = ( ) = or 1 in 316 P(V Sinobaca) = ( ) = or 1 in k mer Set V When you have eliminated the impossible, whatever remains, however improbable, must be the truth. Sir Arthur Conan Doyle 45
47 Genes Beyond rrna Faster phylogenetic markers Full-length 16S rrna resolves genera, not species. Genes encoding important functions Ecologically important genes for carbon and nitrogen cycling, biogeochemical processes. Lateral gene transfer Gene phylogeny may not match the organism phylogeny.
48 Ribosomal RNA Shows the Framework
49 Functional Genes Show the Details Akasaka K Tower Residence from
50 Biogeochemical Cycles Nitrogen Cycle: Factors affecting nitrogen fixation, denitrification (greenhouse gas) Carbon Cycle: Factors affecting soil carbon sequestration, overall soil fertility, biofuel production Source: EPA 49
51 The nifh Gene is a Key Gene in Nitrogen Fixation Example: Trichodesmium thiebautii Forward Amplicon ~ 360 bp Start:112 End: 471 Reverse ~ 882 bp Zehr et al., Forward TGYGAYCCNAARGCNGA Reverse ADNGCCATCATYTCNCC Poly et al., Forward TGCGAYCCSAARGCBGACTC Reverse ATSGCCATCATYTCRCCGGA 50
52 National Ecological Observatory Network
53 NEON nifh Sequencing State Ecological Region Samples Sequences Alaska Taiga ,294 Florida Southeast 17 79,619 Hawaii Pacific Tropical ,824 Utah Great Basin 8 19,105 52
54 Frameshift Correction Standard: A G A G T G A r g V a l 454 Read: Frame 1: Frame 2: Frame 3: C G G g G T A A r g G l y - - G l y V a l - - G l y
55 RDP FrameBot > Sbjct Query NuclLen AlignLen %Identity Score Frameshifts Reversed STATS GF040U102B1BAI false Sbjct 76 GGMLKPIEGGTYEVNEAMVEDLKIGVQGPHASNLGGILSNEIAKEIGKRAFIVDPVVVDE 135 Query 61 GGMLKPIEGGTYEVNEAMVEDLKIGFEGPHAXNLGGILSNEIAKKLGKRAFIVDPVVVDX 120 Frame ggacacagggatggaggagggtaa4tggccg2atggattagagaatgaagtaggcgggg2 ggttactaggcaataacttaatat tagcac atggttcaatcaatgagctttacttta tagtgaaatattactatgtgcaat caaatt cataaattaaagaatgaataatgtttt insertion deletion mismatches 54
56 RDP FrameBot Wang et al., 2013 Extends a dynamic programming algorithm proposed by Guan et al., Comput Appl Biosci. 12:31-40 Requires a set of subject protein sequences Produces an optimal alignment between the query DNA and the subject protein sequences in the presence of frameshifts Returns the frameshift-corrected protein and DNA query sequences Reports the protein pairwise alignment with the best score 55
57 FrameBot Performance using reference sequences at various percentages of identity to query sequences Wang Q et al. mbio 2013; doi: /mbio
58 Using FrameBot to Find Nearest Known Neighbor Sequence 206 nifh references chosen from 1223 nifh genes from cultured bacteria, clustered at 10% identity. Naïve approach requires pairwise comparison with each reference, but assignment requires far more references than frameshift correction. Using a metric indexing strategy to reduce number of pairwise comparisons. 57
59 FrameBot Metric Index Requires a distance measure with metric properties (including triangle inequality). Simple edit distance is metric (global). Complex edit distance is only metric if the scoring matrix is a metric. Blosum 62 is not metric, but a metric approximation is available. Addition of frameshifts violates triangle inequality. In practice, can compensate by slightly relaxing indexing strategy. 58
60 R 2 Q R 3 δ R 1 59
61 Processing NEON nifh Samples 8 hrs to process 1.1 million using index. 92.7% within 90% identity of reference. 99.9% within 80% identity of reference. 12.7% reads had at least 1 frameshift. About half of the reference sequences were closest matches to at least one read. 60
62 Relative Abundance of NEON Reads grouped by nearest matches at the phylum and class levels, averaged for each site (observatory) as indicated by state Wang Q et al. mbio 2013; doi: /mbio
63 Principal Component Analysis of NEON Samples A B Wang Q et al. mbio 2013; doi: /mbio
64 RDP Specialized Tools for Environmental Gene Analysis Ribosomal Database Project: data and tools for high throughput rrna analysis James R. Cole, Qiong Wang, Jordan A. Fish, Benli Chai, Donna M. McGarrell, Yanni Sun, C. Titus Brown, Andrea Porras-Alfaro, Cheryl R. Kuske and James M. Tiedje (2014) Nucl. Acids Res. 41:D633-D
65 RDPʼs Popular Online Tools Interactive tools Browsers browse and select from taxonomic hierarchy; powerful search and selection features SeqMatch finds nearest neighbor, more accurate than BLAST RDP Classifier places sequences into bacterial taxonomy; fast and accurate ProbeMatch fast search algorithm, limit searches to specific regions RDPipeline and FunGene tools for gene targeted metagenomics MIMARKS GoogleSheets helps organizing standards Compliant metadata myrdp space upload and analyze your own 16S sequences in your private space 64
66 RDP Fungene Offers databases of many common ecofunctional genes and proteins Integrated tools allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences Specialized tools to process coding gene amplicon data Fish, J. A., B. Chai, Q. Wang, Y. Sun, C. T. Brown, J. M. Tiedje, and J. R. Cole FunGene: the Functional Gene Pipeline and Repository. Front. Microbiol. 4:291 65
67 RDP Pipelines Initial Process barcode sorting and quality filtering raw reads, includes paired-end reads assembly Defined Community Analysis evaluate error patterns and rates on reads amplified from a mix of known organisms Ecological Measurement Jaccard & Sørensen Indices, Shannon & Chao1 Indices, Rarefaction Aligner Infernal model-based aligners for 16S and fungal 28S Complete-Linkage Clustering Cluster results in RDP, BIOM and other formats Updated d RDP Classifier places sequences into bacterial and fungal taxonomies; easy to train for additional genes 66
68 RDPTools Collection of commonly used RDP Tools for high throughput sequence processing and analysis. Package classifier RDP extensible sequence classifier for fungal large subunit rrna, bacterial & archaeal 16S rrna. Java ReadSeq Sequence file reader and format converter. Java Xander_assembler A gene targeted assembler tool for metagenomic sequences. Shell AlignmentTools Tools for pairwise sequence comparison, distance calculation, and hidden markov model sequence scoring (using HMMER3 models). Java Framebot Dynamic programming based frameshift detection and correction tool with nearest neighbor classification. Java Clustering RDP memory constrained hierarchical clustering tools. Java fungene_pipeline Scripts and resources for analyzing sequence data for select eco functional genes. Python KmerFilter Tool for kmer analysis. Java TaxonomyTree Taxonomy tree building and traversal utility tool. Java SeqFilters Tool for sorting and selecting nucleotide sequences according to given filters and tags. Java SequenceMatch K mer based sequence matching tool to calculate nearest neighbors of sequences. Java ProbeMatch Tool for finding (and removing) DNA/RNA primers in sequence reads. Java AbundanceStats Tool for generating various ecological abundance statistics. Java FungeneUtils Package of tools for protein sequence analysis. Java SOAP examples Code samples from various languages for interacting with RDP soap services. Perl gfclassify A gene family classifier that allows for fast and accurate classification of amplicons (or open reading frame) nucleotide sequences. C and Biopython
69 End of Lecture 68
MiGA: The Microbial Genome Atlas
December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From
More informationMicrobiome: 16S rrna Sequencing 3/30/2018
Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics
More informationComparison of Three Fugal ITS Reference Sets. Qiong Wang and Jim R. Cole
RDP TECHNICAL REPORT Created 04/12/2014, Updated 08/08/2014 Summary Comparison of Three Fugal ITS Reference Sets Qiong Wang and Jim R. Cole wangqion@msu.edu, colej@msu.edu In this report, we evaluate the
More informationTaxonomical Classification using:
Taxonomical Classification using: Extracting ecological signal from noise: introduction to tools for the analysis of NGS data from microbial communities Bergen, April 19-20 2012 INTRODUCTION Taxonomical
More informationMicrobes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationMicrobial Taxonomy. Slowly evolving molecules (e.g., rrna) used for large-scale structure; "fast- clock" molecules for fine-structure.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationMicrobial Taxonomy. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationNature Biotechnology: doi: /nbt Supplementary Figure 1. Detailed overview of the primer-free full-length SSU rrna library preparation.
Supplementary Figure 1 Detailed overview of the primer-free full-length SSU rrna library preparation. Detailed overview of the primer-free full-length SSU rrna library preparation. Supplementary Figure
More informationIntroduction to Evolutionary Concepts
Introduction to Evolutionary Concepts and VMD/MultiSeq - Part I Zaida (Zan) Luthey-Schulten Dept. Chemistry, Beckman Institute, Biophysics, Institute of Genomics Biology, & Physics NIH Workshop 2009 VMD/MultiSeq
More informationA. Incorrect! In the binomial naming convention the Kingdom is not part of the name.
Microbiology Problem Drill 08: Classification of Microorganisms No. 1 of 10 1. In the binomial system of naming which term is always written in lowercase? (A) Kingdom (B) Domain (C) Genus (D) Specific
More informationPipelining RDP Data to the Taxomatic Background Accomplishments vs objectives
Pipelining RDP Data to the Taxomatic Timothy G. Lilburn, PI/Co-PI George M. Garrity, PI/Co-PI (Collaborative) James R. Cole, Co-PI (Collaborative) Project ID 0010734 Grant No. DE-FG02-04ER63932 Background
More informationAssigning Taxonomy to Marker Genes. Susan Huse Brown University August 7, 2014
Assigning Taxonomy to Marker Genes Susan Huse Brown University August 7, 2014 In a nutshell Taxonomy is assigned by comparing your DNA sequences against a database of DNA sequences from known taxa Marker
More informationExploring Microbes in the Sea. Alma Parada Postdoctoral Scholar Stanford University
Exploring Microbes in the Sea Alma Parada Postdoctoral Scholar Stanford University Cruising the ocean to get us some microbes It s all about the Microbe! Microbes = microorganisms an organism that requires
More informationGrundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson
Grundlagen der Bioinformatik, SS 10, D. Huson, April 12, 2010 1 1 Introduction Grundlagen der Bioinformatik Summer semester 2010 Lecturer: Prof. Daniel Huson Office hours: Thursdays 17-18h (Sand 14, C310a)
More informationBacterial Communities in Women with Bacterial Vaginosis: High Resolution Phylogenetic Analyses Reveal Relationships of Microbiota to Clinical Criteria
Bacterial Communities in Women with Bacterial Vaginosis: High Resolution Phylogenetic Analyses Reveal Relationships of Microbiota to Clinical Criteria Seminar presentation Pierre Barbera Supervised by:
More informationMicrobes and you ON THE LATEST HUMAN MICROBIOME DISCOVERIES, COMPUTATIONAL QUESTIONS AND SOME SOLUTIONS. Elizabeth Tseng
Microbes and you ON THE LATEST HUMAN MICROBIOME DISCOVERIES, COMPUTATIONAL QUESTIONS AND SOME SOLUTIONS Elizabeth Tseng Dept. of CSE, University of Washington Johanna Lampe Lab, Fred Hutchinson Cancer
More informationMicrobial Taxonomy and the Evolution of Diversity
19 Microbial Taxonomy and the Evolution of Diversity Copyright McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or display. 1 Taxonomy Introduction to Microbial Taxonomy
More informationTitle ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses
More informationTaxonomy. Content. How to determine & classify a species. Phylogeny and evolution
Taxonomy Content Why Taxonomy? How to determine & classify a species Domains versus Kingdoms Phylogeny and evolution Why Taxonomy? Classification Arrangement in groups or taxa (taxon = group) Nomenclature
More informationMicrobial Diversity. Yuzhen Ye I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University
Microbial Diversity Yuzhen Ye (yye@indiana.edu) I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University Contents Microbial diversity Morphological, structural,
More information10 Biodiversity Support. AQA Biology. Biodiversity. Specification reference. Learning objectives. Introduction. Background
Biodiversity Specification reference 3.4.5 3.4.6 3.4.7 Learning objectives After completing this worksheet you should be able to: recall the definition of a species and know how the binomial system is
More informationProbing diversity in a hidden world: applications of NGS in microbial ecology
Probing diversity in a hidden world: applications of NGS in microbial ecology Guus Roeselers TNO, Microbiology & Systems Biology Group Symposium on Next Generation Sequencing October 21, 2013 Royal Museum
More informationThis is a repository copy of Microbiology: Mind the gaps in cellular evolution.
This is a repository copy of Microbiology: Mind the gaps in cellular evolution. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/114978/ Version: Accepted Version Article:
More informationBacillus anthracis. Last Lecture: 1. Introduction 2. History 3. Koch s Postulates. 1. Prokaryote vs. Eukaryote 2. Classifying prokaryotes
Last Lecture: Bacillus anthracis 1. Introduction 2. History 3. Koch s Postulates Today s Lecture: 1. Prokaryote vs. Eukaryote 2. Classifying prokaryotes 3. Phylogenetics I. Basic Cell structure: (Fig.
More informationTaxonomy and Clustering of SSU rrna Tags. Susan Huse Josephine Bay Paul Center August 5, 2013
Taxonomy and Clustering of SSU rrna Tags Susan Huse Josephine Bay Paul Center August 5, 2013 Primary Methods of Taxonomic Assignment Bayesian Kmer Matching RDP http://rdp.cme.msu.edu Wang, et al (2007)
More informationPHYLOGENY AND SYSTEMATICS
AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study
More informationChapter 19. Microbial Taxonomy
Chapter 19 Microbial Taxonomy 12-17-2008 Taxonomy science of biological classification consists of three separate but interrelated parts classification arrangement of organisms into groups (taxa; s.,taxon)
More informationMicrobial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B
Microbial Diversity and Assessment (II) Spring, 007 Guangyi Wang, Ph.D. POST03B guangyi@hawaii.edu http://www.soest.hawaii.edu/marinefungi/ocn403webpage.htm General introduction and overview Taxonomy [Greek
More informationOther resources. Greengenes (bacterial) Silva (bacteria, archaeal and eukarya)
General QIIME resources http://qiime.org/ Blog (news, updates): http://qiime.wordpress.com/ Support/forum: https://groups.google.com/forum/#!forum/qiimeforum Citing QIIME: Caporaso, J.G. et al., QIIME
More informationName: Class: Date: ID: A
Class: _ Date: _ Ch 17 Practice test 1. A segment of DNA that stores genetic information is called a(n) a. amino acid. b. gene. c. protein. d. intron. 2. In which of the following processes does change
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationMicrobiota: Its Evolution and Essence. Hsin-Jung Joyce Wu "Microbiota and man: the story about us
Microbiota: Its Evolution and Essence Overview q Define microbiota q Learn the tool q Ecological and evolutionary forces in shaping gut microbiota q Gut microbiota versus free-living microbe communities
More information8/23/2014. Phylogeny and the Tree of Life
Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major
More informationAmy Driskell. Laboratories of Analytical Biology National Museum of Natural History Smithsonian Institution, Wash. DC
DNA Barcoding Amy Driskell Laboratories of Analytical Biology National Museum of Natural History Smithsonian Institution, Wash. DC 1 Outline 1. Barcoding in general 2. Uses & Examples 3. Barcoding Bocas
More informationPGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species
PGA: A Program for Genome Annotation by Comparative Analysis of Maximum Likelihood Phylogenies of Genes and Species Paulo Bandiera-Paiva 1 and Marcelo R.S. Briones 2 1 Departmento de Informática em Saúde
More informationMicrobiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms
1 2 Bergey s Manual of Systematic Bacteriology differs from Bergey s Manual of Determinative Bacteriology in that the former a. groups bacteria into species. b. groups bacteria according to phylogenetic
More informationAmplicon Sequencing. Dr. Orla O Sullivan SIRG Research Fellow Teagasc
Amplicon Sequencing Dr. Orla O Sullivan SIRG Research Fellow Teagasc What is Amplicon Sequencing? Sequencing of target genes (are regions of ) obtained by PCR using gene specific primers. Why do we do
More informationAn Automated Phylogenetic Tree-Based Small Subunit rrna Taxonomy and Alignment Pipeline (STAP)
An Automated Phylogenetic Tree-Based Small Subunit rrna Taxonomy and Alignment Pipeline (STAP) Dongying Wu 1 *, Amber Hartman 1,6, Naomi Ward 4,5, Jonathan A. Eisen 1,2,3 1 UC Davis Genome Center, University
More informationBiological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor
Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms
More informationChapter 19: Taxonomy, Systematics, and Phylogeny
Chapter 19: Taxonomy, Systematics, and Phylogeny AP Curriculum Alignment Chapter 19 expands on the topics of phylogenies and cladograms, which are important to Big Idea 1. In order for students to understand
More informationMETHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.
Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern
More informationMacroevolution Part I: Phylogenies
Macroevolution Part I: Phylogenies Taxonomy Classification originated with Carolus Linnaeus in the 18 th century. Based on structural (outward and inward) similarities Hierarchal scheme, the largest most
More informationComputational Biology: Basics & Interesting Problems
Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information
More informationHandling Fungal data in MoBeDAC
Handling Fungal data in MoBeDAC Jason Stajich UC Riverside Fungal Taxonomy and naming undergoing a revolution One fungus, one name http://www.biology.duke.edu/fungi/ mycolab/primers.htm http://www.biology.duke.edu/fungi/
More informationSUPPLEMENTARY INFORMATION
Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,
More informationContent Descriptions Based on the Georgia Performance Standards. Biology
Content Descriptions Based on the Georgia Performance Standards Biology Introduction The State Board of Education is required by Georgia law (A+ Educational Reform Act of 2000, O.C.G.A. 20-2-281) to adopt
More informationIntroduction to the SNP/ND concept - Phylogeny on WGS data
Introduction to the SNP/ND concept - Phylogeny on WGS data Johanne Ahrenfeldt PhD student Overview What is Phylogeny and what can it be used for Single Nucleotide Polymorphism (SNP) methods CSI Phylogeny
More informationCentrifuge: rapid and sensitive classification of metagenomic sequences
Centrifuge: rapid and sensitive classification of metagenomic sequences Daehwan Kim, Li Song, Florian P. Breitwieser, and Steven L. Salzberg Supplementary Material Supplementary Table 1 Supplementary Note
More informationTest Bank for Microbiology A Systems Approach 3rd edition by Cowan
Test Bank for Microbiology A Systems Approach 3rd edition by Cowan Link download full: http://testbankair.com/download/test-bankfor-microbiology-a-systems-approach-3rd-by-cowan/ Chapter 1: The Main Themes
More informationFundamentals of Biology Valencia College BSC1010C
1 Fundamentals of Biology Valencia College BSC1010C 1 Studying Life Chapter objectives: What Is Biology? Is All Life on Earth Related? How Do Biologists Investigate Life? How Does Biology Influence Public
More information08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega
BLAST Multiple Sequence Alignments: Clustal Omega What does basic BLAST do (e.g. what is input sequence and how does BLAST look for matches?) Susan Parrish McDaniel College Multiple Sequence Alignments
More informationOrigins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life
The Tree of Life Chapter 26 Origins of Life The Earth formed as a hot mass of molten rock about 4.5 billion years ago (BYA) -As it cooled, chemically-rich oceans were formed from water condensation Life
More informationBergey s Manual Classification Scheme. Vertical inheritance and evolutionary mechanisms
Bergey s Manual Classification Scheme Gram + Gram - No wall Funny wall Vertical inheritance and evolutionary mechanisms a b c d e * * a b c d e * a b c d e a b c d e * a b c d e Accumulation of neutral
More informationCh 10. Classification of Microorganisms
Ch 10 Classification of Microorganisms Student Learning Outcomes Define taxonomy, taxon, and phylogeny. List the characteristics of the Bacteria, Archaea, and Eukarya domains. Differentiate among eukaryotic,
More informationBiology 160 Cell Lab. Name Lab Section: 1:00pm 3:00 pm. Student Learning Outcomes:
Biology 160 Cell Lab Name Lab Section: 1:00pm 3:00 pm Student Learning Outcomes: Upon completion of today s lab you will be able to do the following: Properly use a compound light microscope Discuss the
More informationobjective functions...
objective functions... COFFEE (Notredame et al. 1998) measures column by column similarity between pairwise and multiple sequence alignments assumes that the pairwise alignments are optimal assumes a set
More informationComparing Prokaryotic and Eukaryotic Cells
A prokaryotic cell Basic unit of living organisms is the cell; the smallest unit capable of life. Features found in all cells: Ribosomes Cell Membrane Genetic Material Cytoplasm ATP Energy External Stimuli
More informationCharacteristics of Life
UNIT 2 BIODIVERSITY Chapter 4- Patterns of Life Biology 2201 Characteristics of Life All living things share some basic characteristics: 1) living things are organized systems made up of one or more cells
More informationSPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together
SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups
More informationHonor pledge: I have neither given nor received unauthorized aid on this test. Name :
Midterm Exam #1 MB 451 : Microbial Diversity Honor pledge: I have neither given nor received unauthorized aid on this test. Signed : Date : Name : 1. What are the three primary evolutionary branches of
More informationand just what is science? how about this biology stuff?
Welcome to Life on Earth! Rob Lewis 512.775.6940 rlewis3@austincc.edu 1 The Science of Biology Themes and just what is science? how about this biology stuff? 2 1 The Process Of Science No absolute truths
More informationMicrobial Taxonomy. C. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy 1. Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eucaryote, is in a mess we are stuck with it for traditional
More informationEffects of Gap Open and Gap Extension Penalties
Brigham Young University BYU ScholarsArchive All Faculty Publications 200-10-01 Effects of Gap Open and Gap Extension Penalties Hyrum Carroll hyrumcarroll@gmail.com Mark J. Clement clement@cs.byu.edu See
More informationTest Bank for Microbiology A Systems Approach 3rd edition by Cowan
Test Bank for Microbiology A Systems Approach 3rd edition by Cowan Link download full: https://digitalcontentmarket.org/download/test-bank-formicrobiology-a-systems-approach-3rd-edition-by-cowan Chapter
More informationChapter 26. Phylogeny and the Tree of Life. Lecture Presentations by Nicole Tunbridge and Kathleen Fitzpatrick Pearson Education, Inc.
Chapter 26 Phylogeny and the Tree of Life Lecture Presentations by Nicole Tunbridge and Kathleen Fitzpatrick Investigating the Tree of Life Phylogeny is the evolutionary history of a species or group of
More information1 Abstract. 2 Introduction. 3 Requirements. 4 Procedure
1 Abstract None 2 Introduction The archaeal core set is used in testing the completeness of the archaeal draft genomes. The core set comprises of conserved single copy genes from 25 genomes. Coverage statistic
More informationRapid Learning Center Chemistry :: Biology :: Physics :: Math
Rapid Learning Center Chemistry :: Biology :: Physics :: Math Rapid Learning Center Presents Teach Yourself AP Biology in 24 Hours 1/37 *AP is a registered trademark of the College Board, which does not
More informationOutline Classes of diversity measures. Species Divergence and the Measurement of Microbial Diversity. How do we describe and compare diversity?
Species Divergence and the Measurement of Microbial Diversity Cathy Lozupone University of Colorado, Boulder. Washington University, St Louis. Outline Classes of diversity measures α vs β diversity Quantitative
More informationThe Road to the Six Kingdoms
Bio 2201 Unit 2 The Road to the Six Kingdoms A 2011study estimated there are about 8.6 million species on earth. Only 1.8 million species have been identified and named. *Chromista is a sub-kingdom group
More informationChad Burrus April 6, 2010
Chad Burrus April 6, 2010 1 Background What is UniFrac? Materials and Methods Results Discussion Questions 2 The vast majority of microbes cannot be cultured with current methods Only half (26) out of
More informationChapter 26 Phylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life Chapter focus Shifting from the process of how evolution works to the pattern evolution produces over time. Phylogeny Phylon = tribe, geny = genesis or origin
More informationGenomes and Their Evolution
Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from
More informationBioinformatics. Dept. of Computational Biology & Bioinformatics
Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS
More informationWhat can sequences tell us?
Bioinformatics What can sequences tell us? AGACCTGAGATAACCGATAC By themselves? Not a heck of a lot...* *Indeed, one of the key results learned from the Human Genome Project is that disease is much more
More informationMetagenomic analysis of spoiled potato and tomato and the use of the dominant bacterial species in plant growth studies
Metagenomic analysis of spoiled potato and tomato and the use of the dominant bacterial species in plant growth studies Khaya Ntushelo Department of Agriculture and Animal Health, University of South Africa,
More informationMicroscopy, Staining, and Classification
PowerPoint Lecture Presentations prepared by Mindy Miller-Kittrell, North Carolina State University C H A P T E R 4 Microscopy, Staining, and Classification Microscopy Light Microscopy 1) Bright-field
More informationCS612 - Algorithms in Bioinformatics
Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationThe practice of naming and classifying organisms is called taxonomy.
Chapter 18 Key Idea: Biologists use taxonomic systems to organize their knowledge of organisms. These systems attempt to provide consistent ways to name and categorize organisms. The practice of naming
More informationStepping stones towards a new electronic prokaryotic taxonomy. The ultimate goal in taxonomy. Pragmatic towards diagnostics
Stepping stones towards a new electronic prokaryotic taxonomy - MLSA - Dirk Gevers Different needs for taxonomy Describe bio-diversity Understand evolution of life Epidemiology Diagnostics Biosafety...
More informationPredicting Protein Functions and Domain Interactions from Protein Interactions
Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput
More informationThe Tree of Life. Chapter 17
The Tree of Life Chapter 17 1 17.1 Taxonomy The science of naming and classifying organisms 2000 years ago Aristotle Grouped plants and animals Based on structural similarities Greeks and Romans included
More informationPostgraduate teaching for the next generation of taxonomists
Postgraduate teaching for the next generation of taxonomists Alfried P. Vogler Professor of Molecular Systematics Imperial College London and Natural History Museum MSc in Taxonomy and Biodiversity MRes
More informationChapter 17. Table of Contents. Objectives. Taxonomy. Classifying Organisms. Section 1 Biodiversity. Section 2 Systematics
Classification Table of Contents Objectives Relatebiodiversity to biological classification. Explainwhy naturalists replaced Aristotle s classification system. Identifythe main criterion that Linnaeus
More informationA Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems
A Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems Tao Yuan, Asst/Prof. Stephen Tiong-Lee Tay and Dr Volodymyr Ivanov School of Civil and Environmental
More informationWhat examples can you think of?
What examples can you think of? Geocentrism Alchemy Heliocentrism: Copernicus, Kepler, Newton, Galileo Nature of the chemical bond (Rutherford, Pauling ) Aristotelian view of the biosphere Woese/Pace (subject
More informationBioinformatics Exercises
Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted
More informationCLASSIFICATION OF LIVING THINGS
CLASSIFICATION OF LIVING THINGS 1. Taxonomy The branch of biology that deals with the classification of living organisms About 1.8 million species of plants and animals have been identified. Some scientists
More informationSec$on 9. Evolu$onary Rela$onships
Sec$on 9 Evolu$onary Rela$onships Sec$on 9 Learning Goals Explain why the ribosomal 16S gene is a good marker for molecular phylogene$c comparisons. Be able to interpret a phylogene$c tree. Explain the
More informationFriday April 8 th 2016
Friday April 8 th 2016 Warm-Up Select a highlighter. Get a bottle of glue. Update your Table of Contents (see whiteboard). Today In Science Classification Presentation and Notes How many different types
More informationPart III - Bioinformatics Study of Aminoacyl trna Synthetases. VMD Multiseq Tutorial Web tools. Perth, Australia 2004 Computational Biology Workshop
Part III - Bioinformatics Study of Aminoacyl trna Synthetases VMD Multiseq Tutorial Web tools Perth, Australia 2004 Computational Biology Workshop Multiple Sequence Alignments The aminoacyl-trna synthetases,
More informationCreating a Dichotomous Key
Dichotomous Keys A tool used that allows users to determine the identity of unknown species Keys consist of a series of choices, where the user selects from a series of connected pairs Each pair of choices
More informationUnit 5: Taxonomy. KEY CONCEPT Organisms can be classified based on physical similarities.
KEY CONCEPT Organisms can be classified based on physical similarities. Linnaeus developed the scientific naming system still used today. Taxonomy is the science of naming and classifying organisms. White
More informationAlternative tools for phylogeny. Identification of unique core sequences
Alternative tools for phylogeny Identification of unique core sequences Workshop on Whole Genome Sequencing and Analysis, 19-21 Mar. 2018 Learning objective: After this lecture you should be able to account
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationBiology 2.1 Taxonomy: Domain, Kingdom, Phylum. ICan2Ed.com
Biology 2.1 Taxonomy: Domain, Kingdom, Phylum ICan2Ed.com Taxonomy is the scientific field that catalogs, describes, and names living organisms. The way to divide living organisms into groups based on
More informationChapter 18 Systematics: Seeking Order Amidst Diversity
Chapter 18 Systematics: Seeking Order Amidst Diversity Bird Diversity in Indonesia Chapter 18 At a Glance 18.1 How Are Organisms Named and Classified? 18.2 What Are the Domains of Life? 18.1 How Are Organisms
More informationIntroduction to the Study of Life
1 Introduction to the Study of Life Bio 103 Lecture GMU Dr. Largen 2 Outline Biology is the science of life The process of science Evolution, unity and diversity Core principles of biology 3 The Science
More informationA Bayesian taxonomic classification method for 16S rrna gene sequences with improved species-level accuracy
Gao et al. BMC Bioinformatics (2017) 18:247 DOI 10.1186/s12859-017-1670-4 SOFTWARE Open Access A Bayesian taxonomic classification method for 16S rrna gene sequences with improved species-level accuracy
More informationUsing Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics
Using Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics Tandy Warnow Founder Professor of Engineering The University of Illinois at Urbana-Champaign http://tandy.cs.illinois.edu
More information