Taxonomy and Clustering of SSU rrna Tags. Susan Huse Josephine Bay Paul Center August 5, 2013
|
|
- Noel Chase
- 5 years ago
- Views:
Transcription
1 Taxonomy and Clustering of SSU rrna Tags Susan Huse Josephine Bay Paul Center August 5, 2013
2 Primary Methods of Taxonomic Assignment Bayesian Kmer Matching RDP Wang, et al (2007) Appl Environ Microbiol. 73(16): Sequence Matching GAST Huse, et al (2008) PLoS Genetics 4: e
3 k-mer indexing TGGTCTTGACATCCACAGAACTTTCCAGAGA TGGATTGGTGCCTTCGGGAACTGTGAGAC! Ex. 8-mer indexing TGGTCTTG! GGTCTTGA! GTCTTGAC! TCTTGACA! CTTGACAT! TTGACATC! TGACATCC! GACATCCA! ACATCCAC! CATCCACA!
4 RDP Web Results Reports: Taxonomy at each level Bootstrap value at each level Min 80% by default Domain Phylum Class Order Family Genus Root[100%] Bacteria[100%] Proteobacteria [100%] Alphaproteobacteria[100%] Rhodospirillales[100%] Rhodospirillaceae[100%] Dongia[99%]
5 Bootstrap Confidence Estimation the number of times a genus was selected out of 100 bootstrap trials was used as an estimate of confidence in the assignment to that genus.
6 Bootstrap Cutoff Values 0% 50% 80% V3 % classified to genus 100% 92.4% 82.3% % classified to correct genus 92.0% 95.0% 98.1% V4 0% 50% 80% % classified to genus 100% 97% 87.9% % classified to correct genus 92.8% 94.5% 95.7% V6 0% 50% 80% % classified to genus 100% 73.5% 40.4% % classified to correct genus 79.0% 96.5% 98.7% Based on 7,028 human gut sequences, RDP-classified full-length then cut and reclassified
7 Bootstrap Cutoff Values 0% 50% 80% V3 % classified to genus 100% 92.4% 82.3% % classified to correct genus 92.0% 95.0% 98.1% Assume 100 sequences: At 80%: 82 are classified to genus 80 seq are identified correctly (82 X 0.98 = 80) 2 seq are indentified incorrectly At 50%: 92 are classified to genus (10 more) 87 seq are correct (92 X 0.95 = 87) (7 more) 5 seq are incorrect (3 more) 30% of the incremental genus assignments are incorrect.
8 GAST Global Assignment of Sequence Taxonomy Sequence matching to nearest neighbor(s) in a reference database. Uses global alignment (entire length of amplicon) rather than local alignment (fragments) Distance is defined by sequence alignment.
9 GAST R1: TGGTCTTGACATCCACAGAT! Q: TGGTCTTGACATCCACAGAT! TGGTCTTGACATCGACAGAT! TGGTCTTGACATCTACAGAT! R2: TGGTCTTGGCATCTACAGAT! RefID: R1,R2 Distance: 0.05
10 GAST Flowchart High Quality 16S tags Nearest RefSeq(s) and GAST Distance RefDB (RefV6, RefV3V5, RefSSU) Using tags cut to match primers is much more efficient Consensus Taxonomy (2/3s majority)
11 Ex. Consensus Calculation 1: Firmicutes; Clostridia; Clostridiales; Lachnospiraceae (1%) 4: Firmicutes; Clostridia; Clostridiales; Clostridiaceae; Clostridium (40%) 5: Firmicutes; Clostridia; Clostridiales; Clostridiaceae; Clostridium; perfringens (50%) 1. Lowest rank = species 2. Calculate voting: 5 / 10= 50% (less than 66%) 3. Next rank = genus 4. Calculate genus voting: (4+5) / 10 = 90% 5. Assign taxonomy to genus level
12 GAST Distance GAST does not report a bootstrap value. GAST reports the distance to the nearest sequence. If distance = 0, then good accuracy If distance = 0.05 (5%) then likely same family and maybe genus, but not likely species.
13 Genus Level Comparison V3 N = 299,044 Other = 99 V6 N = 322,971 Other = 82 Full-Length N = 5,519 Other = 26 Human gut microbiota with 250nt V3, 60nt V6, 1000nt FL
14 Reference Database Considerations 1. Size of the database Does it contain reference sequences similar to your data? 2. Taxonomy of the database Are the references classified to genus or species? 3. Quality of the database Are there chimeras or low-quality sequences?
15 Example Reference Databases 1. SILVA database 2. RDP training set 3. Annotated Greengenes 4. Site specific: HOMD and CORE for oral
16 Specificity in RefSSU (SILVA) RefHVR sequences mapping V3-V5 V6-V4 1 taxon (unique mapping) 97.1% 97.5% 2 taxa (ambiguous mapping) 2.3% 2.0% 2 taxa by depth only (same lineage) (unique lineage mapping) 1.9% (99%) 1.8% (99.3%) If genus, unique genus 99.7% 99.8% If species, unique species 93.1% 93.6% Assigning taxonomy is only as good as your reference
17 BLAST to nt Don t do it!!! Top BLAST hit can lead to unexpected results: hits to unclassified, sources other than microbial SSU rrna partial hits (local alignment).
18 Methods Comparison RDP is considered a standard is available on their website or can be used locally and through mothur and QIIME Works better on longer sequences, not as well on shorter sequences. GAST works well for shorter sequences (V6) can go to species depending on the reference database BLAST to nt often returns incorrect taxonomy, not good for pipelines
19 Taxonomic Consistency For meaningful community comparisons, taxonomic names must be consistent: Names - one organism should have only one name Levels for systematic comparisons, need consistent levels: Kingdom;Phylum;Class;Order;Family;Genus;Species
20 Taxonomic Names Bergey s Taxonomic Outline manual of taxonomic names for bacteria List of Prokaryotic names with Standing in the Nomenclature (vetting process) NCBI similar taxonomy, but multiple subs (subclass, suborder, subfamily, tribe) Fungi UNITE (a work in progress) Archaea also a work in progress
21 Sources of Error in Taxonomic Analyses Primer bias Chimeras Non-16S tag amplified Discovery of novel 16S Unrepresented in reference database Low-quality references Taxonomy not available Incorrect taxonomy Ambiguous hypervariable sequence Reference biased toward most studied
22 Why OTUs? Because we can! We can t: do whole genome sequencing of communities assign complete taxonomic names calculate pylogenetic trees from millions of short tags Need to develop new methods that are more biologically or evolutionarily meaningful, such as oligotyping Come back in 10 years
23 Different clustering algorithms have very different effects on the size and number of OTUs created
24 Primary Methods complete linkage - no two sequences in a cluster are farther apart than the clustering width single linkage - each sequence is within the clustering width of at least one other sequence in the cluster average linkage - the average distance from a sequence to every other sequence is less than the width. Dependent on input order. greedy clusters - test sequentially and incorporate sequence into first qualifying OTU. Dependent on input order. reference compare tags to pre-established set of reference OTUs (open and closed) oligotyping using sequence-based information directly
25 Cluster Width Diameter Sequences are never more than D apart. (CL) Radius Sequences are never more than R from seed. (AL, Ref, Greedy) Link Length Every sequence is with L of another sequence. (SL)
26 The Problem of Inflation Clustering algorithms return more OTUs than predicted for mock communities. OTU inflation leads to: alpha diversity inflation beta diversity inflation Where does this inflation come from? How can we adjust for this inflation?
27 Inflation in Action 1,042 is a few more than the expected 2
28 Sequencing Noise in E.coli Distance Tags Seqs 0 177, % , % % % % % % % % % > % Cluster at 3% using only tags within 3% of the correct sequence
29 Inflation with small sequence error Only 129 OTUs Much better!!
30 18,156 sequences and 392 positions Example MSA Regardless of clustering algorithm, an MSA cannot fully align tags whose sequences are too divergent
31 MSA Limitations Distance Comparison Independent Distances Subsampled Distances X-axis: alignment of 1000 sequences subsampled from 40,000 sequences Y-axis: MSA of 1,000 bacterial sequences
32 Clustering sequences within 3% of known templates E. coli (2) S. epidermidis (1) Clone 43 (43) MS-CL 129 / 89 / 694 Alignment method PW-CL 6 / 5 / 308 MS Multiple Sequence Alignment CL Complete Linkage clustering PW Pairwise Alignment AL Average Linkage clustering
33 Cluster Count: Complete Linkage propagates OTUs #5 #6 #9 #8 #4 #3 #1 #12 #11 #2 #7 #10 #13 Each V6 variant with 2nt distance, will start a new OTU. Each V6 variant with 1nt distance, can cluster with the seed or a 2nd variant.
34 Average Linkage collapses errors Cluster Count: 1 #1 Clusters tend to be heavily dominated by their most abundant sequence, which strongly weights the average and smoothes the noise.
35 Greedy Seeds (UClust) Sort by abundance First Seq 1 is seed of Cluster 1 Compare next Seq to each cluster in order, if Distance(Seq, Cluster) <= W, include it in the cluster if Seq is not a member of any cluster, create a new cluster
36 Greedy Seeds (UClust) Cluster Count: 1
37 Reference OTUs 1. Create a full-length tree of reference sequences (greengenes) 2. Cluster those sequences 3. Map each tag to the nearest reference 4. Bin each tag to that reference s OTU#
38 Oligotyping 1. Align the sequences 2. Determine the alignment positions with the most information (Shannon entropy) 3. Iterate the position selection 4. Use those positions to bin all sequences
39 Clustering sequences within 3% of known templates E. coli (2) S. epidermidis (1) Clone 43 (43) MS-CL 129 / 89 / 694 Clustering method MS-AL 54 / 44 / 218 Alignment method PW-CL 6 / 5 / 308 PW-AL 2 / 1 / 43 MS Multiple Sequence Alignment CL Complete Linkage clustering PW Pairwise Alignment AL Average Linkage clustering
40 Accurate with small error 2 OTUs created!!
41 OTU Inflation with more errors with errors back up to 277 OTUs
42 Still lose outlier sequencing errors Multiple sequencing errors still not clustered
43 Single Linkage Preclustering at 2% 1. Sort sequences by abundance 2. Precluster using 1nt change (2% in V6) single-linkage 3. Most abundant sequence represents all reads in the precluster
44 SLP Smooths out sequencing errors in data, as well as PCR errors, as well as fine natural variation such as SNPs. SLP is not designed to denoise sequences. SLP should only be used for preclustering before clustering at >= 3%.
45 Total Tags 215,618 / 197,876 / 202,340 Unique Sequences 3,309 / 3,177 / 7,461 3% Clustering Single-Linkage Preclustering MS - CL 1042 / 1267 / 2473 PW - AL 277 / 323 / 458 PW AL 88 / 128 / 275 Expected OTUs: E. coli (2) / S. epidermidis (1) / Clone 43 (43)
46 All Ecoli with SLP Back down to 88 OTUs
47 OTU Consistency Will I get the same OTUs every time?
48 Sample Size and Rarefaction 7000 M2FN PML Rarefaction MS-CL - PML OTUs ,000 40,000 60,000 80, , ,000 Number of Sequences Sampled 5K 10K 15K 20K 50K 100K
49 Sample Size and Rarefaction PML SLP-PW-AL
50 Relative Inflation Absolute number of errant OTUs will increase with sample size. Relative number of errant OTUs will decrease with sample complexity
51 The Magical 3% NOT! 3% SSU OTUs = Species and 6% SSU OTUs = Genera
52 History of The 3% OTUs 1. Wayne et al (1987) IJSEM 37(4): Bacterial systematics phylogeny is best for defining species, want to use the entire genome; therefore, 70% similarity as defined by DNA-DNA reassociation. 2. Stackebrandt and Goebel (1994) IJSEM 44(4): % DNA-DNA ~ 97% similarity of full-length 16S 3. 97% similarity of any hypervariable region anywhere in the Bacterial kingdom = species
53 What is the right cluster width? Distances vary by hypervariable region Full-length does not equal V1-V3 which does not equal V3-V5 which does not equal V6, etc. Distance vary by bacterial lineage What specificity are you trying to gain? No single clustering width is the right answer, use several (and wave your hands a lot)
54 Clustering Options mothur QIIME UClust (USearch) Oligotyping ESPRIT Tree CROP (Bayesian)
55 Clustering Questions How meaningful are clusters functionally? When is an errare rare and when is it an error? Should it be included in an existing cluster or start its own? How to place sequences if OTUs overlap? What is the effect of residual low quality data or chimeras? How sensitive are alpha and beta diversity estimates to clustering results?
56 OTUs vs Taxonomy Novel organisms Many unnamed organisms Some clades only defined to phyla or class Many species names based on phenotype rather than genotype Do not lump together all 16S unknowns or diverse partially classified.
57 From the Sample to the Population: Diversity and Distance Metrics
58 Diversity Sampling Depth and Alpha Diversity Not Richness - 5,000 10,000 15,000 20,000 25,000 30,000 35,000 40,000 45,000 Sampling Depth SLP - NPShannon SLP - Simpson CL - NPShannon Simpson Robust to both singletons and depth
59 Ex. Distance Metrics Jaccard, Unweighted UniFrac: presence / absence U (A,B) / U (A,B) U Bray Curtis, Weighted Unifrac: relative abundance Σ [ min (A i, B i ) ] / avg (N A,B ) A i = abundance of OTU i in A N = sample size Morisita-Horn (ThetaYC): relative and weighted abundance Σ [ RA i * RB i ] / avg ( Σ [ RA i 2 ], Σ [ RB i 2 ] ) RA i = relative abundance of OTU i in A
60 Do subsamples of the same sample look alike? For a sample of 50,000 reads: 1. Randomly select 5,000 reads 10 times 2. Calculate subsample beta diversity using several distance metrics
61 Community Distance of Subsamples at 5,000 Reads Community Distance Replicates Morisita Horn Bray-Curtis Jaccard Presence/absence (Jaccard) is highly skewed by the inconsistency of the low abundance members
62 0.12 Community Distance of Subsamples 0.1 Community Distance Replicates Bray Curtis (1K) Bray-Curtis (5K) Morisita Horn (1K) Morisita Horn (5K) Subsample 1,000 and 5,000 reads from sample of 50,000 reads, Pairwise distances for replicates at single depth
63 Subsampling Across Depths For a sample of 50,000 reads: Randomly select subsamples at depths: 1,000 5,000 7,500 10,000 15,000 20,000 25,000 Calculate subsample beta diversity using several distance metrics
64 Effect of Sample Depth - Bray Curtis Nearly 100% Different Bray Curtis uses absolute counts, intra-community distances are high as depths diverge
65 Morisita Horn uses relative abundances, intra-community distances are low across depths above minimum sampling depth. Effect of Sample Depth - Morisita Horn Nearly 0.5% Different ,000 5,000 7,500 10,000 15,000 20,000 25,000 20,000 15,000 10,000 7,500 5,000 1,000
66 PCoA plots group communities based on pairwise distances. Communities that have low pairwise distances are close together on the plot. Communities with larger pairwise distance are farther apart distant on the plot.
67 "#"$& &'!#"()*+,-./0#1.+2#34-.*.+5#64-/# "#""'& "#""(& "#"")& "#""%& "&!"#$#!"#""%&!"#"")&!"#""(& &$+"""&& &*+"""&& &,+*""&& &$"+"""&& &$*+"""&& &%"+"""&& &%*+"""&&!"#""'&!"#"$&!"#"$%&!"#%#!"#"$*&!"#"$&!"#""*& "& "#""*& "#"$& Minimum sample depth here of 10,000, but will be a function of the diversity of the sample
68 0.4 SLP Clustering and Bray-Curtis PC ,000 2,000 5,000 7,500 10,000 15,000 20,000 25,000 30,000 40, PC Bray-Curtis PCoA clusters entirely on depth (each point represents 10 atop one another)
Assigning Taxonomy to Marker Genes. Susan Huse Brown University August 7, 2014
Assigning Taxonomy to Marker Genes Susan Huse Brown University August 7, 2014 In a nutshell Taxonomy is assigned by comparing your DNA sequences against a database of DNA sequences from known taxa Marker
More informationRobert Edgar. Independent scientist
Robert Edgar Independent scientist robert@drive5.com www.drive5.com "Bacterial taxonomy is a hornets nest that no one, really, wants to get into." Referee #1, UTAX paper Assume prokaryotic species meaningful
More informationAmplicon Sequencing. Dr. Orla O Sullivan SIRG Research Fellow Teagasc
Amplicon Sequencing Dr. Orla O Sullivan SIRG Research Fellow Teagasc What is Amplicon Sequencing? Sequencing of target genes (are regions of ) obtained by PCR using gene specific primers. Why do we do
More informationMicrobiome: 16S rrna Sequencing 3/30/2018
Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics
More informationOther resources. Greengenes (bacterial) Silva (bacteria, archaeal and eukarya)
General QIIME resources http://qiime.org/ Blog (news, updates): http://qiime.wordpress.com/ Support/forum: https://groups.google.com/forum/#!forum/qiimeforum Citing QIIME: Caporaso, J.G. et al., QIIME
More informationTaxonomical Classification using:
Taxonomical Classification using: Extracting ecological signal from noise: introduction to tools for the analysis of NGS data from microbial communities Bergen, April 19-20 2012 INTRODUCTION Taxonomical
More informationTitle ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses
More informationNature Biotechnology: doi: /nbt Supplementary Figure 1. Detailed overview of the primer-free full-length SSU rrna library preparation.
Supplementary Figure 1 Detailed overview of the primer-free full-length SSU rrna library preparation. Detailed overview of the primer-free full-length SSU rrna library preparation. Supplementary Figure
More informationOutline Classes of diversity measures. Species Divergence and the Measurement of Microbial Diversity. How do we describe and compare diversity?
Species Divergence and the Measurement of Microbial Diversity Cathy Lozupone University of Colorado, Boulder. Washington University, St Louis. Outline Classes of diversity measures α vs β diversity Quantitative
More informationComparison of Three Fugal ITS Reference Sets. Qiong Wang and Jim R. Cole
RDP TECHNICAL REPORT Created 04/12/2014, Updated 08/08/2014 Summary Comparison of Three Fugal ITS Reference Sets Qiong Wang and Jim R. Cole wangqion@msu.edu, colej@msu.edu In this report, we evaluate the
More informationFIG S1: Rarefaction analysis of observed richness within Drosophila. All calculations were
Page 1 of 14 FIG S1: Rarefaction analysis of observed richness within Drosophila. All calculations were performed using mothur (2). OTUs were defined at the 3% divergence threshold using the average neighbor
More informationMiGA: The Microbial Genome Atlas
December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From
More informationAssessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rrna Gene Sequence Analysis
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, May 2011, p. 3219 3226 Vol. 77, No. 10 0099-2240/11/$12.00 doi:10.1128/aem.02810-10 Copyright 2011, American Society for Microbiology. All Rights Reserved. Assessing
More informationThe Effect of Primer Choice and Short Read Sequences on the Outcome of 16S rrna Gene Based Diversity Studies
The Effect of Primer Choice and Short Read Sequences on the Outcome of 16S rrna Gene Based Diversity Studies Jonas Ghyselinck 1 *., Stefan Pfeiffer 2 *., Kim Heylen 1, Angela Sessitsch 2, Paul De Vos 1
More informationMicrobes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationMicrobial Taxonomy. Slowly evolving molecules (e.g., rrna) used for large-scale structure; "fast- clock" molecules for fine-structure.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationMicrobial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B
Microbial Diversity and Assessment (II) Spring, 007 Guangyi Wang, Ph.D. POST03B guangyi@hawaii.edu http://www.soest.hawaii.edu/marinefungi/ocn403webpage.htm General introduction and overview Taxonomy [Greek
More informationrrdp: Interface to the RDP Classifier
rrdp: Interface to the RDP Classifier Michael Hahsler Anurag Nagar Abstract This package installs and interfaces the naive Bayesian classifier for 16S rrna sequences developed by the Ribosomal Database
More informationIntroduction to microbiota data analysis
Introduction to microbiota data analysis Natalie Knox, PhD Head Bacterial Genomics, Bioinformatics Core National Microbiology Laboratory, Public Health Agency of Canada 2 National Microbiology Laboratory
More informationAccuracy of taxonomy prediction for 16S rrna and fungal ITS sequences
Accuracy of taxonomy prediction for 16S rrna and fungal ITS sequences Robert C. Edgar Sonoma, CA, USA ABSTRACT Prediction of taxonomy for marker gene sequences such as 16S ribosomal RNA (rrna) is a fundamental
More informationSUPPLEMENTARY INFORMATION
Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,
More informationImpact of training sets on classification of high-throughput bacterial 16s rrna gene surveys
(2012) 6, 94 103 & 2012 International Society for Microbial Ecology All rights reserved 1751-7362/12 www.nature.com/ismej ORIGINAL ARTICLE Impact of training sets on classification of high-throughput bacterial
More informationSupplemental Online Results:
Supplemental Online Results: Functional, phylogenetic, and computational determinants of prediction accuracy using reference genomes A series of tests determined the relationship between PICRUSt s prediction
More informationBacterial Communities in Women with Bacterial Vaginosis: High Resolution Phylogenetic Analyses Reveal Relationships of Microbiota to Clinical Criteria
Bacterial Communities in Women with Bacterial Vaginosis: High Resolution Phylogenetic Analyses Reveal Relationships of Microbiota to Clinical Criteria Seminar presentation Pierre Barbera Supervised by:
More informationPHYLOGENY AND SYSTEMATICS
AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study
More informationInterpreting the Molecular Tree of Life: What Happened in Early Evolution? Norm Pace MCD Biology University of Colorado-Boulder
Interpreting the Molecular Tree of Life: What Happened in Early Evolution? Norm Pace MCD Biology University of Colorado-Boulder nrpace@colorado.edu Outline What is the Tree of Life? -- Historical Conceptually
More informationCensusing the Sea in the 21 st Century
Censusing the Sea in the 21 st Century Nancy Knowlton & Matthieu Leray Photo: Ove Hoegh-Guldberg Smithsonian s National Museum of Natural History Estimates of Marine/Reef Species Numbers (Millions) Marine
More informationTaxonomy. Content. How to determine & classify a species. Phylogeny and evolution
Taxonomy Content Why Taxonomy? How to determine & classify a species Domains versus Kingdoms Phylogeny and evolution Why Taxonomy? Classification Arrangement in groups or taxa (taxon = group) Nomenclature
More informationA Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems
A Novel Ribosomal-based Method for Studying the Microbial Ecology of Environmental Engineering Systems Tao Yuan, Asst/Prof. Stephen Tiong-Lee Tay and Dr Volodymyr Ivanov School of Civil and Environmental
More informationMicrobiota: Its Evolution and Essence. Hsin-Jung Joyce Wu "Microbiota and man: the story about us
Microbiota: Its Evolution and Essence Overview q Define microbiota q Learn the tool q Ecological and evolutionary forces in shaping gut microbiota q Gut microbiota versus free-living microbe communities
More informationLecture 2: Diversity, Distances, adonis. Lecture 2: Diversity, Distances, adonis. Alpha- Diversity. Alpha diversity definition(s)
Lecture 2: Diversity, Distances, adonis Lecture 2: Diversity, Distances, adonis Diversity - alpha, beta (, gamma) Beta- Diversity in practice: Ecological Distances Unsupervised Learning: Clustering, etc
More informationOutline. Classification of Living Things
Outline Classification of Living Things Chapter 20 Mader: Biology 8th Ed. Taxonomy Binomial System Species Identification Classification Categories Phylogenetic Trees Tracing Phylogeny Cladistic Systematics
More informationStepping stones towards a new electronic prokaryotic taxonomy. The ultimate goal in taxonomy. Pragmatic towards diagnostics
Stepping stones towards a new electronic prokaryotic taxonomy - MLSA - Dirk Gevers Different needs for taxonomy Describe bio-diversity Understand evolution of life Epidemiology Diagnostics Biosafety...
More informationChapter 26 Phylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life Chapter focus Shifting from the process of how evolution works to the pattern evolution produces over time. Phylogeny Phylon = tribe, geny = genesis or origin
More information8/23/2014. Phylogeny and the Tree of Life
Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major
More informationChad Burrus April 6, 2010
Chad Burrus April 6, 2010 1 Background What is UniFrac? Materials and Methods Results Discussion Questions 2 The vast majority of microbes cannot be cultured with current methods Only half (26) out of
More informationMicrobial Taxonomy and the Evolution of Diversity
19 Microbial Taxonomy and the Evolution of Diversity Copyright McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or display. 1 Taxonomy Introduction to Microbial Taxonomy
More informationProbing diversity in a hidden world: applications of NGS in microbial ecology
Probing diversity in a hidden world: applications of NGS in microbial ecology Guus Roeselers TNO, Microbiology & Systems Biology Group Symposium on Next Generation Sequencing October 21, 2013 Royal Museum
More informationAn Automated Phylogenetic Tree-Based Small Subunit rrna Taxonomy and Alignment Pipeline (STAP)
An Automated Phylogenetic Tree-Based Small Subunit rrna Taxonomy and Alignment Pipeline (STAP) Dongying Wu 1 *, Amber Hartman 1,6, Naomi Ward 4,5, Jonathan A. Eisen 1,2,3 1 UC Davis Genome Center, University
More informationChapter 19. Microbial Taxonomy
Chapter 19 Microbial Taxonomy 12-17-2008 Taxonomy science of biological classification consists of three separate but interrelated parts classification arrangement of organisms into groups (taxa; s.,taxon)
More informationData Mining und Maschinelles Lernen
Data Mining und Maschinelles Lernen Ensemble Methods Bias-Variance Trade-off Basic Idea of Ensembles Bagging Basic Algorithm Bagging with Costs Randomization Random Forests Boosting Stacking Error-Correcting
More informationPHYLOGENY & THE TREE OF LIFE
PHYLOGENY & THE TREE OF LIFE PREFACE In this powerpoint we learn how biologists distinguish and categorize the millions of species on earth. Early we looked at the process of evolution here we look at
More informationBergey s Manual Classification Scheme. Vertical inheritance and evolutionary mechanisms
Bergey s Manual Classification Scheme Gram + Gram - No wall Funny wall Vertical inheritance and evolutionary mechanisms a b c d e * * a b c d e * a b c d e a b c d e * a b c d e Accumulation of neutral
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationMicrobial Taxonomy. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eukaryote, has been in a mess we were stuck with it for traditional
More informationLecture: Mixture Models for Microbiome data
Lecture: Mixture Models for Microbiome data Lecture 3: Mixture Models for Microbiome data Outline: - - Sequencing thought experiment Mixture Models (tangent) - (esp. Negative Binomial) - Differential abundance
More informationA (short) introduction to phylogenetics
A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field
More informationMicrobial Taxonomy. C. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.
Microbial Taxonomy 1. Traditional taxonomy or the classification through identification and nomenclature of microbes, both "prokaryote" and eucaryote, is in a mess we are stuck with it for traditional
More informationIntroduction to polyphasic taxonomy
Introduction to polyphasic taxonomy Peter Vandamme EUROBILOFILMS - Third European Congress on Microbial Biofilms Ghent, Belgium, 9-12 September 2013 http://www.lm.ugent.be/ Content The observation of diversity:
More informationChapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships
Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic
More informationDiversity, Productivity and Stability of an Industrial Microbial Ecosystem
Diversity, Productivity and Stability of an Industrial Microbial Ecosystem Doruk Beyter 1, Pei-Zhong Tang 2, Scott Becker 2, Tony Hoang 3, Damla Bilgin 3, Yan Wei Lim 4, Todd C. Peterson 2, Stephen Mayfield
More informationMicrobial analysis with STAMP
Microbial analysis with STAMP Conor Meehan cmeehan@itg.be A quick aside on who I am Tangents already! Who I am A postdoc at the Institute of Tropical Medicine in Antwerp, Belgium Mycobacteria evolution
More informationDETAILED RESULTS ON FUNGAL AND BACTERIAL COMMUNITIES COLONIZING THE COMPOST-INCUBATED POLYMER CARD SURFACE
DETAILED RESULTS ON FUNGAL AND BACTERIAL COMMUNITIES COLONIZING THE COMPOST-INCUBATED POLYMER CARD SURFACE PART 1: FUNGAL COMMUNITIES COLONIZING THE COMPOST-INCUBATED POLYMER CARD SURFACE Fungal OTUs richness,
More informationA. Incorrect! In the binomial naming convention the Kingdom is not part of the name.
Microbiology Problem Drill 08: Classification of Microorganisms No. 1 of 10 1. In the binomial system of naming which term is always written in lowercase? (A) Kingdom (B) Domain (C) Genus (D) Specific
More informationBLAST Database Searching. BME 110: CompBio Tools Todd Lowe April 8, 2010
BLAST Database Searching BME 110: CompBio Tools Todd Lowe April 8, 2010 Admin Reading: Read chapter 7, and the NCBI Blast Guide and tutorial http://www.ncbi.nlm.nih.gov/blast/why.shtml Read Chapter 8 for
More informationTaxonomy and Biodiversity
Chapter 25/26 Taxonomy and Biodiversity Evolutionary biology The major goal of evolutionary biology is to reconstruct the history of life on earth Process: a- natural selection b- mechanisms that change
More informationCharacterizing and predicting cyanobacterial blooms in an 8-year
1 2 3 4 5 Characterizing and predicting cyanobacterial blooms in an 8-year amplicon sequencing time-course Authors Nicolas Tromas 1*, Nathalie Fortin 2, Larbi Bedrani 1, Yves Terrat 1, Pedro Cardoso 4,
More informationName: Class: Date: ID: A
Class: _ Date: _ Ch 17 Practice test 1. A segment of DNA that stores genetic information is called a(n) a. amino acid. b. gene. c. protein. d. intron. 2. In which of the following processes does change
More informationIntroduction to the SNP/ND concept - Phylogeny on WGS data
Introduction to the SNP/ND concept - Phylogeny on WGS data Johanne Ahrenfeldt PhD student Overview What is Phylogeny and what can it be used for Single Nucleotide Polymorphism (SNP) methods CSI Phylogeny
More informationPlant Names and Classification
Plant Names and Classification Science of Taxonomy Identification (necessary!!) Classification (order out of chaos!) Nomenclature (why not use common names?) Reasons NOT to use common names Theophrastus
More informationBiology 211 (2) Week 1 KEY!
Biology 211 (2) Week 1 KEY Chapter 1 KEY FIGURES: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7 VOCABULARY: Adaptation: a trait that increases the fitness Cells: a developed, system bound with a thin outer layer made of
More informationHandling Fungal data in MoBeDAC
Handling Fungal data in MoBeDAC Jason Stajich UC Riverside Fungal Taxonomy and naming undergoing a revolution One fungus, one name http://www.biology.duke.edu/fungi/ mycolab/primers.htm http://www.biology.duke.edu/fungi/
More informationCh 10. Classification of Microorganisms
Ch 10 Classification of Microorganisms Student Learning Outcomes Define taxonomy, taxon, and phylogeny. List the characteristics of the Bacteria, Archaea, and Eukarya domains. Differentiate among eukaryotic,
More informationPhylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26
Phylogeny Chapter 26 Taxonomy Taxonomy: ordered division of organisms into categories based on a set of characteristics used to assess similarities and differences Carolus Linnaeus developed binomial nomenclature,
More informationC3020 Molecular Evolution. Exercises #3: Phylogenetics
C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from
More informationSupplementary Information
Supplementary Information Table S1. Per-sample sequences, observed OTUs, richness estimates, diversity indices and coverage. Samples codes as follows: YED (Young leaves Endophytes), MED (Mature leaves
More informationSupplementary Figure 1. Chao 1 richness estimator of microbial OTUs (16S rrna
Supplementary Information Supplementary Figure 1. Chao 1 richness estimator of microbial OTUs (16S rrna gene survey) from skin and hindgut of wild vultures and from feces of zoo birds. Supplementary Note
More informationBioinformatics Chapter 1. Introduction
Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!
More informationThe Tree of Life. Chapter 17
The Tree of Life Chapter 17 1 17.1 Taxonomy The science of naming and classifying organisms 2000 years ago Aristotle Grouped plants and animals Based on structural similarities Greeks and Romans included
More informationBioinformatics tools for phylogeny and visualization. Yanbin Yin
Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and
More informationa,bD (modules 1 and 10 are required)
This form should be used for all taxonomic proposals. Please complete all those modules that are applicable (and then delete the unwanted sections). For guidance, see the notes written in blue and the
More informationPipelining RDP Data to the Taxomatic Background Accomplishments vs objectives
Pipelining RDP Data to the Taxomatic Timothy G. Lilburn, PI/Co-PI George M. Garrity, PI/Co-PI (Collaborative) James R. Cole, Co-PI (Collaborative) Project ID 0010734 Grant No. DE-FG02-04ER63932 Background
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature25973 Power Simulations We performed extensive power simulations to demonstrate that the analyses carried out in our study are well powered. Our simulations indicate very high power for
More informationA Bayesian taxonomic classification method for 16S rrna gene sequences with improved species-level accuracy
Gao et al. BMC Bioinformatics (2017) 18:247 DOI 10.1186/s12859-017-1670-4 SOFTWARE Open Access A Bayesian taxonomic classification method for 16S rrna gene sequences with improved species-level accuracy
More informationLecture 3: Mixture Models for Microbiome data. Lecture 3: Mixture Models for Microbiome data
Lecture 3: Mixture Models for Microbiome data 1 Lecture 3: Mixture Models for Microbiome data Outline: - Mixture Models (Negative Binomial) - DESeq2 / Don t Rarefy. Ever. 2 Hypothesis Tests - reminder
More informationUnit 5: Taxonomy. KEY CONCEPT Organisms can be classified based on physical similarities.
KEY CONCEPT Organisms can be classified based on physical similarities. Linnaeus developed the scientific naming system still used today. Taxonomy is the science of naming and classifying organisms. White
More information- conserved in Eukaryotes. - proteins in the cluster have identifiable conserved domains. - human gene should be included in the cluster.
NCBI BLAST Services DELTA-BLAST BLAST (http://blast.ncbi.nlm.nih.gov/), Basic Local Alignment Search tool, is a suite of programs for finding similarities between biological sequences. DELTA-BLAST is a
More informationThe Classification of Plants and Other Organisms. Chapter 18
The Classification of Plants and Other Organisms Chapter 18 LEARNING OBJECTIVE 1 Define taxonomy Explain why the assignment of a scientific name to each species is important for biologists KEY TERMS TAXONOMY
More informationThe practice of naming and classifying organisms is called taxonomy.
Chapter 18 Key Idea: Biologists use taxonomic systems to organize their knowledge of organisms. These systems attempt to provide consistent ways to name and categorize organisms. The practice of naming
More informationChapter 17A. Table of Contents. Section 1 Categories of Biological Classification. Section 2 How Biologists Classify Organisms
Classification of Organisms Table of Contents Section 1 Categories of Biological Classification Section 1 Categories of Biological Classification Classification Section 1 Categories of Biological Classification
More informationInferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT
Inferring phylogeny Constructing phylogenetic trees Tõnu Margus Contents What is phylogeny? How/why it is possible to infer it? Representing evolutionary relationships on trees What type questions questions
More informationSystems biology. Abstract
Bioinformatics, 31(10), 2015, 1607 1613 doi: 10.1093/bioinformatics/btu855 Advance Access Publication Date: 6 January 2015 Original Paper Systems biology Selection of models for the analysis of risk-factor
More informationBLAST. Varieties of BLAST
BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database
More informationMicrobes and you ON THE LATEST HUMAN MICROBIOME DISCOVERIES, COMPUTATIONAL QUESTIONS AND SOME SOLUTIONS. Elizabeth Tseng
Microbes and you ON THE LATEST HUMAN MICROBIOME DISCOVERIES, COMPUTATIONAL QUESTIONS AND SOME SOLUTIONS Elizabeth Tseng Dept. of CSE, University of Washington Johanna Lampe Lab, Fred Hutchinson Cancer
More informationCharacteristics of Life
UNIT 2 BIODIVERSITY Chapter 4- Patterns of Life Biology 2201 Characteristics of Life All living things share some basic characteristics: 1) living things are organized systems made up of one or more cells
More informationGEOS 36501/EVOL January 2012 Page 1 of 23
GEOS 36501/EVOL 33001 13 January 2012 Page 1 of 23 III. Sampling 1 Overview of Sampling, Error, Bias 1.1 Biased vs. random sampling 1.2 Biased vs. unbiased statistic (or estimator) 1.3 Precision vs. accuracy
More informationThe minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome
Dr. Dirk Gevers 1,2 1 Laboratorium voor Microbiologie 2 Bioinformatics & Evolutionary Genomics The bacterial species in the genomic era CTACCATGAAAGACTTGTGAATCCAGGAAGAGAGACTGACTGGGCAACATGTTATTCAG GTACAAAAAGATTTGGACTGTAACTTAAAAATGATCAAATTATGTTTCCCATGCATCAGG
More informationMicrobiology / Active Lecture Questions Chapter 10 Classification of Microorganisms 1 Chapter 10 Classification of Microorganisms
1 2 Bergey s Manual of Systematic Bacteriology differs from Bergey s Manual of Determinative Bacteriology in that the former a. groups bacteria into species. b. groups bacteria according to phylogenetic
More informationFuyong Li 1,2, Thomas C. A. Hitch 3, Yanhong Chen 1, Christopher J. Creevey 3 and Le Luo Guan 1*
Li et al. Microbiome (2019) 7:6 https://doi.org/10.1186/s40168-019-0618-5 RESEARCH Open Access Comparative metagenomic and metatranscriptomic analyses reveal the breed effect on the rumen microbiome and
More informationMicrobiology Helmut Pospiech
Microbiology http://researchmagazine.uga.edu/summer2002/bacteria.htm 05.04.2018 Helmut Pospiech The Species Concept in Microbiology No universally accepted concept of species for prokaryotes Current definition
More informationChapter 26 Phylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life Biologists estimate that there are about 5 to 100 million species of organisms living on Earth today. Evidence from morphological, biochemical, and gene sequence
More informationand just what is science? how about this biology stuff?
Welcome to Life on Earth! Rob Lewis 512.775.6940 rlewis3@austincc.edu 1 The Science of Biology Themes and just what is science? how about this biology stuff? 2 1 The Process Of Science No absolute truths
More informationOutline. I. Methods. II. Preliminary Results. A. Phylogeny Methods B. Whole Genome Methods C. Horizontal Gene Transfer
Comparative Genomics Preliminary Results April 4, 2016 Juan Castro, Aroon Chande, Cheng Chen, Evan Clayton, Hector Espitia, Alli Gombolay, Walker Gussler, Ken Lee, Tyrone Lee, Hari Prasanna, Carlos Ruiz,
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationSPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together
SPECIATION Origin of new species=speciation -Process by which one species splits into two or more species, accounts for both the unity and diversity of life SPECIES BIOLOGICAL CONCEPT Population or groups
More informationChapter 19: Taxonomy, Systematics, and Phylogeny
Chapter 19: Taxonomy, Systematics, and Phylogeny AP Curriculum Alignment Chapter 19 expands on the topics of phylogenies and cladograms, which are important to Big Idea 1. In order for students to understand
More informationSupplementary Information
Supplementary Information For the article"comparable system-level organization of Archaea and ukaryotes" by J. Podani, Z. N. Oltvai, H. Jeong, B. Tombor, A.-L. Barabási, and. Szathmáry (reference numbers
More informationPhylogeny. November 7, 2017
Phylogeny November 7, 2017 Phylogenetics Phylon = tribe/race, genetikos = relative to birth Phylogenetics: study of evolutionary relationships among organisms, sequences, or anything in between Related
More informationChapter 17. Table of Contents. Objectives. Taxonomy. Classifying Organisms. Section 1 Biodiversity. Section 2 Systematics
Classification Table of Contents Objectives Relatebiodiversity to biological classification. Explainwhy naturalists replaced Aristotle s classification system. Identifythe main criterion that Linnaeus
More information