Microbial analysis with STAMP

Similar documents
Microbiome: 16S rrna Sequencing 3/30/2018

Assigning Taxonomy to Marker Genes. Susan Huse Brown University August 7, 2014

Supervised Learning to Predict Geographic Origin of Human Metagenomic Samples

Supplemental Online Results:

SUPPLEMENTARY INFORMATION

Amplicon Sequencing. Dr. Orla O Sullivan SIRG Research Fellow Teagasc

Handling Fungal data in MoBeDAC

Supplementary Information

FIG S1: Rarefaction analysis of observed richness within Drosophila. All calculations were

Phylogenomics, Multiple Sequence Alignment, and Metagenomics. Tandy Warnow University of Illinois at Urbana-Champaign

Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -

Microbiota: Its Evolution and Essence. Hsin-Jung Joyce Wu "Microbiota and man: the story about us

MiGA: The Microbial Genome Atlas

Exploring Microbes in the Sea. Alma Parada Postdoctoral Scholar Stanford University

Microbes and you ON THE LATEST HUMAN MICROBIOME DISCOVERIES, COMPUTATIONAL QUESTIONS AND SOME SOLUTIONS. Elizabeth Tseng

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Using Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics

The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome. The minimal prokaryotic genome

Other resources. Greengenes (bacterial) Silva (bacteria, archaeal and eukarya)

Istituto di Microbiologia. Università Cattolica del Sacro Cuore, Roma. Gut Microbiota assessment and the Meta-HIT program.

Taxonomy. Content. How to determine & classify a species. Phylogeny and evolution

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Detailed overview of the primer-free full-length SSU rrna library preparation.

Taxonomical Classification using:

Bacterial Communities in Women with Bacterial Vaginosis: High Resolution Phylogenetic Analyses Reveal Relationships of Microbiota to Clinical Criteria

Taxonomy and Clustering of SSU rrna Tags. Susan Huse Josephine Bay Paul Center August 5, 2013

Programme Specification (Undergraduate) For 2017/18 entry Date amended: 25/06/18

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Supplementary Information

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

Undergraduate Curriculum in Biology

BIOL 101 Introduction to Biological Research Techniques I

Introduction to Biology with Lab

Introduction to Evolutionary Concepts

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

Comparative genomics: Overview & Tools + MUMmer algorithm

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

Potential of metatranscriptomics as indicator for ecosystem level diversity and function. diversitv Prof. Dr. Jens Boenigk. Department.

Chapter 19 Organizing Information About Species: Taxonomy and Cladistics

Exploring environmental genetic diversity with similarity networks

Outline Classes of diversity measures. Species Divergence and the Measurement of Microbial Diversity. How do we describe and compare diversity?

Chapters AP Biology Objectives. Objectives: You should know...

The Prokaryotic World

SUPPLEMENTARY INFORMATION

BIOL 1010 Introduction to Biology: The Evolution and Diversity of Life. Spring 2011 Sections A & B

Compositional data methods for microbiome studies

BIOINFORMATICS LAB AP BIOLOGY

Robert Edgar. Independent scientist

Outline. I. Methods. II. Preliminary Results. A. Phylogeny Methods B. Whole Genome Methods C. Horizontal Gene Transfer

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

The problem Lineage model Examples. The lineage model

Lowndes County Biology II Pacing Guide Approximate

8/23/2014. Phylogeny and the Tree of Life

Chapter 17. Table of Contents. Objectives. Taxonomy. Classifying Organisms. Section 1 Biodiversity. Section 2 Systematics

arxiv: v1 [q-bio.pe] 7 Jul 2014

Supporting Information

Bioinformatics Exercises

Comparative Bioinformatics Midterm II Fall 2004

A Bayesian taxonomic classification method for 16S rrna gene sequences with improved species-level accuracy

Introduction to de novo RNA-seq assembly

Genômica comparativa. João Carlos Setubal IQ-USP outubro /5/2012 J. C. Setubal

Computational methods for predicting protein-protein interactions

Structure, function and host control of rhizosphere microbiome

Single-cell genomics applied to the picobiliphytes using next-generation sequencing

Microbial Taxonomy and the Evolution of Diversity

AP Environmental Science I. Unit 1-2: Biodiversity & Evolution

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

Open projects for BSc & MSc

Probing diversity in a hidden world: applications of NGS in microbial ecology

The practice of naming and classifying organisms is called taxonomy.

Supplementary Information

Molecular Genetics for Aquatic and Marine Biodiversity Conservation

Using Topological Data Analysis to find discrimination between microbial states in human microbiome data

Curriculum Links. AQA GCE Biology. AS level

Bio 119 Bacterial Genomics 6/26/10

Phylogenetics and the Human Microbiome

Microbial DNA qpcr Multi-Assay Kit Clostridium perfringens Pathogenicity

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Introduction to Biology Web Course Informational and Test Schedule

Microbial Diversity. Yuzhen Ye I609 Bioinformatics Seminar I (Spring 2010) School of Informatics and Computing Indiana University

1. HyperLogLog algorithm

Approved Courses for General Science students with Major/Minors in Biological Sciences

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Undergraduate Curriculum in Biology

Postgraduate teaching for the next generation of taxonomists

Evolutionary Genetics: Part 0.2 Introduction to Population genetics

Graduate Funding Information Center

Outline. Classification of Living Things

Molecular Biology Of The Cell 6th Edition Alberts

Studying Life. Lesson Overview. Lesson Overview. 1.3 Studying Life

EASTERN ARIZONA COLLEGE Biology Concepts

CLASSIFICATION UNIT GUIDE DUE WEDNESDAY 3/1

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Chapter 19. Microbial Taxonomy

Dr. Amira A. AL-Hosary

SUPPLEMENTARY INFORMATION

Cluster Analysis of Gene Expression Microarray Data. BIOL 495S/ CS 490B/ MATH 490B/ STAT 490B Introduction to Bioinformatics April 8, 2002

Transcription:

Microbial analysis with STAMP Conor Meehan cmeehan@itg.be

A quick aside on who I am Tangents already!

Who I am A postdoc at the Institute of Tropical Medicine in Antwerp, Belgium Mycobacteria evolution and epidemiology Previously postdoc at Dalhousie University, Halifax, Canada Human microbiome (esp. gut and airways) Previously PhD student at NUI Galway, Ireland HIV evolution and drug resistance Biologist trapped in the body of a computer scientist Scripting for speed, not for development

What I do/can maybe help with Microbial genetics Pathogen evolution (esp. HIV and Mycobacteria) Phylogenetic reconstruction Lateral gene transfer Microbiome analysis (esp. gut and airway) Protein structure prediction Species concepts in light of the microbiome Knowing which beers to drink at the Kidd Comparing European and North American lifestyles (as I have done both)

What data do you have?

OTU list Qiime Mothur MG-RAST Pplacer Etc. Function list MG-RAST HUManN KEGG SEED COG/eggNOG PICRUSt Etc. Microbiome datasets

PICRUSt?

PICRUSt Phylogenetic Investigation of Communities by Reconstruction of Unobserved States http://picrust.github.com http://huttenhower.sph.harvard.edu/galaxy/ Langille MG, Zaneveld J et al. (2013) Predictive functional profiling of microbial communities using 16S rrna marker gene sequences. Nature Biotechnology 31, 814-821

16S rrna gene QIIME/ MOTHUR Sample 1 Sample 2 Sample 3 OTU 1 4 0 2 OTU 2 1 0 0 OTU 3 2 4 2 Shotgun Metagenomics MG- RAST/ HUMAnN Sample 1 Sample 2 Sample 3 K00001 20 15 18 K00002 1 2 0 K00003 4 5 4 PICRUST CurFs will talk about this in detail on Friday

I get it, I have data. Now what? Well, what do you want to know?

Potential Questions Differences in abundances between conditions Environmental conditions ph, salinity, etc. Host measurements BMI, age, etc. STAMP Geographical influences Gradients across environmental conditions Composition differences between sites Alpha/Beta diversities GenGIS

STAMP (no S = software)

STAMP Software that allows for statistical comparison of samples to distinguish ecological influences Parks, DH and Beiko RG (2010). Identifying biologically relevant differences between metagenomic communities. Bioinformatics, 26, 715-721 Utilises various statistical tests and corrects for multiple sampling Allows for comparisons of individual samples or groups of samples Outputs graphical and tabular lists of OTUs/functions that differ between groups. Primarily used for comparisons between metagenomes, can also compare between genomes (e.g. COG category counts)

A quick tutorial (i.e. I do it, you watch, we ll call it interactive learning)

Lachnospiraceae sporulation Tutorial dataset Genome analysis suggested that gut-residing Lachnospiraceae undergo sporulation while those in other environments do not. Question was: are there more Lachnospiraceae-related sporulation genes in gut microbiomes than in others? Mapped reads from 3 environments (multiple samples) to sporulationrelated genes in lachnospiraceae genomes Compared environments to see if there is an overabundance in the gut microbiome Part of Meehan CJ & Beiko RG (2014) A phylogenomic view of ecological specialization in the Lachnospiraceae, a family of digestive tract-associated bacteria, Genome Biol Evol. 6(13)

STAMP it

A quick research example (i.e. I show you what I did with STAMP)

An example application Meehan CJ and Beiko RG (2012) Lateral gene transfer of an ABC transporter complex between major constituents of the human gut microbiome, BMC Microbiology 12:248 Dataset: MetaHIT 124 patients Metadata included the BMI of the patient Classed these into low (18-22; 34 samples) and obese (33+; 33 samples) Are there functional differences between the gut microbiomes of these two groups?

Functional assignment and abundance comparisons Assembled contigs from metagenomic reads input to Orphelia Predicts ORFs Any <150nt discarded Homology search against IMG genomes using USEARCH Assigns KOs (good example of why you need to learn to script) Dataset input to STAMP to look for differences between low and high BMI groups

Nickel/peptides transporter Found to be greatest in difference between the low and high BMI groups Contains 5 proteins, 4 of which differed significantly between groups What species are contributing these functions to the microbiome?

Species assignments A phylogenetic tree was built for each of the 5 KOs Full length genes extracted from all IMG genomes Aligned with ClustalOmega, trimmed with BMGE, built with FastTree Metagenomic reads assigned to each of the 5 KOs in previous steps were placed on relevant reference tree Pplacer classifies reads in a rank flexible manner Allows for probability cut-off for selecting assignment Integrates the NCBI taxonomy using Taxtastic Faecalibacterium prausnitzii found to be highly associated with all 5 KOs Examine trees for sister taxa Reveals LGT from other residence of gut microbiome Strain differences in operon presence and gene orders

Species Operon 1 Operon 2 Operon 3 Operon 4 Operon 5 Operon 6 F. prausnitzii M21/2 F. prausnitzii A2-165 F. cf. prausnitzii KLE1255 F. prausnitzii SL3/3 650558314 F. prausnitzii L2-6

Microbial analysis Can take OTU lists and get estimated KO/SEED categories with PICRUSt Once you have OTU and/or functional tables there is a whole host of analyses that can be done Phylogenetic placements and counts (pplacer) Comparisons between samples (STAMP etc.) Comparisons between environmental factors (STAMP etc.) Geographical influence on compositions (gengis) Lets talk gengis after this short break. Download and install gengis from here: http://kiwi.cs.dal.ca/gengis/ Download the GOS dataset from here: http://kiwi.cs.dal.ca/gengis/images/4/48/gos_atlantic.zip Go to the tutorial page here: https://stamps.mbl.edu/index.php/gengis_tutorial