Table S1. Summary of datasets used in this study

Similar documents
Nature Biotechnology: doi: /nbt Supplementary Figure 1. Detailed overview of the primer-free full-length SSU rrna library preparation.

Multivariate Statistics 101. Ordination (PCA, NMDS, CA) Cluster Analysis (UPGMA, Ward s) Canonical Correspondence Analysis

Other resources. Greengenes (bacterial) Silva (bacteria, archaeal and eukarya)

Supplementary Information

Amplicon Sequencing. Dr. Orla O Sullivan SIRG Research Fellow Teagasc

Changes in dissolved organic matter composition and disinfection byproduct precursors in advanced drinking water treatment processes

DNA metabarcoding reveals diverse but selective diet of three-spined stickleback in a coastal ecosystem

Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses

Characterizing and predicting cyanobacterial blooms in an 8-year

Supplementary Materials for

Mathematical models for prediction of trihalomethanes in drinking water

Lecture 2: Diversity, Distances, adonis. Lecture 2: Diversity, Distances, adonis. Alpha- Diversity. Alpha diversity definition(s)

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis

FIG S1: Rarefaction analysis of observed richness within Drosophila. All calculations were

Supplementary Material

Taxonomy and Clustering of SSU rrna Tags. Susan Huse Josephine Bay Paul Center August 5, 2013

Partnering with LANDFIRE, NatureServe, and Heritage Programs. Utilizing Legacy Data for Ecological Site Concept Development and Descriptions

Exploring Microbes in the Sea. Alma Parada Postdoctoral Scholar Stanford University

Microbiome: 16S rrna Sequencing 3/30/2018

Tailoring activated carbon for enhanced adsorption of disinfection byproduct precursors

Nonmetals are located on the right side of the periodic table and are generally dull, brittle, and poor conductors.

Supplementary Information

concentration ( mol l -1 )

Ion Torrent. The chip is the machine

STATISTICS Relationships between variables: Correlation

Flowchart. (b) (c) (d)

2/19/2018. Dataset: 85,122 islands 19,392 > 1km 2 17,883 with data

INTRODUCTION TO MULTIVARIATE ANALYSIS OF ECOLOGICAL DATA

Taxonomical Classification using:

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D

BINF6201/8201. Molecular phylogenetic methods

Lecture: Mixture Models for Microbiome data

ChemScan PROCESS ANALYZERS

Lecture 3: Mixture Models for Microbiome data. Lecture 3: Mixture Models for Microbiome data

A (short) introduction to phylogenetics

Please do not adjust margins. Fig. S1 Schematic representation of fabrication of polymer network entwined GO thin-film composite membrane.

CHLORAMINATION AND CHLORAMINE ANALYSIS SW AWWA 2014

ST-DBSCAN: An Algorithm for Clustering Spatial-Temporal Data

Supporting Information

Isolation of atomically precise mixed ligand shell PdAu 24 clusters. Supporting Information

for XPS surface analysis

WASTEWATER CHLORINATION / DECHLORINATION MONITORING AND CONTROL PILOT DEMONSTRATION AT CITY OF GUELPH ONTARIO WWTF

Centrifuge: rapid and sensitive classification of metagenomic sequences

MEMBRANE SURFACE PLANT MONTHLY OPERATIONAL REPORT TO ALABAMA DEPARTMENT OF ENVIRONMENTAL MANAGEMENT MONTGOMERY, ALABAMA.

Using Topological Data Analysis to find discrimination between microbial states in human microbiome data

Mitsuhiko Kawakami and Zhenjiang Shen Department of Civil Engineering Faculty of Engineering Kanazawa University Japan ABSTRACT

Catalysis at the Sub-Nanoscale: Complex CO Oxidation Chemistry on a Few Au Atoms

Please do not adjust margins. A high-rate aqueous rechargeable zinc ion battery based on VS nanocomposite

MiSeq System. Denature and Dilute Libraries Guide

Supporting Information. Detection and Occurrence of Chlorinated By-products of Bisphenol A, Nonylphenol and

Science of Chloramination. Maine Water Utilities Association Michael Koza, Portland Water District June 2010

Supplementary Material: Characterizing Changes in Drought Risk for the United States from Climate Change [1] October 12, 2010

Genomic insights into the taxonomic status of the Bacillus cereus group. Laboratory of Marine Genetic Resources, Third Institute of Oceanography,

Real Estate Price Prediction with Regression and Classification CS 229 Autumn 2016 Project Final Report

Supplementary Information

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Identifying Disinfection Byproducts in Treated Water

A comparison of fully Bayesian and two-stage imputation strategies for missing covariate data

Inferring long-term trends in prairie reservoirs (Saskatchewan, Canada) through analysis of DNA in sediment cores

Introduction to Evolutionary Concepts

Figure S2: Representative TEM micrographs of the 1wt% Pt/ZSM-5 catalysts used in this work: (a) 2.A; (b) 2.B.

Keywords Haloacetic acids, Water analysis, 2-D ion chromatography, ICS-3000

A gas-phase amplified quartz crystal microbalance immunosensor based on catalase modified immunoparticles

Online Supplementary Material. MetaLP: A Nonparametric Distributed Learning Framework for Small and Big Data

Supplemental Online Results:

ECON Introductory Econometrics. Lecture 13: Internal and external validity

SUPPORTING INFORMATION. Control of the hierarchical assembly of π-conjugated. optoelectronic peptides by ph and flow

Gravity Analysis of Regional Economic Interdependence: In case of Japan

Species Level Deconvolution of Metagenome Assemblies with Hi C Based Contact Probability Maps

Supplementary Information

C HA R AC T E RIZ AT IO N O F INK J E T P RINT E R C A RT RIDG E INK S USING A CHEMOMETRIC APPROACH

LDM Package. 1 Overview. Yi-Juan Hu and Glen A. Satten March 19, 2018

( ( ( ( ( ( ( (! ! 1!

Using co-clustering to analyze spatio-temporal patterns: a case study based on spring phenology

A phylogenomic toolbox for assembling the tree of life

STATISTICAL ANALYSIS OF PYROSEQUENCE DATA KAREN KEATING. B.S., Arkansas State University, 1977 M.A., University of Maryland, 1981

CEE 371 Water and Wastewater Systems

Distance Measures. Objectives: Discuss Distance Measures Illustrate Distance Measures

Understanding Chlorine Measurement. Rebecca Luedee Environmental Sales

Genotyping By Sequencing (GBS) Method Overview

Electron capture dissociation of extremely supercharged protein. ions formed by electrospray ionisation

A Bayesian Criterion for Clustering Stability

No effect of Bt Cry1Ie toxin on bacterial diversity in the midgut of the Chinese honey bees, Apis cerana cerana (Hymenoptera, Apidae)

Censusing the Sea in the 21 st Century

Uranium deposit types: a systems perspective

Locating an Astronomy and Astrophysics Publication Set in a Map of the Full Scopus Database

A Prior Distribution of Bayesian Nonparametrics Incorporating Multiple Distances

Supplementary Information

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables

BIO 682 Multivariate Statistics Spring 2008

UNIVERSITY OF OTAGO EXAMINATIONS 2008

PROJECT 3A. WATER DISINFECTION

Soil fungal diversity follows plant diversity across the alpine grassland ecosystems of the Tibetan Plateau

SUPPLEMENTARY INFORMATION

1 Electronic Supporting Information. 2 The transformation and fate of silver nanoparticles in a paddy soil:

Supporting Information

Rarefaction Example. Consider this dataset: Original matrix:

Key Words Q Exactive, Accela, MetQuest, Mass Frontier, Drug Discovery

Molecular Characterization and Proposed Taxonomic Placement of the Biosimulant 'BG'

Electronic Supplementary Information

Transcription:

Electronic Supplementary Material (ESI) for Environmental Science: Water Research & Technology. This journal is The Royal Society of Chemistry 206 Table S. Summary of datasets used in this study No DOI number for paper Publication year Disinfectant residual type Country of sampling Sampling points Sequencing instrument Hypervariable region amplified Reference number Number of treatment plants Average % of matches in SILVA database 0.06/j.watres.205.2.00 205 Chlorinated UK POU Illumina Miseq V4 5 89.02% ± 3.79% 2 doi: 0.06/j.watres.204.0.049 204 Chlorinated UK DWDS Roche 454 FLX V-V3 28 3 0.06/j.watres.203..027 204 Chlorinated/Chloraminated USA POU Roche 454 FLX Titanium V-V2 32 5 7.5% ± 6.22% 79.69% ± 2.42% 4 0.06/j.ecoenv.204.07.029 204 Chlorinated China DWTP outlet, POU Roche 454 FLX Titanium V3-V4 37 * 6.6% ± 2.48% 5 0.28/AEM.0892-2 202 Chlorinated/Chloraminated USA DWDS Roche 454 FLX Titanium V4-V5 33 89.56% ± 3.88% 6 0.37/journal.pone.04087 205 Chlorinated/Chloraminated USA DWTP outlet, POU Illimina Miseq V4 34 7 0.02/acs.est.5b0352 205 Chlorinated China DWTP outlet, POU Roche 454 FLX Titanium V-V3 38 5 * 67.2% ± 8.08% 85.28% ± 8.29% 8 0.06/j.watres.203.0.07 203 Chlorinated China DWTP outlet Roche 454 FLX V-V3 0 96.92% 9 0.02/es5009467 204 No disinfectant residual Netherlands POU Roche 454 FLX Titanium V4-V6 26 ** 63.6% ± 4.24% 0 0.28/mBio.035-4 204 Chloraminated USA DWTP outlet, POU Roche 454 GS-FLX V4-V5 7 0./462-2920.2739 205 No disinfectant residual Netherlands DWDS Illumina MiSeq/Roche 454 GS-FLX 2 0.02/es502646d 204 Chlorinated/Chloraminated USA DWDS V4 and V5- V6 Illumina MiSeq V4 35 6 32** 69.98% ± 2.3% 45.22% ± 6.65% 68.93% ± 2.65% 3 0.007/s274-03-32-5 203 Chloraminated China DWTP outlet, POU Roche 454 GS-FLX V3-V5 39 29.9% ± 0.64% 4 0.28/AEM.0297-5 205 Chloraminated Australia DWTP outlet, DWDS Ion Torrent V3 43 2 90.24% ± 6.82% *Both studies sampled at the same treatment plant **Plant from study No.9 may have been sampled in study No.

Figure S: The number of samples retained (primary Y-axes, blue squares) with increasing subsampling depths and their corresponding Good s coverage (secondary Y-axes, red squares) is shown. A subsampling depth of 500 sequences per sample was chosen as this allowed for an average Good s coverage 0.8 while retaining maximum number of samples (42 out of 45) in the analyses. Number'of'samples' Average'Good's'coverage' 45.00 0.90 40 0.80 Number of samples 35 30 0.70 0.60 0.50 0.40 0.30 Average Good's coverage 25 0.20 0.0 20 0.00 0 250 500 750 000 250 500 750 2000 Subsampling depth!

Figure S2: Workflow illustrating data collection, data processing and data analysis steps. Data$collec)on$ Data$processing$ Data$analysis$ 6S,rRNA,gene:based, studies.,, Samples,from,full,scale, drinking,water,systems,, Sampling,points:,DWTP outlet,, DWDS,,Point,of,use., Microbial,regrowth, control,strategies:, Chlorinated, Chloraminated, Disinfectant, residual,free, Single,end, Fastq,files, sickle,v..33,,,(min,quality, score=28;,min, length=50bp),, Paired,end, Fastq,files, pear,v.0.8.46, (min,quality, score=28;,min, length=50bp),,, Fastq,to,FASTA,format,using, the,fastq_to_fasta,command, from,fastx:toolkit,v.0.0.3.2,, In,mothur,,unique.seqs., blastn,against,the,silva,9,ssuref_nr, database,(iden\ty,,97%,,e: value<0.000005.),, Extrac\on,of,full,length,sequences,from, SILVA,database., Data,colla\on, In,mothur:, Best:match,sequences,were,aligned,against,the, SILVA,seed,alignment.,, screen.seqs,command,(ver\cal,=t,and,trump,=.),, OTU,clustering,(cutoff,97%),, Taxonomic,classifica\on,(command,classify.seq), using,silva,taxonomy., Subsampled, OTU,table, (a), Full,dataset, (b), Taxonomic, classifica\on, Fasta,file, (OTUs), (3), Alpha,diversity,es\mates,(observed, OTUs,,Inverse,Simpson,index,, Shannon,index,,Pielou s,evenness), Beta,diversity,(Bray,Cur\s,distances), PERMANOVA,(func\on,Adonis),, Mean,rela\ve,abundance,of,taxonomic, groups,,otus,,poten\al,opportunis\c, pathogens,,target,groups,(nitrifiers,and, predatory,bacteria), Phylogene\c,tree,construc\on,in, RaxML, Es\ma\on,of,poten\al, contamina\on, Tax4Fun,,metabolic,profile, predic\on, Explanatory, variables, (4), (a), (4), (b), (3), (b), (a),

Figure S3: Heatmap of OTU abundance (OTUs with relative abundance >0.0% in the subsampled OTU table). Abundances were scaled with a Z-score transformation to improve visualization. The sampling location dendrogram was generated with Bray Curtis distances and UPGMA clustering method. Grouping information is indicated by the color legend (Chl: chlorinated, Chm: chloraminated, Drf: disinfectant residual-free). Sampling)loca,ons)

Figure S4: Dendrogram of sampling locations generated with Bray Curtis distances and UPGMA clustering method. Color legends indicate type of source water (SW: surface water; GW: groundwater; Mix: mixed source water including surface water, groundwater and desalinated seawater), and disinfection group (Chl: chlorinated, Chm: chloraminated, Drf: disinfectant residual-free). Reference numbers of each dataset are at the bottom of the plot.