Genotyping By Sequencing (GBS) Method Overview

Size: px
Start display at page:

Download "Genotyping By Sequencing (GBS) Method Overview"

Transcription

1 enotyping By Sequencing (BS) Method Overview RJ Elshire, JC laubitz, Q Sun, JV Harriman ES Buckler, and SE Mitchell

2 Topics Presented Background/oals BS lab protocol Illumina sequencing review BS adapter system How BS differs from RAD Modifying BS for different species Workflow in our lab

3 Background enotyping by sequencing (BS) in any large genome species requires reduction of genome complexity I Target enrichment Long range PCR of specific genes or genomic subsets Molecular inversion probes II Restriction Enzymes (REs) *Technically less challenging* Methylation sensitive REs filter out repetitive genomic fraction Sequence capture approaches hybridization based (microarrays)

4 QTL are often located in non coding regions Vgt1, Tb, B regulatory regions kb from gene QTL Exon Exon Exon Exon Map large numbers of genome wide markers Exon capture

5 Our goal is to create a public genotyping/informatics platform based on next generation sequencing Create inexpensive, robust multiplex sequencing protocol Low coverage Sequencing Illumina A Informatics Pipelin Combine genotypic & phenotypic data for S and WAS Impute missing data Anchor markers across the genome

6 Open Source Method available for anyone to use / modify Analysis pipeline details and code are public Promote dataset compatibility Method published in PLoS ONE to promote accessibility enotype calls publically available

7 Overview of enotyping by Sequencing (BS) Restriction site ( ) sequence tag Sample1 Loss of cut site < 450 bp Sample2 Focuses Nexten sequencing power to ends of restriction fragments Scores both SNPs and presence/absence markers

8 BS is a simple, highly multiplexed system for constructing libraries for next gen sequencing Reduced sample handling Few PCR & purification steps No DNA size fractionation Efficient barcoding system Simultaneous marker discovery & genotyping Scales very well

9 BS 96 plex Protocol ( overview) Now at 384 plex 1 Plate DNA & adapter pair 2 Digest DNA with RE 3 Ligate adapters

10 BS Adapters and Enzymes Barcode Adapter P1 Sticky Ends Common Adapter P2 Illumina Sequencing Primer 2 Barcode (4 8 bp) Restriction Enzymes Illumina Sequencing Primer 1 ApeKI CWC PstI CTCA EcoT22I ATCA T 5 3

11 BS 96 plex Protocol ( overview) 1 Plate DNA & adapter pair 2 Digest DNA with RE 3 Ligate adapters 4 Pool 96 samples Clean up Primers 5 PCR

12 PRC primers: Pooled Digestion/Ligation Reactions (n=94) PCR BS Library Insert Insert P1 P2 O1 P1 P2 O2

13 BS 96 plex Protocol ( overview) Clean up Clean up 1 Plate DNA & adapter pair 2 Digest DNA with RE 3 Ligate adapters 4 Pool 96 DNA aliquots Primers 5 PCR 6 Evaluate fragment sizes

14 Perform Titration to Minimize Adapter Dimers Before Sequencing NOTE: Done once with a small number of samples Adapter dimers constitute only 005% of raw sequence reads Size Standards Optimal adapter amount Fluorescense intensity 15 bp Adapter Dimer Library 1500 bp Time Time

15 Small Fragments are Enriched in BS Libraries 40% 35% 30% 25% REFENOME B73 Refen v1 IBM (B73 X Mo17) RILs 20% 15% 10% 5% 0% Total Fragments >10000 ApeKI fragment size (bp)

16 384 plex BS Results for Maize Reads Mean read count per line = 528,000 cv =

17 Illumina Sequencing by Synthesis Review Flowcell 8 channels Solid Phase Oligos

18 Cluster Formation Amplifies Sequencing Signal Denatured Library Bridge Amplification Flow cell with bound oligos Linearization

19 A C T C A C T C T T P1 primer C A C T C T A A C T C T C T T TC TCA Sequencing by Synthesis Flowcell HiSeq 2000

20

21 A C T C T A C T C T C A C T C T T N A C T C T A C T C T A C T C T In Phase Out of Phase

22 BS captures barcode and insert DNA sequence in single read Sequencing Primer 1 Read 1 Sequencing Primer 1 Read 1 Barcode RE cut site Insert DNA Insert DNA Sequencing Primer 2 Barcode Read 2 Illumina BS

23 Variable Length BS Barcodes Solves Sequence PhasingIssues First 12 nt used to calculate phasing Algorithm assumes random nt distribution Incorrect phasing causes incorrect base calls ood design and modulating the RE cut site position with variable length barcodes produces even nt distribution Read 1 Insert DNA Barcode Read 2 Barcode RE cut site Insert DNA Illumina BS

24 Invariant BS barcodes cause loss of signal intensity

25 Successful BS sequencing run

26 Most significant BS technical issue? DNA quality

27 BS does not use standard Y adapters Standard BS Ligation PCR Ligation 100% Different Ends 50% Same Ends 50%

28 Same ended Fragments Do Not Form Clusters Denatured library Bridge Amplification Cleaved from surface Cluster Formation & Linearization No P1 binding site Flow Cell with Bound Oligos

29 BS vs RAD Restriction site ( ) sequence tag Sample1 Loss of cut site < 450 bp Sample2 Focuses Nexten sequencing power to ends of restriction fragments Scores both SNPs and presence/absence markers

30 Digest Ligate adapters Pool Random shear Size select Ligate Y adapters PCR RAD BS Reference Davey et al 2011

31 Modifying BS Considerations for using BS with new species and / or different enzymes

32 Why Modify the BS Protocol? More markers Fewer markers (deeper sequence coverage per locus) Increase multiplexing More genome appropriate (avoid more repetitive DNA classes) Other novel applications (ie, bisulfite sequencing)

33 enome Sampling Strategies Vary by Species Dependent on Factors that Affect Diversity: Mating System (Outcrosser, inbreeder, clonal?) Ploidy (Haploid, diploid, auto or allopolyploid?) eographical Distribution (Island population, cosmopolitan?)

34 Other Factors enome size The size of the genome has some bearing on the size of the fragment pool Amount of repetitive DNA directly correlated with genome size enome composition The composition of the genome can affect the frequency and distribution of the cut sites

35 Sampling large genomes with methylationsensitive restriction enzymes BS Library Sequenced Fragments Methylated DNA 5 base cutter Unmethylated DNA 6 base cutter

36 Optimizing BS in New Species rape Maize Cacao Sorghum Rice Shrub willow oose Barley Raspberry Deer Mouse Tunicate Vole iant squid Cassava Cassava Solitary Bee Yeast Scrub jay

37 Choosing Appropriate Restriction Enzymes Life is like a bowl of chocolates

38 ApeKI iant Squid PstI oose

39 ApeKI Deer Mouse PstI

40 ApeKI Shrub Willow PstI EcoT22I

41 BS Adapter Design

42 Barcode Design Considerations Barcode sets are enzyme specific Must not recreate the enzyme recognition site Must have complementary overhangs Sets must be of variable length Bases must be well balanced at each position Must different enough from each other to avoid confusion if there is a sequencing error At least 3 bp differences among barcodes Must not nest within other barcodes No mononucleotide runs of 3 or more bases adapters

43 BS SNP calls in Sorghum bicolor c10hmp alleles chrom SC283 BR007B RIL_100 RIL_101 RIL_102 RIL_103 RIL_104 RIL_105 RIL_106 RIL_107 RIL_108 RIL_109 RIL_10 RIL_110 RIL_111 RIL_112 RIL_113 RIL_114 RIL_115 S10_29954 C/A 10 A N A C A N N C C N N N N N N N N C N S10_29963 A/C 10 C N C A C N N A A N N N N N N N N A N S10_29965 T/C 10 C N C T C N N T T N N N N N N N N T N S10_84178 C/ 10 N N C N N N N N N N N N N C N N N N S10_84512 T/C 10 T T N N T N T N N N N C N N N N N N N S10_84513 C/T 10 C C N N C N C N N N N T N N N N N N N S10_85556 /A 10 A A N N N R N A N R S10_85556 /A 10 N N A A N N N N A N S10_ T/C 10 T C T C T C T N N T T N N N C N N T T S10_ T/C 10 T C N N N N N T T N N N N N N N N N N S10_ /A 10 N N A N N N N N N N A A N N S10_ /A 10 A N A A N N N N N N N N N S10_ /A 10 A N A N N N N N N N N N N N S10_ T/ 10 T N T N N N N T T T N N S10_ C/T 10 T C N C T N T N N T T N T C C C N T N S10_ /A 10 A N A N N A A N S10_ C/ 10 N C S C C C C N C C N C N C N S10_ /A 10 N A A A N N N A A N S10_ C/T 10 C N N T N N N N C N C N N N N N N C C S10_ T/ 10 T N T N T T N T T T T T N N T N T S10_ T/ 10 T T T N T T T T T T N T T T S10_ /A 10 N A N A N N N N N N A N N N N S10_ /T 10 T T N N N N N N N T S10_ C/T 10 C T C T C N N C C C N N N N N T C C C S10_ T/C 10 T C T C T C N N T T N N T N N N T N N S10_ C/A 10 N N N N A N A N N N N N N N N N N N N S10_ /A 10 N N N N A N A N N N N N N N N N N N N S10_ C/T 10 N N C N N N N C T C N N C T N N C N C S10_ T/C 10 T N T N N N N N N N N N N N N N N T T S10_ /T 10 N N N N N N N N N N N N N N S10_ A/T 10 A T N N A N N A A A N A A N T T A A A S10_ A/T 10 A T A T N N N A N A N A N N T T A N A S10_ C/T 10 N N C N C N C C C N N N C T N N C N C S10_ C/T 10 C T C T C N N C N C C N C T T N C C N S10_ A/C 10 A C A N A N A A A A A A A C C N A A A S10_ T/C 10 T C T C T C N T T T T N T N C N T T T

44 Missing Data Strategies Impute Technical Options Reduce the multiplexing level Sequence the same library multiple times Molecular Options Choose less frequently cutting enzymes

45 BS Workflow DNA sample info entered to webform Project approved? Samples shipped HTS database Lab BS libraries made Analysis Pipelines DNA sequence data Reference genome Non reference genome Data File SNPs/Sample Data File States/Sample

46 BS Team Rob Elshire Ed Buckler Sharon Mitchell Jeff laubitz James Harriman Qi Sun

Genotyping By Sequencing (GBS) Method Overview

Genotyping By Sequencing (GBS) Method Overview enotyping By Sequencing (BS) Method Overview Sharon E Mitchell Institute for enomic Diversity Cornell University http://wwwmaizegeneticsnet/ Topics Presented Background/oals BS lab protocol Illumina sequencing

More information

GBS Bioinformatics Pipeline(s) Overview

GBS Bioinformatics Pipeline(s) Overview GBS Bioinformatics Pipeline(s) Overview Getting from sequence files to genotypes. Pipeline Coding: Ed Buckler Jeff Glaubitz James Harriman Presentation: Terry Casstevens With supporting information from

More information

GBS Bioinformatics Pipeline(s) Overview

GBS Bioinformatics Pipeline(s) Overview GBS Bioinformatics Pipeline(s) Overview Getting from sequence files to genotypes. Pipeline Coding: Ed Buckler Jeff Glaubitz James Harriman Presentation: Rob Elshire With supporting information from the

More information

Fei Lu. Post doctoral Associate Cornell University

Fei Lu. Post doctoral Associate Cornell University Fei Lu Post doctoral Associate Cornell University http://www.maizegenetics.net Genotyping by sequencing (GBS) is simple and cost effective 1. Digest DNA 2. Ligate adapters with barcodes 3. Pool DNAs 4.

More information

Accounting for read depth in the analysis of genotyping-by-sequencing data

Accounting for read depth in the analysis of genotyping-by-sequencing data Accounting for read depth in the analysis of genotyping-by-sequencing data Ken Dodds, John McEwan, Timothy Bilton, Rudi Brauning, Rayna Anderson, Tracey Van Stijn, Theodor Kristjánsson, Shannon Clarke

More information

New imputation strategies optimized for crop plants: FILLIN (Fast, Inbred Line Library ImputatioN) FSFHap (Full Sib Family Haplotype)

New imputation strategies optimized for crop plants: FILLIN (Fast, Inbred Line Library ImputatioN) FSFHap (Full Sib Family Haplotype) New imputation strategies optimized for crop plants: FILLIN (Fast, Inbred Line Library ImputatioN) FSFHap (Full Sib Family Haplotype) Kelly Swarts PAG Allele Mining 1/11/2014 Imputation is the projection

More information

SNPs versus sequences for phylogeography an explora:on using simula:ons and massively parallel sequencing in a non- model bird

SNPs versus sequences for phylogeography an explora:on using simula:ons and massively parallel sequencing in a non- model bird SNPs versus sequences for phylogeography an explora:on using simula:ons and massively parallel sequencing in a non- model bird Michael G. Harvey, Brian T. Smith, Brant C. Faircloth, Travis C. Glenn, and

More information

Eppendorf twin.tec PCR Plates 96 LoBind Increase Yield of Transcript Species and Number of Reads of NGS Libraries

Eppendorf twin.tec PCR Plates 96 LoBind Increase Yield of Transcript Species and Number of Reads of NGS Libraries APPLICATION NOTE No. 375 I December 2016 Eppendorf twin.tec PCR Plates 96 LoBind Increase Yield of Transcript Species and Number of Reads of NGS Libraries Hanae A. Henke¹, Björn Rotter² ¹Eppendorf AG,

More information

Analysis of Y-STR Profiles in Mixed DNA using Next Generation Sequencing

Analysis of Y-STR Profiles in Mixed DNA using Next Generation Sequencing Analysis of Y-STR Profiles in Mixed DNA using Next Generation Sequencing So Yeun Kwon, Hwan Young Lee, and Kyoung-Jin Shin Department of Forensic Medicine, Yonsei University College of Medicine, Seoul,

More information

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012 Quantile based Permutation Thresholds for QTL Hotspots Brian S Yandell and Elias Chaibub Neto 17 March 2012 2012 Yandell 1 Fisher on inference We may at once admit that any inference from the particular

More information

Comparative Genomics of Fagaceae

Comparative Genomics of Fagaceae Fagaceae Images.google.com Linkage Map www.quia.com TM www.clipartlord.com Selection of mapping parents SM2 SM1 Predominant pollinator? Progeny Exclusion for Full Sib Linkage Mapping Year Acorns genotyped

More information

2. Yeast two-hybrid system

2. Yeast two-hybrid system 2. Yeast two-hybrid system I. Process workflow a. Mating of haploid two-hybrid strains on YPD plates b. Replica-plating of diploids on selective plates c. Two-hydrid experiment plating on selective plates

More information

Principles of QTL Mapping. M.Imtiaz

Principles of QTL Mapping. M.Imtiaz Principles of QTL Mapping M.Imtiaz Introduction Definitions of terminology Reasons for QTL mapping Principles of QTL mapping Requirements For QTL Mapping Demonstration with experimental data Merit of QTL

More information

Easy Illumina Nextera DNA FLEX Library Preparation using the epmotion 5075t automated liquid handler

Easy Illumina Nextera DNA FLEX Library Preparation using the epmotion 5075t automated liquid handler WHITE PAPER No. 13 Easy Illumina Nextera DNA FLEX Library Preparation using the epmotion 5075t automated liquid handler Executive Summary Library preparation steps, including DNA extraction, quantification,

More information

Supplementary Figure 1. Phenotype of the HI strain.

Supplementary Figure 1. Phenotype of the HI strain. Supplementary Figure 1. Phenotype of the HI strain. (A) Phenotype of the HI and wild type plant after flowering (~1month). Wild type plant is tall with well elongated inflorescence. All four HI plants

More information

Developing and implementing molecular markers in perennial ryegrass breeding

Developing and implementing molecular markers in perennial ryegrass breeding Developing and implementing molecular markers in perennial ryegrass breeding K.F. Smith 1,3, J.W. Forster 2,3, T.A. Ciavarella 1,3, J.L. Dumsday 2, M.P. Dupal 2,3, E.S. Jones 2,3, B.D. Kirkwood 1,3, A.

More information

Is KIT locus polymorphism rs related to white belt phenotype in Krškopolje pig?

Is KIT locus polymorphism rs related to white belt phenotype in Krškopolje pig? Is KIT locus polymorphism rs328592739 related to white belt phenotype in Krškopolje pig? Jernej Ogorevc, Minja Zorc, Martin Škrlep, Riccardo Bozzi, Matthias Petig, Luca Fontanesi, Marjeta Čandek-Potokar,

More information

Automated Illumina TruSight HLA v2 Sequencing Panel Library Preparation with the epmotion 5075t

Automated Illumina TruSight HLA v2 Sequencing Panel Library Preparation with the epmotion 5075t SHORT PROTOCOL No. 41 Automated Illumina TruSight HLA v2 Sequencing Panel Library Preparation with the epmotion 5075t Introduction This protocol describes the workstation configuration and pre-programmed

More information

Exhaustive search. CS 466 Saurabh Sinha

Exhaustive search. CS 466 Saurabh Sinha Exhaustive search CS 466 Saurabh Sinha Agenda Two different problems Restriction mapping Motif finding Common theme: exhaustive search of solution space Reading: Chapter 4. Restriction Mapping Restriction

More information

Bias in RNA sequencing and what to do about it

Bias in RNA sequencing and what to do about it Bias in RNA sequencing and what to do about it Walter L. (Larry) Ruzzo Computer Science and Engineering Genome Sciences University of Washington Fred Hutchinson Cancer Research Center Seattle, WA, USA

More information

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Paper by: James P. Balhoff and Gregory A. Wray Presentation by: Stephanie Lucas Reviewed

More information

Dr. Jennifer Weller WORKFLOW DURING THE B3 CAMP MAKING SOLUTIONS FROM STOCKS. B3 Summer Science Camp at Olympic High School

Dr. Jennifer Weller WORKFLOW DURING THE B3 CAMP MAKING SOLUTIONS FROM STOCKS. B3 Summer Science Camp at Olympic High School Dr. Jennifer Weller WORKFLOW DURING THE B3 CAMP MAKING SOLUTIONS FROM STOCKS B3 Summer Science Camp at Olympic High School LAB WORKFLOW OVERVIEW Collect Samples (June 12 th ) Extract DNA from the samples

More information

allosteric cis-acting DNA element coding strand dominant constitutive mutation coordinate regulation of genes denatured

allosteric cis-acting DNA element coding strand dominant constitutive mutation coordinate regulation of genes denatured A B C D E F G H I J K L M N O P Q R S T U V W X Y Z AA BB CC DD EE FF GG HH II JJ KK LL MM NN OO PP QQ RR SS TT UU VV allosteric cis-acting DNA element coding strand codominant constitutive mutation coordinate

More information

GBS Bioinformatics Pipeline

GBS Bioinformatics Pipeline GBS Bioinformatics Pipeline...or, Where Your Data Go After Sequencing James Harriman Ed Buckler Jeff Glaubitz Reference Genome Pipeline QseqToTagCount Qseq Key files QseqToTBT TagCounts per lane TagsByTaxa

More information

Quiz Section 4 Molecular analysis of inheritance: An amphibian puzzle

Quiz Section 4 Molecular analysis of inheritance: An amphibian puzzle Genome 371, Autumn 2018 Quiz Section 4 Molecular analysis of inheritance: An amphibian puzzle Goals: To illustrate how molecular tools can be used to track inheritance. In this particular example, we will

More information

Multivariate analysis of genetic data an introduction

Multivariate analysis of genetic data an introduction Multivariate analysis of genetic data an introduction Thibaut Jombart MRC Centre for Outbreak Analysis and Modelling Imperial College London Population genomics in Lausanne 23 Aug 2016 1/25 Outline Multivariate

More information

Supplementary Figure 1. Schematic of split-merger microfluidic device used to add transposase to template drops for fragmentation.

Supplementary Figure 1. Schematic of split-merger microfluidic device used to add transposase to template drops for fragmentation. Supplementary Figure 1. Schematic of split-merger microfluidic device used to add transposase to template drops for fragmentation. Inlets are labelled in blue, outlets are labelled in red, and static channels

More information

Whole genome sequencing (WGS) - there s a new tool in town. Henrik Hasman DTU - Food

Whole genome sequencing (WGS) - there s a new tool in town. Henrik Hasman DTU - Food Whole genome sequencing (WGS) - there s a new tool in town Henrik Hasman DTU - Food Welcome to the NGS world TODAY Welcome Introduction to Next Generation Sequencing DNA purification (Hands-on) Lunch (Sandwishes

More information

EST1 Homology Domain. 100 aa. hest1a / SMG6 PIN TPR TPR. Est1-like DBD? hest1b / SMG5. TPR-like TPR. a helical. hest1c / SMG7.

EST1 Homology Domain. 100 aa. hest1a / SMG6 PIN TPR TPR. Est1-like DBD? hest1b / SMG5. TPR-like TPR. a helical. hest1c / SMG7. hest1a / SMG6 EST1 Homology Domain 100 aa 853 695 761 780 1206 hest1 / SMG5 -like? -like 109 145 214 237 497 165 239 1016 114 207 212 381 583 hest1c / SMG7 a helical 1091 Sc 57 185 267 284 699 Figure S1:

More information

Design and Synthesis of the Comprehensive Fragment Library

Design and Synthesis of the Comprehensive Fragment Library YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,

More information

Automated Illumina TruSeq Stranded mrna library construction with the epmotion 5075t/TMX

Automated Illumina TruSeq Stranded mrna library construction with the epmotion 5075t/TMX SHORT PROTOCOL No. 02 I November 2014 Automated Illumina TruSeq Stranded mrna library construction with the epmotion 5075t/TMX Introduction For the MiSeq and HiSeq next generation sequencing (NGS) systems,

More information

Comparative Network Analysis

Comparative Network Analysis Comparative Network Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by

More information

Seminar: MPS applica/ons in the forensic DNA IDen/fica/on Solu/ons Vienna, May

Seminar: MPS applica/ons in the forensic DNA IDen/fica/on Solu/ons Vienna, May Seminar: MPS applica/ons in the forensic DNA workflow@human IDen/fica/on Solu/ons 2017 - Vienna, May 16 2017 Systematic Evaluation of the Early Access Applied Biosystems Precision ID Globalfiler Mixture

More information

Amy Driskell. Laboratories of Analytical Biology National Museum of Natural History Smithsonian Institution, Wash. DC

Amy Driskell. Laboratories of Analytical Biology National Museum of Natural History Smithsonian Institution, Wash. DC DNA Barcoding Amy Driskell Laboratories of Analytical Biology National Museum of Natural History Smithsonian Institution, Wash. DC 1 Outline 1. Barcoding in general 2. Uses & Examples 3. Barcoding Bocas

More information

Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles

Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles Illegitimate translation causes unexpected gene expression from on-target out-of-frame alleles created by CRISPR-Cas9 Shigeru Makino, Ryutaro Fukumura, Yoichi Gondo* Mutagenesis and Genomics Team, RIKEN

More information

NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157)

NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Statement Q1 Expected coverage Merit Excellence

More information

Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab

Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab Introduction: David Form, Nashoba Regional High School And Jessica Forton, Melrose High School Bioinformatics Component

More information

Introducing New Technology for Liquid Handling Quality Assurance

Introducing New Technology for Liquid Handling Quality Assurance Introducing New Technology for Liquid Handling Quality Assurance Bill Gigante and John Bradshaw Ph.D. Tutorial Topics Product overview Real world example Compound Management Technology Operation Maintenance

More information

Microsatellite data analysis. Tomáš Fér & Filip Kolář

Microsatellite data analysis. Tomáš Fér & Filip Kolář Microsatellite data analysis Tomáš Fér & Filip Kolář Multilocus data dominant heterozygotes and homozygotes cannot be distinguished binary biallelic data (fragments) presence (dominant allele/heterozygote)

More information

Matrix and Steiner-triple-system smart pooling assays for high-performance transcription regulatory network mapping

Matrix and Steiner-triple-system smart pooling assays for high-performance transcription regulatory network mapping Matrix and Steiner-triple-system smart pooling assays for high-performance transcription regulatory network mapping Vanessa Vermeirssen, Bart Deplancke, M Inmaculada Barrasa, John S Reece-Hoyes, H Efsun

More information

BTRY 7210: Topics in Quantitative Genomics and Genetics

BTRY 7210: Topics in Quantitative Genomics and Genetics BTRY 7210: Topics in Quantitative Genomics and Genetics Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine jgm45@cornell.edu February 12, 2015 Lecture 3:

More information

Ion Torrent. The chip is the machine

Ion Torrent. The chip is the machine Ion Torrent Introduction The Ion Personal Genome Machine [PGM] is simple, more costeffective, and more scalable than any other sequencing technology. Founded in 2007 by Jonathan Rothberg. Part of Life

More information

Introduc)on to RNA- Seq Data Analysis. Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas

Introduc)on to RNA- Seq Data Analysis. Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas Introduc)on to RNA- Seq Data Analysis Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas Material: hep://)ny.cc/rnaseq Slides: hep://)ny.cc/slidesrnaseq

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

Optimization of Covaris Settings for Shearing Bacterial Genomic DNA by Focused Ultrasonication and Analysis Using Agilent 2200 TapeStation

Optimization of Covaris Settings for Shearing Bacterial Genomic DNA by Focused Ultrasonication and Analysis Using Agilent 2200 TapeStation Optimization of Covaris Settings for Shearing Bacterial Genomic DNA by Focused Ultrasonication and Analysis Using Agilent 22 TapeStation Application Note Authors Richard Jeannotte, Eric Lee, Narine Arabyan,

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

Variant visualisation and quality control

Variant visualisation and quality control Variant visualisation and quality control You really should be making plots! 25/06/14 Paul Theodor Pyl 1 Classical Sequencing Example DNA.BAM.VCF Aligner Variant Caller A single sample sequencing run 25/06/14

More information

Creative Commons Attribution 2.0 (

Creative Commons Attribution 2.0 ( v1.5 Target Enrichment of Illumina Libraries B. Faircloth August 21, 2013 Ecol. and Evol. Biology Univ. of California Los Angeles The goal here is to adapt the SureSelect/Mycroarray enrichment kits for

More information

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it? Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains

More information

Genomics and bioinformatics summary. Finding genes -- computer searches

Genomics and bioinformatics summary. Finding genes -- computer searches Genomics and bioinformatics summary 1. Gene finding: computer searches, cdnas, ESTs, 2. Microarrays 3. Use BLAST to find homologous sequences 4. Multiple sequence alignments (MSAs) 5. Trees quantify sequence

More information

Introduction to analyzing NanoString ncounter data using the NanoStringNormCNV package

Introduction to analyzing NanoString ncounter data using the NanoStringNormCNV package Introduction to analyzing NanoString ncounter data using the NanoStringNormCNV package Dorota Sendorek May 25, 2017 Contents 1 Getting started 2 2 Setting Up Data 2 3 Quality Control Metrics 3 3.1 Positive

More information

MRC-Holland MLPA. Description version 14; 21 January 2015

MRC-Holland MLPA. Description version 14; 21 January 2015 SALSA MLPA probemix P229-B2 OPA1 Lot B2-0412. As compared to version B1-0809, two reference probes and the 88 and 96 nt control fragments have been replaced (QDX2). The OPA1 gene product is a nuclear-encoded

More information

Update Report. Hauser, Lorenz

Update Report. Hauser, Lorenz Update Report Hauser, Lorenz Period: 2/1/2012-1/31/2013 Project: R/LME-6 - Local adaptation in Puget Sound Pacific cod (Gadus macrocephalus): phenotypic and genomic differentiation and the conservation

More information

Designer Genes C Test

Designer Genes C Test Northern Regional: January 19 th, 2019 Designer Genes C Test Name(s): Team Name: School Name: Team Number: Rank: Score: Directions: You will have 50 minutes to complete the test. You may not write on the

More information

Frequently Asked Questions (FAQs)

Frequently Asked Questions (FAQs) Frequently Asked Questions (FAQs) Q1. What is meant by Satellite and Repetitive DNA? Ans: Satellite and repetitive DNA generally refers to DNA whose base sequence is repeated many times throughout the

More information

Learning gene regulatory networks Statistical methods for haplotype inference Part I

Learning gene regulatory networks Statistical methods for haplotype inference Part I Learning gene regulatory networks Statistical methods for haplotype inference Part I Input: Measurement of mrn levels of all genes from microarray or rna sequencing Samples (e.g. 200 patients with lung

More information

Curriculum Links. AQA GCE Biology. AS level

Curriculum Links. AQA GCE Biology. AS level Curriculum Links AQA GCE Biology Unit 2 BIOL2 The variety of living organisms 3.2.1 Living organisms vary and this variation is influenced by genetic and environmental factors Causes of variation 3.2.2

More information

Heterozygous BMN lines

Heterozygous BMN lines Optical density at 80 hours 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 a YPD b YPD + 1µM nystatin c YPD + 2µM nystatin d YPD + 4µM nystatin 1 3 5 6 9 13 16 20 21 22 23 25 28 29 30

More information

Network Biology-part II

Network Biology-part II Network Biology-part II Jun Zhu, Ph. D. Professor of Genomics and Genetic Sciences Icahn Institute of Genomics and Multi-scale Biology The Tisch Cancer Institute Icahn Medical School at Mount Sinai New

More information

Single Cell Sequencing

Single Cell Sequencing Single Cell Sequencing Fundamental unit of life Autonomous and unique Interactive Dynamic - change over time Evolution occurs on the cellular level Robert Hooke s drawing of cork cells, 1665 Type Prokaryotes

More information

TruSight Cancer Workflow on the MiniSeq System

TruSight Cancer Workflow on the MiniSeq System TruSight Cancer Workflow on the MiniSeq System Prepare Library Sequence Analyze Data TruSight Cancer 1.5 days ~ 24 hours < 2 hours TruSight Cancer Library Prep MiniSeq System Local Run Manager Enrichment

More information

Microbiome: 16S rrna Sequencing 3/30/2018

Microbiome: 16S rrna Sequencing 3/30/2018 Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics

More information

Lecture WS Evolutionary Genetics Part I 1

Lecture WS Evolutionary Genetics Part I 1 Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in

More information

Whole-genome amplification in doubledigest RADseq results in adequate libraries but fewer sequenced loci

Whole-genome amplification in doubledigest RADseq results in adequate libraries but fewer sequenced loci Whole-genome amplification in doubledigest RADseq results in adequate libraries but fewer sequenced loci Bruno A. S. de Medeiros and Brian D. Farrell Department of Organismic and Evolutionary Biology and

More information

Genetics 275 Notes Week 7

Genetics 275 Notes Week 7 Cytoplasmic Inheritance Genetics 275 Notes Week 7 Criteriafor recognition of cytoplasmic inheritance: 1. Reciprocal crosses give different results -mainly due to the fact that the female parent contributes

More information

High-throughput sequencing: Alignment and related topic

High-throughput sequencing: Alignment and related topic High-throughput sequencing: Alignment and related topic Simon Anders EMBL Heidelberg HTS Platforms E s ta b lis h e d p la tfo rm s Illu m in a H is e q, A B I S O L id, R o c h e 4 5 4 N e w c o m e rs

More information

Cycle «Analyse de données de séquençage à haut-débit»

Cycle «Analyse de données de séquençage à haut-débit» Cycle «Analyse de données de séquençage à haut-débit» Module 1/5 Analyse ADN Chadi Saad CRIStAL - Équipe BONSAI - Univ Lille, CNRS, INRIA (chadi.saad@univ-lille.fr) Présentation de Sophie Gallina (source:

More information

Eukaryotic Gene Expression

Eukaryotic Gene Expression Eukaryotic Gene Expression Lectures 22-23 Several Features Distinguish Eukaryotic Processes From Mechanisms in Bacteria 123 Eukaryotic Gene Expression Several Features Distinguish Eukaryotic Processes

More information

Supplementary Figure 1. Markedly decreased numbers of marginal zone B cells in DOCK8 mutant mice Supplementary Figure 2.

Supplementary Figure 1. Markedly decreased numbers of marginal zone B cells in DOCK8 mutant mice Supplementary Figure 2. Supplementary Figure 1. Markedly decreased numbers of marginal zone B cells in DOCK8 mutant mice. Percentage of marginal zone B cells in the spleen of wild-type mice (+/+), mice homozygous for cpm or pri

More information

Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall

Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100 Fig. 1.13 RNA-seq Expression effects of cancer AAAAA AAAAA AAAAA Solexa sequencing counts

More information

DEVELOP YOUR LAB, YOUR WAY

DEVELOP YOUR LAB, YOUR WAY Technical brochure DEVELOP YOUR LAB, YOUR WAY The FLOW Solution: Expand your potential today Redesign your lab with the FLOW Solution The FLOW Solution is a highly flexible modular, semi-automated data

More information

Eiji Yamamoto 1,2, Hiroyoshi Iwata 3, Takanari Tanabata 4, Ritsuko Mizobuchi 1, Jun-ichi Yonemaru 1,ToshioYamamoto 1* and Masahiro Yano 5,6

Eiji Yamamoto 1,2, Hiroyoshi Iwata 3, Takanari Tanabata 4, Ritsuko Mizobuchi 1, Jun-ichi Yonemaru 1,ToshioYamamoto 1* and Masahiro Yano 5,6 Yamamoto et al. BMC Genetics 2014, 15:50 METHODOLOGY ARTICLE Open Access Effect of advanced intercrossing on genome structure and on the power to detect linked quantitative trait loci in a multi-parent

More information

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic Cross Discipline Analysis made possible with Data Pipelining J.R. Tozer SciTegic System Genesis Pipelining tool created to automate data processing in cheminformatics Modular system built with generic

More information

Markov Models & DNA Sequence Evolution

Markov Models & DNA Sequence Evolution 7.91 / 7.36 / BE.490 Lecture #5 Mar. 9, 2004 Markov Models & DNA Sequence Evolution Chris Burge Review of Markov & HMM Models for DNA Markov Models for splice sites Hidden Markov Models - looking under

More information

Elecsys with ECL technology Still light years ahead

Elecsys with ECL technology Still light years ahead Elecsys with ECL technology Still light years ahead The unique technology behind Elecsys immunoassays ECL ElectroChemiLuminescence ECL technology is Roche s highly innovative detection technology for heterogeneous

More information

Automated Illumina TruSeq DNA PCR-Free library construction with the epmotion 5075t/TMX

Automated Illumina TruSeq DNA PCR-Free library construction with the epmotion 5075t/TMX SHORT PROTOCOL No. 15 I August 2016 Automated Illumina TruSeq DNA PCR-Free library construction with the epmotion 5075t/TMX Introduction This protocol describes the configuration and preprogrammed methods

More information

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013 Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic

More information

In-Solution Digestion: Multi-Plate v1.0 Quick Start Guide

In-Solution Digestion: Multi-Plate v1.0 Quick Start Guide In-Solution Digestion: Multi-Plate v1.0 Quick Start Guide This guide is for users who have been trained in the proper use of the AssayMAP Bravo Platform and understand the safety guidelines in the Bravo

More information

Fine Mapping and Candidate Gene Characterization of the Pepper Bacterial Spot Resistance Gene bs6

Fine Mapping and Candidate Gene Characterization of the Pepper Bacterial Spot Resistance Gene bs6 Fine Mapping and Candidate Gene Characterization of the Pepper Bacterial Spot Resistance Gene bs6 Rebecca L. Wente, Jian Li, Samuel F. Hutton, Upinder Gill, Jeffrey B. Jones, Gerald V. Minsavage, Robert

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Levels of genetic variation for a single gene, multiple genes or an entire genome

Levels of genetic variation for a single gene, multiple genes or an entire genome From previous lectures: binomial and multinomial probabilities Hardy-Weinberg equilibrium and testing HW proportions (statistical tests) estimation of genotype & allele frequencies within population maximum

More information

*: Division of Biological Sciences; University of Missouri; Columbia, MO, 65211

*: Division of Biological Sciences; University of Missouri; Columbia, MO, 65211 Genetics: Early Online, published on July 20, 2016 as 10.1534/genetics.116.191726 Fast-Flowering Mini-Maize: Seed to Seed in 60 Days Morgan E. McCaw*, Jason G. Wallace,1, Patrice S. Albert*, Edward S.

More information

Potato Genome Analysis

Potato Genome Analysis Potato Genome Analysis Xin Liu Deputy director BGI research 2016.1.21 WCRTC 2016 @ Nanning Reference genome construction???????????????????????????????????????? Sequencing HELL RIEND WELCOME BGI ZHEN LLOFRI

More information

Nature Genetics: doi: /ng Supplementary Figure 1. The phenotypes of PI , BR121, and Harosoy under short-day conditions.

Nature Genetics: doi: /ng Supplementary Figure 1. The phenotypes of PI , BR121, and Harosoy under short-day conditions. Supplementary Figure 1 The phenotypes of PI 159925, BR121, and Harosoy under short-day conditions. (a) Plant height. (b) Number of branches. (c) Average internode length. (d) Number of nodes. (e) Pods

More information

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database Overview - MS Proteomics in One Slide Obtain protein Digest into peptides Acquire spectra in mass spectrometer MS masses of peptides MS/MS fragments of a peptide Results! Match to sequence database 2 But

More information

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein

More information

Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism

Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism Ling Yin 1,3,, Yunhe An 1,2,, Junjie Qu 3,, Xinlong Li 1, Yali Zhang 1, Ian Dry 5, Huijun Wu 2*, Jiang Lu 1,4** 1 College

More information

Biology 644: Bioinformatics

Biology 644: Bioinformatics A stochastic (probabilistic) model that assumes the Markov property Markov property is satisfied when the conditional probability distribution of future states of the process (conditional on both past

More information

Encoding and Decoding 3D Genome Organization

Encoding and Decoding 3D Genome Organization Encoding and Decoding 3D Genome Organization Noam Kaplan Dekker Lab Program in Systems Biology University of Massachusetts Medical School Chromatin organization 10000nm Human genome: 2x3000 Mbp Human chromosome:

More information

Going Beyond SNPs with Next Genera5on Sequencing Technology Personalized Medicine: Understanding Your Own Genome Fall 2014

Going Beyond SNPs with Next Genera5on Sequencing Technology Personalized Medicine: Understanding Your Own Genome Fall 2014 Going Beyond SNPs with Next Genera5on Sequencing Technology 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Next Genera5on Sequencing Technology (NGS) NGS technology Discover more

More information

Introduction and Objectives:

Introduction and Objectives: Introduction and Objectives: My research program asks why bacteria are naturally competent (able to take up DNA fragments from their surroundings). Answering this question will determine whether competence

More information

Biology I Level - 2nd Semester Final Review

Biology I Level - 2nd Semester Final Review Biology I Level - 2nd Semester Final Review The 2 nd Semester Final encompasses all material that was discussed during second semester. It s important that you review ALL notes and worksheets from the

More information

Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses

Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 Title ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses

More information

Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress

Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress Irina Makarevitch 1,2, Amanda J. Waters 2, Patrick T. West 2, Michelle Stitzer 3, Candice N. Hirsch 4, Jeffrey

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Annotation of Plant Genomes using RNA-seq. Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA)

Annotation of Plant Genomes using RNA-seq. Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA) Annotation of Plant Genomes using RNA-seq Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA) inuscu1-35bp 5 _ 0 _ 5 _ What is Annotation inuscu2-75bp luscu1-75bp 0 _ 5 _ Reconstruction

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

TASK 6.3 Modelling and data analysis support

TASK 6.3 Modelling and data analysis support Wheat and barley Legacy for Breeding Improvement TASK 6.3 Modelling and data analysis support FP7 European Project Task 6.3: How can statistical models contribute to pre-breeding? Daniela Bustos-Korts

More information

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.

More information

Maize Genetics Cooperation Newsletter Vol Derkach 1

Maize Genetics Cooperation Newsletter Vol Derkach 1 Maize Genetics Cooperation Newsletter Vol 91 2017 Derkach 1 RELATIONSHIP BETWEEN MAIZE LANCASTER INBRED LINES ACCORDING TO SNP-ANALYSIS Derkach K. V., Satarova T. M., Dzubetsky B. V., Borysova V. V., Cherchel

More information