23/01/2018. PiRATE: a Pipeline to Retrieve and Annotate TEs of non-model organisms. Transposable elements (TEs) Impact of TEs on genomes

Size: px
Start display at page:

Download "23/01/2018. PiRATE: a Pipeline to Retrieve and Annotate TEs of non-model organisms. Transposable elements (TEs) Impact of TEs on genomes"

Transcription

1 Transposable elements () PiRATE: a Pipeline to Retrieve and Annotate of non-model organisms are DNAsequences able to move (= transposition) into the host genome of eucaryotic and procaryotic organisms Two classes: Classe I named retrotransposons move by a copy-paste mechanism dsdna TE TE Classe II named DNA transposon move by a cut-paste mechanism Jérémy BERTHELIER, Ph.D. student N. Casse, N. Daccord, V. Jamilloux, B. Saint-Jean, G. Carrier dsdna TE 2 A huge diversity Impact of on genomes Wicker et al., 2007 Non-autonomous impact genome size Zea Mays Homo sapiens Classe I TRIM LARD >4000 bp <4000 bp can generate phenotype changes Proportion of in genomes (%) Chénais et al., 2012 (Non-autonomes) Vitis vinifera Citrus sinensis Kobayashi et al., 2004 Butelli et al., 2012 Classe II MITE <500 bp TE mutations can promote species adaptation 3 Darboux et al., How to conduct a de novo TE annotation Limitations with non-model organisms 1. Detection of Seq1 Seq2 1. Detection of Seq1 Seq2 Seq3 Seq3 Genome assembly is usually fragmented missassembly of repeated elements 2. Classification of detected No Class 2. Classification of detected No Class Comparaison (BLAST) Few knowledge in TE databanks difficulty to classify/recognize Comparaison (BLAST) TE1 TE1 3. TE annotation 3. TE annotation Genome assembly 5 Genome assembly 6 1

2 Limitations with non-model organisms 1. Detection of Seq1 Seq2 Seq3 PiRATE is implemented into a stand-alone Galaxy: Genome assembly is usually fragmented missassembly of repeated elements Improve the detection step 2. Classification of detected No Class Few knowledge in TE databanks difficulty to classify/recognize Comparaison (BLAST) Improve the classification step 3. TE annotation TE1 Genome assembly Download the PiRATE Virtual Machine (Virtual Linux) 2. Import the PiRATE Virtual Machine into a Virtual Machine Monitor (VirtualBox) Launch the PiRATE-Galaxy It s ready to use!!!

3 Tools RepeatMasker Censor Windowmasker Transposon-PSI TeClass RepClass TESeeker + Tools TRANSPO LTR_STRUC MAK LTR_MINER LTR_par/LTR_seq Find_LTR LTR_FINDER LTRharvest LTRdigest MUST Tools REPuter RepeatFinder RECON GROUPER PILER RepeatScout Repseek Tallymer TEdenovo Tool ReAS RepeatExplorer RepARK dnapipete Transposome REPdenovo HelSearch MITE-Hunter Sine-Finder RSPB MITE-Digger TIRfinder Helitron_scanner detectmite MGEScan-LTR MGEScan-nonLTR RepeatModeler Group every existing TE detection approach to: - promote the detection of every TE families - promote the detection of full-length To facilitate the classification step A priori de novo Build repeated sequences Detect repeated sequences Detect specific structure of Detect similare sequences

4 output Cluster identical sequences Remove redundancy Only keep the larger putative Concatenate the detected sequences Cluster identical sequences Remove redundancy Only keep the larger putative Automated classification Automated classification Using databanks Automated classification Using databanks 23 Improved by: - combining public TE databanks - adding non-invertoried sequences - Creating new profiles 24 4

5 Familly 1 Familly 2 Group TE sequences into families PiRATE-Galaxy: Annotation PiRATE-Galaxy: Annotation 3: Annotation 3: Annotation 3 annotations: Putative 27 3 annotations: Putative 28 PiRATE-Galaxy: Annotation Control with Arabidopsis thaliana Genome size: 157 Mb Arabidopsis thaliana 259 TE families - PiRATE detects 81% of the TE families of A. thaliana (difficulty to detect non-autonomous ) 3: Annotation - 75% of these detectes were correctly classified (difficulty to classify non-autonomous ) 3 annotations: Putative All All repeated elements 29 Berthelier et al. BMC Genomics Under Review 30 5

6 Use of PiRATE with T. lutea (Haptophyta) Use of PiRATE with T. lutea (Haptophyta) Example with the microalga Tisochrysis lutea Phylum: Haptophyte Genome size: 82 Mb 193 contigs Example with the microalga Tisochrysis lutea Phylum: Haptophyte Genome size: 82 Mb 193 contigs 17 TE families of Haptophytes 17 TE families of Haptophytes Putative non-autonomous Putative autonomous putative autonomous total LTR / Copia LTR / Gypsy LTR / LARD LTR / TRIM LINE / Tx1 SINE Total repeated elements Proportion of the genome (%) Berthelier et al. BMC Genomics Under Review 31 TIR / Tc1-Mariner TIR / hat TIR / PiggyBac TIR / PIF-Harbinger MITE Proportion of the genome (%) 32 Berthelier et al. BMC Genomics Under Review Thanks for your attention! Acknowledgements: How improve TE classification? PASTEClassifier Public TE databanks To improve: Combine public databanks and adding non-inventoried Non-inventoried - Nicolas Daccord - Véronique Jamilloux Public profil Grégory Carrier Funding: Bruno Saint-Jean Nathalie Casse New profiles done done for microalgal genomes Control Detection Control Classification Arabidopsis thaliana Arabidopsis thaliana % of classified TE families % of detected TE families 96% 94% 100% 79% 80% 62% 60% 60% 40% 27% 20% 0% LTR (173) LINE (19) SINE (22) TIR (69) Helitron n-a (5) Helitron (32) 81% 81% n-a TIR Total TE (58) families (359) 100% 80% 60% 40% 20% 0% 98% 91% 87% 50% 27% LTR (148) LINE (15) SINE (6) TIR (43) n-a TIR (48) Classification : Correct 100% Helitron (3) 7% n-a Helitron (29) 359 TE families

Classification of repetitive elements based on the analysis of protein domains. Pavel Neumann May 2018

Classification of repetitive elements based on the analysis of protein domains. Pavel Neumann May 2018 Classification of repetitive elements based on the analysis of protein domains Pavel Neumann May 2018 A unified classification system for eukaryotic transposable elements (Wicker et al. 2007) Repbase classification

More information

HIGH PERFORMANCE CLUSTER AND GRID COMPUTING SOLUTIONS FOR SCIENCE UMESHKUMAR KESWANI. Presented to the Faculty of the Graduate School of

HIGH PERFORMANCE CLUSTER AND GRID COMPUTING SOLUTIONS FOR SCIENCE UMESHKUMAR KESWANI. Presented to the Faculty of the Graduate School of HIGH PERFORMANCE CLUSTER AND GRID COMPUTING SOLUTIONS FOR SCIENCE By UMESHKUMAR KESWANI Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of

More information

BIOINFORMATICS. PILER: identification and classification of genomic repeats. Robert C. Edgar 1* and Eugene W. Myers 2 1 INTRODUCTION

BIOINFORMATICS. PILER: identification and classification of genomic repeats. Robert C. Edgar 1* and Eugene W. Myers 2 1 INTRODUCTION BIOINFORMATICS Vol. 1 no. 1 2003 Pages 1 1 PILER: identification and classification of genomic repeats Robert C. Edgar 1* and Eugene W. Myers 2 1 195 Roque Moraes Drive, Mill Valley, CA, U.S.A., bob@drive5.com.

More information

Identification of DNA transposable elements in the Tasmanian devil (Sarcophilus harrisii) genome

Identification of DNA transposable elements in the Tasmanian devil (Sarcophilus harrisii) genome Identification of DNA transposable elements in the Tasmanian devil (Sarcophilus harrisii) genome Abstract The Tasmanian devil (Sarcophilus harrissi) is a carnivorous marsupial residing in Tasmania. Its

More information

The Evolution and Diversity of DNA Transposons in the Genome of the Lizard Anolis carolinensis

The Evolution and Diversity of DNA Transposons in the Genome of the Lizard Anolis carolinensis The Evolution and Diversity of DNA Transposons in the Genome of the Lizard Anolis carolinensis Peter A. Novick 1,2, Jeremy D. Smith 3, Mark Floumanhaft 1, David A. Ray 3, and Stéphane Boissinot*,1,2 1

More information

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula

Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Supplementary Information for: The genome of the extremophile crucifer Thellungiella parvula Maheshi Dassanayake 1,9, Dong-Ha Oh 1,9, Jeffrey S. Haas 1,2, Alvaro Hernandez 3, Hyewon Hong 1,4, Shahjahan

More information

Repetitive sequences analysis

Repetitive sequences analysis Repetitive sequences analysis Érica Ramos erica.ramos@gmail.com Repetitive elements characterization Martins et al., 2010.!' Repetitive elements characterization Martins et al., 2010. Identical or similar

More information

Analyse bioinformatique des événements de transferts horizontaux entre espèces de drosophiles et lien avec la régulation des éléments transposables

Analyse bioinformatique des événements de transferts horizontaux entre espèces de drosophiles et lien avec la régulation des éléments transposables Analyse bioinformatique des événements de transferts horizontaux entre espèces de drosophiles et lien avec la régulation des éléments transposables Laurent Modolo To cite this version: Laurent Modolo.

More information

Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae

Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae Fernández-Medina et al. BMC Genomics 2012, 13:272 RESEARCH ARTICLE Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae Rita

More information

TE content correlates positively with genome size

TE content correlates positively with genome size TE content correlates positively with genome size Mb 3000 Genomic DNA 2500 2000 1500 1000 TE DNA Protein-coding DNA 500 0 Feschotte & Pritham 2006 Transposable elements. Variation in gene numbers cannot

More information

PLNT2530 (2018) Unit 5 Genomes: Organization and Comparisons

PLNT2530 (2018) Unit 5 Genomes: Organization and Comparisons PLNT2530 (2018) Unit 5 Genomes: Organization and Comparisons Unless otherwise cited or referenced, all content of this presenataion is licensed under the Creative Commons License Attribution Share-Alike

More information

Special Topics on Genetics

Special Topics on Genetics ARISTOTLE UNIVERSITY OF THESSALONIKI OPEN COURSES Section 9: Transposable elements Drosopoulou E License The offered educational material is subject to Creative Commons licensing. For educational material,

More information

Computational analysis and paleogenomics of interspersed repeats in eukaryotes

Computational analysis and paleogenomics of interspersed repeats in eukaryotes Computational analysis and paleogenomics of interspersed repeats in eukaryotes Cédric Feschotte* and Ellen J. Pritham Department of Biology, University of Texas at Arlington, Arlington TX 76016, USA *

More information

Potato Genome Analysis

Potato Genome Analysis Potato Genome Analysis Xin Liu Deputy director BGI research 2016.1.21 WCRTC 2016 @ Nanning Reference genome construction???????????????????????????????????????? Sequencing HELL RIEND WELCOME BGI ZHEN LLOFRI

More information

Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism

Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism Ling Yin 1,3,, Yunhe An 1,2,, Junjie Qu 3,, Xinlong Li 1, Yali Zhang 1, Ian Dry 5, Huijun Wu 2*, Jiang Lu 1,4** 1 College

More information

A unified classification system for eukaryotic transposable elements

A unified classification system for eukaryotic transposable elements Nature Reviews Genetics AOP, published online 6 November 2007; doi:10.1038/nrg2165 Perspectives g u i d e l i n e s A unified classification system for eukaryotic transposable elements Thomas Wicker, François

More information

Microbiome: 16S rrna Sequencing 3/30/2018

Microbiome: 16S rrna Sequencing 3/30/2018 Microbiome: 16S rrna Sequencing 3/30/2018 Skills from Previous Lectures Central Dogma of Biology Lecture 3: Genetics and Genomics Lecture 4: Microarrays Lecture 12: ChIP-Seq Phylogenetics Lecture 13: Phylogenetics

More information

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007 -2 Transcript Alignment Assembly and Automated Gene Structure Improvements Using PASA-2 Mathangi Thiagarajan mathangi@jcvi.org Rice Genome Annotation Workshop May 23rd, 2007 About PASA PASA is an open

More information

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide Name: Period: Date: AP Bio Module 6: Bacterial Genetics and Operons, Student Learning Guide Getting started. Work in pairs (share a computer). Make sure that you log in for the first quiz so that you get

More information

Paired-End Read Length Lower Bounds for Genome Re-sequencing

Paired-End Read Length Lower Bounds for Genome Re-sequencing 1/11 Paired-End Read Length Lower Bounds for Genome Re-sequencing Rayan Chikhi ENS Cachan Brittany PhD student in the Symbiose team, Irisa, France 2/11 NEXT-GENERATION SEQUENCING Next-gen vs. traditional

More information

Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome Biology 2014, 3, 295-319; doi:10.3390/biology3020295 Article OPEN ACCESS biology ISSN 2079-7737 www.mdpi.com/journal/biology Sequence-Based Analysis of Structural Organization and Composition of the Cultivated

More information

Genome Annotation. Qi Sun Bioinformatics Facility Cornell University

Genome Annotation. Qi Sun Bioinformatics Facility Cornell University Genome Annotation Qi Sun Bioinformatics Facility Cornell University Some basic bioinformatics tools BLAST PSI-BLAST - Position-Specific Scoring Matrix HMM - Hidden Markov Model NCBI BLAST How does BLAST

More information

Frequently Asked Questions (FAQs)

Frequently Asked Questions (FAQs) Frequently Asked Questions (FAQs) Q1. What is meant by Satellite and Repetitive DNA? Ans: Satellite and repetitive DNA generally refers to DNA whose base sequence is repeated many times throughout the

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18 Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection

More information

Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and other fungi

Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and other fungi Castanera et al. BMC Genomics 2014, 15:1071 RESEARCH ARTICLE Open Access Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and

More information

Supplementary Information

Supplementary Information Supplementary Information LINE-1-like retrotransposons contribute to RNA-based gene duplication in dicots Zhenglin Zhu 1, Shengjun Tan 2, Yaqiong Zhang 2, Yong E. Zhang 2,3 1. School of Life Sciences,

More information

Domain organization within repeated DNA sequences: application to the study of a family of transposable elements

Domain organization within repeated DNA sequences: application to the study of a family of transposable elements BIOINFORMATICS ORIGINAL PAPER Vol. 22 no. 16 2006, pages 1948 1954 doi:10.1093/bioinformatics/btl337 Sequence analysis Domain organization within repeated DNA sequences: application to the study of a family

More information

Caminalcules Discovery Classifying Imaginary Animals by Analysis of Shared Characteristics

Caminalcules Discovery Classifying Imaginary Animals by Analysis of Shared Characteristics Caminalcules Discovery Classifying Imaginary Animals by Analysis of Shared Characteristics PURPOSE In this activity you will reinforce the concept of classification by grouping imaginary organisms with

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)

More information

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags

Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Evaluation of Genome Sequencing Quality in Selected Plant Species Using Expressed Sequence Tags Lingfei Shangguan 1, Jian Han 1, Emrul Kayesh 1, Xin Sun 1, Changqing Zhang 2, Tariq Pervaiz 1, Xicheng Wen

More information

Unfixed endogenous retroviral insertions in the human population. Emanuele Marchi, Alex Kanapin, Gkikas Magiorkinis and Robert Belshaw

Unfixed endogenous retroviral insertions in the human population. Emanuele Marchi, Alex Kanapin, Gkikas Magiorkinis and Robert Belshaw Unfixed endogenous retroviral insertions in the human population Emanuele Marchi, Alex Kanapin, Gkikas Magiorkinis and Robert Belshaw Supplementary Methods Common sources of 'false positives' in mining

More information

Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress

Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress Transposable Elements Contribute to Activation of Maize Genes in Response to Abiotic Stress Irina Makarevitch 1,2, Amanda J. Waters 2, Patrick T. West 2, Michelle Stitzer 3, Candice N. Hirsch 4, Jeffrey

More information

Mole_Oce Lecture # 24: Introduction to genomics

Mole_Oce Lecture # 24: Introduction to genomics Mole_Oce Lecture # 24: Introduction to genomics DEFINITION: Genomics: the study of genomes or he study of genes and their function. Genomics (1980s):The systematic generation of information about genes

More information

Recent and dynamic transposable elements contribute to genomic divergence under asexuality

Recent and dynamic transposable elements contribute to genomic divergence under asexuality Ferreira de Carvalho et al. BMC Genomics (2016) 17:884 DOI 10.1186/s12864-016-3234-9 RESEARCH ARTICLE Open Access Recent and dynamic transposable elements contribute to genomic divergence under asexuality

More information

An overview of the Phalaenopsis orchid genome by BAC end sequence analysis

An overview of the Phalaenopsis orchid genome by BAC end sequence analysis An overview of the Phalaenopsis orchid genome by BAC end sequence analysis Chia-Chi Hsu 1, Yu-Lin Chung 1, Yu-Ling Lee 1, Tien-Chih Chen 1, Yi-Tzu Kuo 1, Wen-Chieh Tsai 2,3, Yu-Yun Hsiao 1, Yun-Wen Chen

More information

GEP Annotation Report

GEP Annotation Report GEP Annotation Report Note: For each gene described in this annotation report, you should also prepare the corresponding GFF, transcript and peptide sequence files as part of your submission. Student name:

More information

Transposable element dynamics among asymbiotic and ectomycorrhizal Amanita fungi

Transposable element dynamics among asymbiotic and ectomycorrhizal Amanita fungi Genome Biology and Evolution Advance Access published June 12, 2014 doi:10.1093/gbe/evu121 Transposable element dynamics among asymbiotic and ectomycorrhizal Amanita fungi Jaqueline Hess 1, Inger Skrede

More information

Miniature Transposable Elements (mtes): Impacts and Uses in the Brassica Genome

Miniature Transposable Elements (mtes): Impacts and Uses in the Brassica Genome Miniature Transposable Elements (mtes): Impacts and Uses in the Brassica Genome 6 Perumal Sampath, Jonghoon Lee, Feng Cheng, Xiaowu Wang and Tae-Jin Yang Abstract Transposable elements occupy large portions

More information

Procedure to Create NCBI KOGS

Procedure to Create NCBI KOGS Procedure to Create NCBI KOGS full details in: Tatusov et al (2003) BMC Bioinformatics 4:41. 1. Detect and mask typical repetitive domains Reason: masking prevents spurious lumping of non-orthologs based

More information

Discovery of Repetitive Patterns in DNA with Accurate Boundaries

Discovery of Repetitive Patterns in DNA with Accurate Boundaries Discovery of Repetitive Patterns in DNA with Accurate Boundaries Jie Zheng Stefano Lonardi Department of Computer Science and Engineering University of California, Riverside, CA 92521 fzjie, stelog@cs.ucr.edu

More information

A Browser for Pig Genome Data

A Browser for Pig Genome Data A Browser for Pig Genome Data Thomas Mailund January 2, 2004 This report briefly describe the blast and alignment data available at http://www.daimi.au.dk/ mailund/pig-genome/ hits.html. The report describes

More information

Genome-wide analysis of WOX genes in upland cotton and their expression pattern under different stresses

Genome-wide analysis of WOX genes in upland cotton and their expression pattern under different stresses Yang et al. BMC Plant Biology (2017) 17:113 DOI 10.1186/s12870-017-1065-8 RESEARCH ARTICLE Genome-wide analysis of WOX genes in upland cotton and their expression pattern under different stresses Zhaoen

More information

Academy of Agricultural Sciences, Anyang , Henan, China. 2 BGI-Shenzhen,

Academy of Agricultural Sciences, Anyang , Henan, China. 2 BGI-Shenzhen, The draft genome of a diploid cotton Gossypium raimondii Kunbo Wang 1,6, Zhiwen Wang 2,6, Fuguang Li 1,6, Wuwei Ye 1,6, Junyi Wang 2,6, Guoli Song 1,6, Zhen Yue 2, Lin Cong 2, Haihong Shang 1, Shilin Zhu

More information

CGS 5991 (2 Credits) Bioinformatics Tools

CGS 5991 (2 Credits) Bioinformatics Tools CAP 5991 (3 Credits) Introduction to Bioinformatics CGS 5991 (2 Credits) Bioinformatics Tools Giri Narasimhan 8/26/03 CAP/CGS 5991: Lecture 1 1 Course Schedules CAP 5991 (3 credit) will meet every Tue

More information

Transposon fingerprinting using low coverage whole genome shotgun sequencing in Cacao (Theobroma cacao L.) and related species

Transposon fingerprinting using low coverage whole genome shotgun sequencing in Cacao (Theobroma cacao L.) and related species Sveinsson et al. BMC Genomics 2013, 14:502 RESEARCH ARTICLE Open Access Transposon fingerprinting using low coverage whole genome shotgun sequencing in Cacao (Theobroma cacao L.) and related species Saemundur

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available

More information

Evolution of genes and genomes on the Drosophila phylogeny

Evolution of genes and genomes on the Drosophila phylogeny Vol 450 8 November 2007 doi:10.1038/nature06341 ARTICLES Evolution of genes and genomes on the Drosophila phylogeny Drosophila 12 Genomes Consortium* Comparative analysis of multiple genomes in a phylogenetic

More information

Unexpected Diversity and Differential Success of DNA Transposons in Four Species of Entamoeba Protozoans

Unexpected Diversity and Differential Success of DNA Transposons in Four Species of Entamoeba Protozoans RESEARCH ARTICLES Unexpected Diversity and Differential Success of DNA Transposons in Four Species of Entamoeba Protozoans Ellen J. Pritham, 1 Cédric Feschotte, 1 and Susan R. Wessler Department of Plant

More information

Genome-Wide Analysis of Repetitive Elements in Papaya

Genome-Wide Analysis of Repetitive Elements in Papaya DOI 10.1007/s12042-008-9015-0 Genome-Wide Analysis of Repetitive Elements in Papaya Niranjan Nagarajan & Rafael Navajas-Pérez & Mihai Pop & Maqsudul Alam & Ray Ming & Andrew H. Paterson & Steven L. Salzberg

More information

Supplemental Tables for Genomic Legacy of the African Cheetah, Acinonyx jubatus

Supplemental Tables for Genomic Legacy of the African Cheetah, Acinonyx jubatus Supplemental Tables for Genomic Legacy of the African Cheetah, Acinonyx jubatus 1 List of Tables Table S1: Sequenced cheetah reads for de novo genome assembly 3 Table S2: Re-sequenced cheetah reads for

More information

TRANSPOSABLE elements (TEs) are ubiquitous and one of

TRANSPOSABLE elements (TEs) are ubiquitous and one of INVESTIGATION Population Genetics and Molecular Evolution of DNA Sequences in Transposable Elements. I. A Simulation Framework T. E. Kijima and Hideki Innan 1 Graduate University for Advanced Studies,

More information

Genomes Comparision via de Bruijn graphs

Genomes Comparision via de Bruijn graphs Genomes Comparision via de Bruijn graphs Student: Ilya Minkin Advisor: Son Pham St. Petersburg Academic University June 4, 2012 1 / 19 Synteny Blocks: Algorithmic challenge Suppose that we are given two

More information

Overview of Research at Bioinformatics Lab

Overview of Research at Bioinformatics Lab Overview of Research at Bioinformatics Lab Li Liao Develop new algorithms and (statistical) learning methods that help solve biological problems > Capable of incorporating domain knowledge > Effective,

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Population Genetics and Molecular Evolution of DNA Sequences in Transposable Elements. I. A Simulation Framework

Population Genetics and Molecular Evolution of DNA Sequences in Transposable Elements. I. A Simulation Framework Genetics: Early Online, published on September 3, 213 as 1.1534/genetics.113.15292 Population Genetics and Molecular Evolution of DNA Sequences in Transposable Elements. I. A Simulation Framework T. E.

More information

Week 7.2 Ch 4 Microevolutionary Proceses

Week 7.2 Ch 4 Microevolutionary Proceses Week 7.2 Ch 4 Microevolutionary Proceses 1 Mendelian Traits vs Polygenic Traits Mendelian -discrete -single gene determines effect -rarely influenced by environment Polygenic: -continuous -multiple genes

More information

Organizing Diversity Taxonomy is the discipline of biology that identifies, names, and classifies organisms according to certain rules.

Organizing Diversity Taxonomy is the discipline of biology that identifies, names, and classifies organisms according to certain rules. 1 2 3 4 5 6 7 8 9 10 Outline 1.1 Introduction to AP Biology 1.2 Big Idea 1: Evolution 1.3 Big Idea 2: Energy and Molecular Building Blocks 1.4 Big Idea 3: Information Storage, Transmission, and Response

More information

Transposon diversity is higher in amphioxus than in vertebrates: functional and evolutionary inferences Cristian Can estro and Ricard Albalat

Transposon diversity is higher in amphioxus than in vertebrates: functional and evolutionary inferences Cristian Can estro and Ricard Albalat BRIEFINGS IN FUNCTIONAL GENOMICS. VOL 11. NO 2. 131^141 doi:10.1093/bfgp/els010 Transposon diversity is higher in amphioxus than in vertebrates: functional and evolutionary inferences Cristian Can estro

More information

G4120: Introduction to Computational Biology

G4120: Introduction to Computational Biology ICB Fall 2009 G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology & Immunology Copyright 2008 Oliver Jovanovic, All Rights Reserved. Genome

More information

Introduction to de novo RNA-seq assembly

Introduction to de novo RNA-seq assembly Introduction to de novo RNA-seq assembly Introduction Ideal day for a molecular biologist Ideal Sequencer Any type of biological material Genetic material with high quality and yield Cutting-Edge Technologies

More information

Annotation of Plant Genomes using RNA-seq. Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA)

Annotation of Plant Genomes using RNA-seq. Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA) Annotation of Plant Genomes using RNA-seq Matteo Pellegrini (UCLA) In collaboration with Sabeeha Merchant (UCLA) inuscu1-35bp 5 _ 0 _ 5 _ What is Annotation inuscu2-75bp luscu1-75bp 0 _ 5 _ Reconstruction

More information

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic Cross Discipline Analysis made possible with Data Pipelining J.R. Tozer SciTegic System Genesis Pipelining tool created to automate data processing in cheminformatics Modular system built with generic

More information

Graduate Funding Information Center

Graduate Funding Information Center Graduate Funding Information Center UNC-Chapel Hill, The Graduate School Graduate Student Proposal Sponsor: Program Title: NESCent Graduate Fellowship Department: Biology Funding Type: Fellowship Year:

More information

Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00.

Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00. Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00. Promoters and Enhancers Systematic discovery of transcriptional regulatory motifs

More information

Chapters AP Biology Objectives. Objectives: You should know...

Chapters AP Biology Objectives. Objectives: You should know... Objectives: You should know... Notes 1. Scientific evidence supports the idea that evolution has occurred in all species. 2. Scientific evidence supports the idea that evolution continues to occur. 3.

More information

M. oryzae Yashiromochi (Pita) Pi No.4 (Pita2) Fukunishiki (Piz) M. oryzae Ina168

M. oryzae Yashiromochi (Pita) Pi No.4 (Pita2) Fukunishiki (Piz) M. oryzae Ina168 Supplemental Data Yoshida et al., (2009) Association genetics reveals three novel avirulence genes from the rice blast fungal pathogen Magnaporthe oryzae. M. oryzae 70-15????????????? Sasanishiki (Pia)

More information

GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES.

GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES. GENOME DUPLICATION AND GENE ANNOTATION: AN EXAMPLE FOR A REFERENCE PLANT SPECIES. Alessandra Vigilante, Mara Sangiovanni, Chiara Colantuono, Luigi Frusciante and Maria Luisa Chiusano Dept. of Soil, Plant,

More information

The sunflower (Helianthus annuus L.) genome reflects a recent history of biased accumulation of transposable elements

The sunflower (Helianthus annuus L.) genome reflects a recent history of biased accumulation of transposable elements The Plant Journal (2012) 72, 142 153 doi: 10.1111/j.1365-313X.2012.05072.x The sunflower (Helianthus annuus L.) genome reflects a recent history of biased accumulation of transposable elements S. Evan

More information

PROTEIN SYNTHESIS INTRO

PROTEIN SYNTHESIS INTRO MR. POMERANTZ Page 1 of 6 Protein synthesis Intro. Use the text book to help properly answer the following questions 1. RNA differs from DNA in that RNA a. is single-stranded. c. contains the nitrogen

More information

RNA- seq read mapping

RNA- seq read mapping RNA- seq read mapping Pär Engström SciLifeLab RNA- seq workshop October 216 IniDal steps in RNA- seq data processing 1. Quality checks on reads 2. Trim 3' adapters (opdonal (for species with a reference

More information

15.12 Applications of Suffix Trees

15.12 Applications of Suffix Trees 248 Algorithms in Bioinformatis II, SoSe 07, ZBIT, D. Huson, May 14, 2007 15.12 Appliations of Suffix Trees 1. Searhing for exat patterns 2. Minimal unique substrings 3. Maximum unique mathes 4. Maximum

More information

Comparative Virology as Context. Loy Volkman Professor Emerita Plant & Microbial Biology University of California, Berkeley

Comparative Virology as Context. Loy Volkman Professor Emerita Plant & Microbial Biology University of California, Berkeley Comparative Virology as Context Loy Volkman Professor Emerita Plant & Microbial Biology University of California, Berkeley LEFT FIELD Oxygen Accumulation and Evolutionary History of Life on Earth PHOTOSYNTHESIS

More information

MOBILE ELEMENTS AND EVOLUTION OF MOLECULAR REGULATORY SYSTEMS. Evelina Daskalova*

MOBILE ELEMENTS AND EVOLUTION OF MOLECULAR REGULATORY SYSTEMS. Evelina Daskalova* PROCEEDINGS OF THE BALKAN SCIENTIFIC CONFERENCE OF BIOLOGY IN PLOVDIV (BULGARIA) FROM 19 TH TILL 21 ST OF MAY 2005 (EDS B. GRUEV, M. NIKOLOVA AND A. DONEV), 2005 (P. 79 89) MOBILE ELEMENTS AND EVOLUTION

More information

M.B. Zhou, X.M. Liu, and D.Q. Tang. Corresponding author: D.Q. Tang

M.B. Zhou, X.M. Liu, and D.Q. Tang. Corresponding author: D.Q. Tang Transposable elements in Phyllostachys pubescens (Poaceae) genome survey sequences and the full-length cdna sequences, and their association with simple-sequence repeats M.B. Zhou, X.M. Liu, and D.Q. Tang

More information

November 13, 2009 Bioe 109 Fall 2009 Lecture 20 Evolutionary Genomics

November 13, 2009 Bioe 109 Fall 2009 Lecture 20 Evolutionary Genomics November 13, 2009 Bioe 109 Fall 2009 Lecture 20 Evolutionary Genomics - we have now entered the genomics age - the number of complete genomes continues to rise rapidly each year, now numbering about 200.

More information

Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen

Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen Vanheule et al. BMC Genomics (2016) 17:670 DOI 10.1186/s12864-016-2941-6 RESEARCH ARTICLE Open Access Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen

More information

Evolution of the Rdr1 TNL-cluster in roses and other Rosaceous species

Evolution of the Rdr1 TNL-cluster in roses and other Rosaceous species Terefe-Ayana et al. BMC Genomics 2012, 13:409 RESEARCH ARTICLE Open Access Evolution of the Rdr1 TNL-cluster in roses and other Rosaceous species Diro Terefe-Ayana, Helgard Kaufmann, Marcus Linde and Thomas

More information

Tc1/mariner is a diverse and widespread superfamily Tc1/mariner elements were recently found to be wideof

Tc1/mariner is a diverse and widespread superfamily Tc1/mariner elements were recently found to be wideof Copyright 2003 by the Genetics Society of America Genome-Wide Analysis of mariner-like Transposable Elements in Rice Reveals Complex Relationships With Stowaway Miniature Inverted Repeat Transposable Elements

More information

Genomics Education Partnership Fosmid 16 Harry Quedenfeld. Data and text from this paper is allowed to be included in publications

Genomics Education Partnership Fosmid 16 Harry Quedenfeld. Data and text from this paper is allowed to be included in publications TheCharacterizationofOrthologousCG14561, CG7139 PA,CG7139 PB,CG7133,CG7130andthe IneffectualExclusionofDrosophilamelanogaster ExonicSequenceinRpLP0inDrosophilaerecta GenomicsEducationPartnership Fosmid16

More information

HORIZONTAL TRANSFER IN EUKARYOTES KIMBERLEY MC GRAIL FERNÁNDEZ GENOMICS

HORIZONTAL TRANSFER IN EUKARYOTES KIMBERLEY MC GRAIL FERNÁNDEZ GENOMICS HORIZONTAL TRANSFER IN EUKARYOTES KIMBERLEY MC GRAIL FERNÁNDEZ GENOMICS OVERVIEW INTRODUCTION MECHANISMS OF HGT IDENTIFICATION TECHNIQUES EXAMPLES - Wolbachia pipientis - Fungus - Plants - Drosophila ananassae

More information

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences

Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic

More information

Annotation of Drosophila grimashawi Contig12

Annotation of Drosophila grimashawi Contig12 Annotation of Drosophila grimashawi Contig12 Marshall Strother April 27, 2009 Contents 1 Overview 3 2 Genes 3 2.1 Genscan Feature 12.4............................................. 3 2.1.1 Genome Browser:

More information

Supplementary Figure 3

Supplementary Figure 3 Supplementary Figure 3 7.0 Col Kas-1 Line FTH1A 8.4 F3PII3 8.9 F26H11 ATQ1 T9I22 PLS8 F26B6-B 9.6 F27L4 9.81 F27D4 9.92 9.96 10.12 10.14 10.2 11.1 0.5 Mb T1D16 Col % RGR 83.3 101 227 93.5 75.9 132 90 375

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage.

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 389; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs06.html 1/12/06 CAP5510/CGS5166 1 Evaluation

More information

Going Beyond SNPs with Next Genera5on Sequencing Technology Personalized Medicine: Understanding Your Own Genome Fall 2014

Going Beyond SNPs with Next Genera5on Sequencing Technology Personalized Medicine: Understanding Your Own Genome Fall 2014 Going Beyond SNPs with Next Genera5on Sequencing Technology 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Next Genera5on Sequencing Technology (NGS) NGS technology Discover more

More information

IMMERSIVE GRAPH-BASED VISUALIZATION AND EXPLORATION OF BIOLOGICAL DATA RELATIONSHIPS

IMMERSIVE GRAPH-BASED VISUALIZATION AND EXPLORATION OF BIOLOGICAL DATA RELATIONSHIPS Data Science Journal, Volume 4, 31 December 2005 189 IMMERSIVE GRAPH-BASED VISUALIZATION AND EXPLORATION OF BIOLOGICAL DATA RELATIONSHIPS N Férey, PE Gros, J Hérisson, R Gherbi* *Bioinformatics team, Human-Computer

More information

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type

Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type A B 2 3 3 2 1 1 Supplemental Figure 1. Comparison of Tiller Bud Formation between the Wild Type and d27. (A) and (B) Longitudinal sections of shoot apex in wild-type (A) and d27 (B) seedlings at the four

More information

Paleo-evolutionary plasticity of plant disease resistance genes

Paleo-evolutionary plasticity of plant disease resistance genes Zhang et al. BMC Genomics 2014, 15:187 RESEARCH ARTICLE Paleo-evolutionary plasticity of plant disease resistance genes Rongzhi Zhang 1,2, Florent Murat 1, Caroline Pont 1, Thierry Langin 1 and Jerome

More information

Using Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics

Using Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics Using Ensembles of Hidden Markov Models for Grand Challenges in Bioinformatics Tandy Warnow Founder Professor of Engineering The University of Illinois at Urbana-Champaign http://tandy.cs.illinois.edu

More information

Bioinformatics Practical for Biochemists

Bioinformatics Practical for Biochemists Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2012/2013 01. DNA & Genomics 1 Description Lectures about general topics in Bioinformatics & History Tutorials will

More information

RIBOSOME: THE ENGINE OF THE LIVING VON NEUMANN S CONSTRUCTOR

RIBOSOME: THE ENGINE OF THE LIVING VON NEUMANN S CONSTRUCTOR RIBOSOME: THE ENGINE OF THE LIVING VON NEUMANN S CONSTRUCTOR IAS 2012 Von Neumann s universal constructor Self-reproducing machine: constructor + tape (1948/9). Program on tape: (i) retrieve parts from

More information

Bioinformatics Practical for Biochemists

Bioinformatics Practical for Biochemists Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2013/2014 01. DNA & Genomics!! 1 Description Lectures about general topics in Bioinformatics & History Tutorials

More information

Diversity and evolution of transposable elements in Arabidopsis

Diversity and evolution of transposable elements in Arabidopsis Chromosome Res (2014) 22:203 216 DOI 10.1007/s10577-014-9418-8 REVIEW Diversity and evolution of transposable elements in Arabidopsis Zoé Joly-Lopez & Thomas E. Bureau Published online: 7 May 2014 # Springer

More information

Discrete Probability Distributions: Poisson Applications to Genome Assembly

Discrete Probability Distributions: Poisson Applications to Genome Assembly Discrete Probability Distributions: Poisson Applications to Genome Assembly Binomial Distribution Binomially distributed random variable X equals sum of n independent Bernoulli trials The probability mass

More information

Assembly improvement: based on Ragout approach. student: Anna Lioznova scientific advisor: Son Pham

Assembly improvement: based on Ragout approach. student: Anna Lioznova scientific advisor: Son Pham Assembly improvement: based on Ragout approach student: Anna Lioznova scientific advisor: Son Pham Plan Ragout overview Datasets Assembly improvements Quality overlap graph paired-end reads Coverage Plan

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Figures 76.0 DOC (mean) 68.1 76.2 500 bp upstream Repeat region 500 bp downstream Supplementary Figure 1 Comparative assessment of depth of coverage (DOC) of reads by mapping to the repetitive

More information

CHAPTER : Prokaryotic Genetics

CHAPTER : Prokaryotic Genetics CHAPTER 13.3 13.5: Prokaryotic Genetics 1. Most bacteria are not pathogenic. Identify several important roles they play in the ecosystem and human culture. 2. How do variations arise in bacteria considering

More information

Prediction of protein function from sequence analysis

Prediction of protein function from sequence analysis Prediction of protein function from sequence analysis Rita Casadio BIOCOMPUTING GROUP University of Bologna, Italy The omic era Genome Sequencing Projects: Archaea: 74 species In Progress:52 Bacteria:

More information

Jay Moore,, Graham King, James Lynn. Data integration for Brassica comparative genomics

Jay Moore,, Graham King, James Lynn. Data integration for Brassica comparative genomics Jay Moore,, Graham King, James Lynn Data integration for Brassica comparative genomics How best to bring together diverse data about Brassica genome organisation? How best to make data accessible and useful

More information