Comparative Genomics II
|
|
- Aleesha Johnson
- 5 years ago
- Views:
Transcription
1 Comparative Genomics II Advances in Bioinformatics and Genomics GEN 240B Jason Stajich May 19 Comparative Genomics II Slide 1/31
2 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Slide 2/31
3 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Gene Families Slide 3/31
4 Orthology inference and Gene family identification How to cluster genes by similarity? Want to uncover paralogy and orthology relationships. Approaches: Single-linkage Markov-Clustering Phylogenetic approaches Comparative Genomics II Introduction Gene Families Slide 4/31
5 Orthology prediction methods Comparative Genomics II Introduction Gene Families Slide 5/31
6 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Pairwise Methods Slide 6/31
7 Pairwise methods Best Bidirectional Hits (A B) Single linkage COGs InParanoid & OrthoMCL Comparative Genomics II Introduction Pairwise Methods Slide 7/31
8 Best Bidirectional Hits (BBH) All pairs of proteins with reciprocal best hits are considered orthologs. Note that this method is unable to predict the othology with the yellow protein. Pro Intuitive and fast Con Has problem of promiscuous domains leading to over-connecting Con Requires a single cutoff for establishing linkages Comparative Genomics II Introduction Pairwise Methods Slide 8/31
9 Clusters of Orthologous Genes (COG) Proteins in the nodes of triangular networks of BBHs are considered as orthologs (green, red and yellow protein 1). New proteins are added to the orthologous group if they are present in BBH triangles that share an edge with a given cluster. The COG-like approach can add additional proteins from the same genome if they are more similar to each other than to proteins in other genomes, or if they form BBH triangles with members of the cluster. This is not the case for yellow protein 2, which is, again, misclassified. Comparative Genomics II Introduction Pairwise Methods Slide 9/31
10 InParanoid approach - correct for paralogy This is similar to BBH but other proteins within a proteome (yellow protein 2 in this example) are included as in-paralogs if they are more similar to each other than to their corresponding hits in the other species. Comparative Genomics II Introduction Pairwise Methods Slide 10/31
11 OrthOMCL approach - Markov Cluster This is similar to BBH but other proteins within a proteome (yellow protein 2 in this example) are included as in-paralogs if they are more similar to each other than to their corresponding hits in the other species. Comparative Genomics II Introduction Pairwise Methods Slide 11/31
12 OrthoMCL workflow Comparative Genomics II Introduction Pairwise Methods Slide 12/31
13 OrthoMCL distance correction for paralog method Comparative Genomics II Introduction Pairwise Methods Slide 13/31
14 OrthoMCL able to connect families unlinked by Single-Linkage or COG Comparative Genomics II Introduction Pairwise Methods Slide 14/31
15 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Phylogenetic Methods Slide 15/31
16 Tree Reconciliation Duplication nodes (marked with a D) are defined by comparing the gene tree (small tree at the top) with the species tree (small tree at the bottom) to derive a reconciled tree (big tree on the right) in which the minimal number of duplication and gene loss (dashed lines) events necessary to explain the gene tree are included. In this case, both the yellow proteins are included in the orthologous group but the red and gray proteins are excluded. Comparative Genomics II Introduction Phylogenetic Methods Slide 16/31
17 Species overlap phylogenetic approach All proteins that derive from a common ancestor by speciation are considered members of the same orthologous group. Duplication nodes are detected when they define partitions with at least one shared species. A one-to-many orthology relationship emerges because of a recent duplication in the lineage leading to the yellow proteome. Comparative Genomics II Introduction Phylogenetic Methods Slide 17/31
18 SYNERGY [Wapinski] Clusters of similar genes are found and trees inferred at once Phylogenetic approach that builds up a tree and breaks groups when a ancestral duplication is found that is older than the species group. Can take into account scoring scheme that uses synteny SYNERGY InParanoid Comparative Genomics II Introduction Phylogenetic Methods Slide 18/31
19 SYNERGY Comparative Genomics II Introduction Phylogenetic Methods Slide 19/31
20 SYNERGY starts (top) with a collection of genes (A1, B1, C1 and so on), their chromosomal order (grey lines) and sequence distances (blue arrows; arrows of the same thickness have similar sequence distances). It then builds orthogroups as it climbs the species tree. First, it collects the genes in species A and B that share a common ancestor in species X (second panel, orange ovals). Then, it merges orthogroups formed in the previous stage with the genes in C, resulting in new orthogroups representing ancestral genes in species Y (third panel, yellow ovals). The orthogroups assembled at each stage are associated with gene trees reflecting divergence, duplication and loss events (bottom). b, Gene tree reconstruction and refining orthogroup assignments. An unrooted phylogeny is reconstructed for the genes and sub-orthogroups in each putative orthogroup (dashed oval). Some rootings (purple arrow) indicate that all the genes descended from a common ancestor (for example, X3, bottom left). Others (green arrow) show that a duplication occurred at the root of the gene tree (for example, X2 and X3, bottom right). In the latter case, the orthogroup is partitioned before proceeding. Comparative Genomics II Introduction Phylogenetic Methods Slide 20/31
21 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Gene tree reconciliation Slide 21/31
22 Gene tree reconciliation Resolve Duplication and Speciation events on a gene tree Uses the known phylogeny of species and walk up the gene tree and assign nodes Some methods impute missing data (gene losses that are unobserved) Comparative Genomics II Introduction Gene tree reconciliation Slide 22/31
23 Speciation-Duplication Inference [Zmaseck and Eddy 2002] Very simple recursion to reconcile gene tree and species tree. Each node is labeled. Doesn t try and infer that there is missing data. Improved upon with Resampling Inference of Orthology (RIO) by same authors. Comparative Genomics II Introduction Gene tree reconciliation Slide 23/31
24 Notung Very simple recursion to reconcile gene tree and species tree. Each node is labeled. Doesn t try and infer that there is missing data. Comparative Genomics II Introduction Gene tree reconciliation Slide 24/31
25 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II Introduction Databases of Orthologs Slide 25/31
26 COGs and KOGs Don t use this as a way to classify your orthologs. Many other more accurate methods exist. Comparative Genomics II Introduction Databases of Orthologs Slide 26/31
27 OrthoMCL database OrthoMCL is an MCL based clustering gene family assignment Comparative Genomics II Introduction Databases of Orthologs Slide 27/31
28 PhylomeDB PhylomeDB strategy Comparative Genomics II Introduction Databases of Orthologs Slide 28/31
29 TreeFam Curated gene trees and gene families starting with automated clusters. Comparative Genomics II Introduction Databases of Orthologs Slide 29/31
30 Other tools Bayesian gene tree with species tree knowledge Prime-GSR OrthoStrapper for orthology TreeBEST Likelihood gene tree inference which is species tree aware. Comparative Genomics II Introduction Databases of Orthologs Slide 30/31
31 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods Gene tree reconciliation Databases of Orthologs References Comparative Genomics II References Slide 31/31
32 References Frech C and Chen N. (2010) Genome-Wide Comparative Gene Family Classification PLoS One 5(10):e URL Gabaldon T. (2008) Large-scale assignment of orthology: back to phylogenetics? Genome Biol 9:235. URL Zmaseck C and Eddy SR. (2001) A simple algorithm to infer gene duplication and speciation events on a gene tree. Bioinformatics 17(9): URL Comparative Genomics II References Slide 31/31
Example of Function Prediction
Find similar genes Example of Function Prediction Suggesting functions of newly identified genes It was known that mutations of NF1 are associated with inherited disease neurofibromatosis 1; but little
More informationBioinformatics and Genomics Program, Center for Genomic Regulation, Doctor Aiguader, 88, Barcelona, Spain.
Review Large-scale assignment of orthology: back to phylogenetics? Toni Gabaldón Bioinformatics and Genomics Program, Center for Genomic Regulation, Doctor Aiguader, 88, 08003 Barcelona, Spain. Email:
More informationPhylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)
Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction Lesser Tenrec (Echinops telfairi) Goals: 1. Use phylogenetic experimental design theory to select optimal taxa to
More informationOrthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona
Orthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona (tgabaldon@crg.es) http://gabaldonlab.crg.es Homology the same organ in different animals under
More informationPhylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.
Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class
More informationPhylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline
Phylogenetics Todd Vision iology 522 March 26, 2007 pplications of phylogenetics Studying organismal or biogeographic history Systematics ating events in the fossil record onservation biology Studying
More informationOrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy
Emms and Kelly Genome Biology (2015) 16:157 DOI 10.1186/s13059-015-0721-2 SOFTWARE OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy
More informationSession 5: Phylogenomics
Session 5: Phylogenomics B.- Phylogeny based orthology assignment REMINDER: Gene tree reconstruction is divided in three steps: homology search, multiple sequence alignment and model selection plus tree
More informationComputational approaches for functional genomics
Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding
More informationBioinformatics tools for phylogeny and visualization. Yanbin Yin
Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and
More informationComputational methods for predicting protein-protein interactions
Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational
More informationEvolutionary Tree Analysis. Overview
CSI/BINF 5330 Evolutionary Tree Analysis Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Backgrounds Distance-Based Evolutionary Tree Reconstruction Character-Based
More informationMultiple Sequence Alignment. Sequences
Multiple Sequence Alignment Sequences > YOR020c mstllksaksivplmdrvlvqrikaqaktasglylpe knveklnqaevvavgpgftdangnkvvpqvkvgdqvl ipqfggstiklgnddevilfrdaeilakiakd > crassa mattvrsvksliplldrvlvqrvkaeaktasgiflpe
More informationOrthology Part I concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona
Orthology Part I concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona Toni Gabaldón Contact: tgabaldon@crg.es Group website: http://gabaldonlab.crg.es Science blog: http://treevolution.blogspot.com
More informationChapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships
Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic
More informationGene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family
Review: Gene Families Gene Families part 2 03 327/727 Lecture 8 What is a Case study: ian globin genes Gene trees and how they differ from species trees Homology, orthology, and paralogy Last tuesday 1
More informationComparative Bioinformatics Midterm II Fall 2004
Comparative Bioinformatics Midterm II Fall 2004 Objective Answer, part I: For each of the following, select the single best answer or completion of the phrase. (3 points each) 1. Deinococcus radiodurans
More informationand both play a significant role in the rise of variable size gene families originating
Reconstruction and Analysis of Gene Family Evolution in Mammals Jin Jun University of Connecticut, 2010 Gene duplication and loss is a dynamic and ongoing process during evolution and both play a significant
More informationComparing Genomes! Homologies and Families! Sequence Alignments!
Comparing Genomes! Homologies and Families! Sequence Alignments! Allows us to achieve a greater understanding of vertebrate evolution! Tells us what is common and what is unique between different species
More informationreconciling trees Stefanie Hartmann postdoc, Todd Vision s lab University of North Carolina the data
reconciling trees Stefanie Hartmann postdoc, Todd Vision s lab University of North Carolina 1 the data alignments and phylogenies for ~27,000 gene families from 140 plant species www.phytome.org publicly
More informationC3020 Molecular Evolution. Exercises #3: Phylogenetics
C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationBMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven)
BMI/CS 776 Lecture #20 Alignment of whole genomes Colin Dewey (with slides adapted from those by Mark Craven) 2007.03.29 1 Multiple whole genome alignment Input set of whole genome sequences genomes diverged
More informationMultiple Whole Genome Alignment
Multiple Whole Genome Alignment BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 206 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by
More informationPhylogenetic trees 07/10/13
Phylogenetic trees 07/10/13 A tree is the only figure to occur in On the Origin of Species by Charles Darwin. It is a graphical representation of the evolutionary relationships among entities that share
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More information17 Non-collinear alignment Motivation A B C A B C A B C A B C D A C. This exposition is based on:
17 Non-collinear alignment This exposition is based on: 1. Darling, A.E., Mau, B., Perna, N.T. (2010) progressivemauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5(6):e11147.
More information9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)
I9 Introduction to Bioinformatics, 0 Phylogenetic ree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & omputing, IUB Evolution theory Speciation Evolution of new organisms is driven by
More informationGenômica comparativa. João Carlos Setubal IQ-USP outubro /5/2012 J. C. Setubal
Genômica comparativa João Carlos Setubal IQ-USP outubro 2012 11/5/2012 J. C. Setubal 1 Comparative genomics There are currently (out/2012) 2,230 completed sequenced microbial genomes publicly available
More informationInferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT
Inferring phylogeny Constructing phylogenetic trees Tõnu Margus Contents What is phylogeny? How/why it is possible to infer it? Representing evolutionary relationships on trees What type questions questions
More informationTree of Life iological Sequence nalysis Chapter http://tolweb.org/tree/ Phylogenetic Prediction ll organisms on Earth have a common ancestor. ll species are related. The relationship is called a phylogeny
More informationBLAST. Varieties of BLAST
BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database
More informationEnsembl focuses on metazoan (animal) genomes. The genomes currently available at the Ensembl site are:
Comparative genomics and proteomics Species available Ensembl focuses on metazoan (animal) genomes. The genomes currently available at the Ensembl site are: Vertebrates: human, chimpanzee, mouse, rat,
More information8/23/2014. Phylogeny and the Tree of Life
Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major
More informationA PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS
A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS CRYSTAL L. KAHN and BENJAMIN J. RAPHAEL Box 1910, Brown University Department of Computer Science & Center for Computational Molecular Biology
More informationBINF6201/8201. Molecular phylogenetic methods
BINF60/80 Molecular phylogenetic methods 0-7-06 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics
More informationPGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species
PGA: A Program for Genome Annotation by Comparative Analysis of Maximum Likelihood Phylogenies of Genes and Species Paulo Bandiera-Paiva 1 and Marcelo R.S. Briones 2 1 Departmento de Informática em Saúde
More informationEvolution by duplication
6.095/6.895 - Computational Biology: Genomes, Networks, Evolution Lecture 18 Nov 10, 2005 Evolution by duplication Somewhere, something went wrong Challenges in Computational Biology 4 Genome Assembly
More informationA (short) introduction to phylogenetics
A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field
More informationAnalysis of Gene Order Evolution beyond Single-Copy Genes
Analysis of Gene Order Evolution beyond Single-Copy Genes Nadia El-Mabrouk Département d Informatique et de Recherche Opérationnelle Université de Montréal mabrouk@iro.umontreal.ca David Sankoff Department
More informationNature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.
Supplementary Figure 1 Icm/Dot secretion system region I in 41 Legionella species. Homologs of the effector-coding gene lega15 (orange) were found within Icm/Dot region I in 13 Legionella species. In four
More informationPhylogenomics of closely related species and individuals
Phylogenomics of closely related species and individuals Matthew Rasmussen Siepel lab, Cornell University In collaboration with Manolis Kellis, MIT CSAIL February, 2013 Short time scales 1kyr-1myrs Long
More informationGene function annotation
Gene function annotation Paul D. Thomas, Ph.D. University of Southern California What is function annotation? The formal answer to the question: what does this gene do? The association between: a description
More informationEvolutionary Rate Covariation of Domain Families
Evolutionary Rate Covariation of Domain Families Author: Brandon Jernigan A Thesis Submitted to the Department of Chemistry and Biochemistry in Partial Fulfillment of the Bachelors of Science Degree in
More informationBiological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor
Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms
More informationEVOLUTIONARY DISTANCES
EVOLUTIONARY DISTANCES FROM STRINGS TO TREES Luca Bortolussi 1 1 Dipartimento di Matematica ed Informatica Università degli studi di Trieste luca@dmi.units.it Trieste, 14 th November 2007 OUTLINE 1 STRINGS:
More informationBioinformatics: Network Analysis
Bioinformatics: Network Analysis Comparative Network Analysis COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Biomolecular Network Components 2 Accumulation of Network Components
More information10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison
10-810: Advanced Algorithms and Models for Computational Biology microrna and Whole Genome Comparison Central Dogma: 90s Transcription factors DNA transcription mrna translation Proteins Central Dogma:
More informationSUPPLEMENTARY INFORMATION
Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,
More informationPhylogenetic and Functional Assessment of Orthologs Inference Projects and Methods
Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods Adrian M. Altenhoff*, Christophe Dessimoz Institute of Computational Science, ETH Zurich, and Swiss Institute of Bioinformatics,
More informationWhat is Phylogenetics
What is Phylogenetics Phylogenetics is the area of research concerned with finding the genetic connections and relationships between species. The basic idea is to compare specific characters (features)
More informationNon-binary Tree Reconciliation. Louxin Zhang Department of Mathematics National University of Singapore
Non-binary Tree Reconciliation Louxin Zhang Department of Mathematics National University of Singapore matzlx@nus.edu.sg Introduction: Gene Duplication Inference Consider a duplication gene family G Species
More informationChapter 9. Inferring Orthology and Paralogy. Adrian M. Altenhoff and Christophe Dessimoz. Abstract. 1. Introduction
Chapter 9 Inferring Orthology and Paralogy Adrian M. Altenhoff and Christophe Dessimoz Abstract The distinction between orthologs and paralogs, genes that started diverging by speciation versus duplication,
More informationMitochondrial Genome Annotation
Protein Genes 1,2 1 Institute of Bioinformatics University of Leipzig 2 Department of Bioinformatics Lebanese University TBI Bled 2015 Outline Introduction Mitochondrial DNA Problem Tools Training Annotation
More informationUsing phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)
Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures
More informationPhylogeny: building the tree of life
Phylogeny: building the tree of life Dr. Fayyaz ul Amir Afsar Minhas Department of Computer and Information Sciences Pakistan Institute of Engineering & Applied Sciences PO Nilore, Islamabad, Pakistan
More informationPhylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz
Phylogenetic Trees What They Are Why We Do It & How To Do It Presented by Amy Harris Dr Brad Morantz Overview What is a phylogenetic tree Why do we do it How do we do it Methods and programs Parallels
More informationmolecular evolution and phylogenetics
molecular evolution and phylogenetics Charlotte Darby Computational Genomics: Applied Comparative Genomics 2.13.18 https://www.thinglink.com/scene/762084640000311296 Internal node Root TIME Branch Leaves
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 200 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationElements of Bioinformatics 14F01 TP5 -Phylogenetic analysis
Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections - Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila
More informationPhylogenetics: Building Phylogenetic Trees
1 Phylogenetics: Building Phylogenetic Trees COMP 571 Luay Nakhleh, Rice University 2 Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary model should
More informationIntroduction to protein alignments
Introduction to protein alignments Comparative Analysis of Proteins Experimental evidence from one or more proteins can be used to infer function of related protein(s). Gene A Gene X Protein A compare
More informationComparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting
Comparative genomics of gene families in relation with metabolic pathways for gene candidates highlighting Delphine Larivière & David Couvin Under the supervision of Dominique This, Jean-François Dufayard
More informationChapter 27: Evolutionary Genetics
Chapter 27: Evolutionary Genetics Student Learning Objectives Upon completion of this chapter you should be able to: 1. Understand what the term species means to biology. 2. Recognize the various patterns
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 2009 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationZhongyi Xiao. Correlation. In probability theory and statistics, correlation indicates the
Character Correlation Zhongyi Xiao Correlation In probability theory and statistics, correlation indicates the strength and direction of a linear relationship between two random variables. In general statistical
More informationUoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)
- Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the
More informationPhylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University
Phylogenetics: Building Phylogenetic Trees COMP 571 - Fall 2010 Luay Nakhleh, Rice University Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary
More informationGenome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.
Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction
More informationReconciliation with Non-binary Gene Trees Revisited
Reconciliation with Non-binary Gene Trees Revisited Yu Zheng and Louxin Zhang National University of Singapore matzlx@nus.edu.sg Abstract By reconciling the phylogenetic tree of a gene family with the
More informationMany of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Many of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationHomology and Information Gathering and Domain Annotation for Proteins
Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The
More informationPage 1. Evolutionary Trees. Why build evolutionary tree? Outline
Page Evolutionary Trees Russ. ltman MI S 7 Outline. Why build evolutionary trees?. istance-based vs. character-based methods. istance-based: Ultrametric Trees dditive Trees. haracter-based: Perfect phylogeny
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationPhylogenetic molecular function annotation
Phylogenetic molecular function annotation Barbara E Engelhardt 1,1, Michael I Jordan 1,2, Susanna T Repo 3 and Steven E Brenner 3,4,2 1 EECS Department, University of California, Berkeley, CA, USA. 2
More informationFrom BBCC Conference 2017 Naples, Italy December 2017
Ambrosino et al. BMC Bioinformatics 2018, 19(Suppl 15):435 https://doi.org/10.1186/s12859-018-2420-y RESEARCH Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities
More informationPhylogenetic Reconstruction of Orthology, Paralogy, and Conserved Synteny for Dog and Human
Phylogenetic Reconstruction of Orthology, Paralogy, and Conserved Synteny for Dog and Human Leo Goodstadt *, Chris P. Ponting Medical Research Council Functional Genetics Unit, University of Oxford, Department
More informationPairwise & Multiple sequence alignments
Pairwise & Multiple sequence alignments Urmila Kulkarni-Kale Bioinformatics Centre 411 007 urmila@bioinfo.ernet.in Basis for Sequence comparison Theory of evolution: gene sequences have evolved/derived
More informationUsing Phylogenomics to Predict Novel Fungal Pathogenicity Genes
Using Phylogenomics to Predict Novel Fungal Pathogenicity Genes David DeCaprio, Ying Li, Hung Nguyen (sequenced Ascomycetes genomes courtesy of the Broad Institute) Phylogenomics Combining whole genome
More informationBiol478/ August
Biol478/595 29 August # Day Inst. Topic Hwk Reading August 1 M 25 MG Introduction 2 W 27 MG Sequences and Evolution Handouts 3 F 29 MG Sequences and Evolution September M 1 Labor Day 4 W 3 MG Database
More informationSupplementary Information
Supplementary Information Supplementary Figure 1. Schematic pipeline for single-cell genome assembly, cleaning and annotation. a. The assembly process was optimized to account for multiple cells putatively
More informationProtein Families. João C. Setubal University of São Paulo Agosto /23/2012 J. C. Setubal
Protein Families João C. Setubal University of São Paulo Agosto 2012 8/23/2012 J. C. Setubal 1 Motivation Phytophthora Science paper [Tyler et al., 2006] Comparison of the [P. sojae and P. ramorum] genomes
More informationBig Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events?
Plant of the Day Cyperus esculentus - Cyperaceae Chufa (tigernut) 8,000 kg/ha, 720 kcal/sq m per month Top Crop for kcal productivity! One of the world s worst weeds Big Questions Is polyploidy an evolutionary
More informationWhole Genome Alignments and Synteny Maps
Whole Genome Alignments and Synteny Maps IINTRODUCTION It was not until closely related organism genomes have been sequenced that people start to think about aligning genomes and chromosomes instead of
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationReconstructing the history of lineages
Reconstructing the history of lineages Class outline Systematics Phylogenetic systematics Phylogenetic trees and maps Class outline Definitions Systematics Phylogenetic systematics/cladistics Systematics
More informationPackage WGDgc. June 3, 2014
Package WGDgc June 3, 2014 Type Package Title Whole genome duplication detection using gene counts Version 1.1 Date 2014-06-03 Author Tram Ta, Charles-Elie Rabier, Cecile Ane Maintainer Tram Ta
More informationAnatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses
Anatomy of a tree outgroup: an early branching relative of the interest groups sister taxa: taxa derived from the same recent ancestor polytomy: >2 taxa emerge from a node Anatomy of a tree clade is group
More informationResearch Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family.
Research Proposal Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family. Name: Minjal Pancholi Howard University Washington, DC. June 19, 2009 Research
More informationRGP finder: prediction of Genomic Islands
Training courses on MicroScope platform RGP finder: prediction of Genomic Islands Dynamics of bacterial genomes Gene gain Horizontal gene transfer Gene loss Deletion of one or several genes Duplication
More informationPhylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science
Phylogeny and Evolution Gina Cannarozzi ETH Zurich Institute of Computational Science History Aristotle (384-322 BC) classified animals. He found that dolphins do not belong to the fish but to the mammals.
More informationEstimating Evolutionary Trees. Phylogenetic Methods
Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent
More informationPhylogenetic inference
Phylogenetic inference Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, March 7 th 016 After this lecture, you can discuss (dis-) advantages of different information types
More informationPhylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center
Phylogenetic Analysis Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center Outline Basic Concepts Tree Construction Methods Distance-based methods
More informationPhylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from
More informationUnified modeling of gene duplication, loss and coalescence using a locus tree
Unified modeling of gene duplication, loss and coalescence using a locus tree Matthew D. Rasmussen 1,2,, Manolis Kellis 1,2, 1. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute
More informationCONCEPT OF SEQUENCE COMPARISON. Natapol Pornputtapong 18 January 2018
CONCEPT OF SEQUENCE COMPARISON Natapol Pornputtapong 18 January 2018 SEQUENCE ANALYSIS - A ROSETTA STONE OF LIFE Sequence analysis is the process of subjecting a DNA, RNA or peptide sequence to any of
More informationLearning in Bayesian Networks
Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks
More informationLecture 8 Multiple Alignment and Phylogeny
Introduction to Bioinformatics for Medical Research Gideon Greenspan gdg@cs.technion.ac.il Lecture 8 Multiple Alignment and Phylogeny Multiple Alignment & Phylogeny Multiple Alignment Scoring Complexity
More information