Genome Rearrangements In Man and Mouse. Abhinav Tiwari Department of Bioengineering

Similar documents
Analysis of Gene Order Evolution beyond Single-Copy Genes

Algorithms for Bioinformatics

I519 Introduction to Bioinformatics, Genome Comparison. Yuzhen Ye School of Informatics & Computing, IUB

Chromosomal rearrangements in mammalian genomes : characterising the breakpoints. Claire Lemaitre

Greedy Algorithms. CS 498 SS Saurabh Sinha

Chromosomal Breakpoint Reuse in Genome Sequence Rearrangement ABSTRACT

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Computational Genetics Winter 2013 Lecture 10. Eleazar Eskin University of California, Los Angeles

Genomes Comparision via de Bruijn graphs

The Fragile Breakage versus Random Breakage Models of Chromosome Evolution

17 Non-collinear alignment Motivation A B C A B C A B C A B C D A C. This exposition is based on:

Perfect Sorting by Reversals and Deletions/Insertions

The combinatorics and algorithmics of genomic rearrangements have been the subject of much

Multiple Whole Genome Alignment

I519 Introduction to Bioinformatics, Genome Comparison. Yuzhen Ye School of Informatics & Computing, IUB

Comparing whole genomes

AN EXACT SOLVER FOR THE DCJ MEDIAN PROBLEM

Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes

Breakpoint Graphs and Ancestral Genome Reconstructions

Handling Rearrangements in DNA Sequence Alignment

7 Multiple Genome Alignment

BMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven)

Scaffold Filling Under the Breakpoint Distance

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS

Whole Genome Alignments and Synteny Maps

On the complexity of unsigned translocation distance

Genome Rearrangement Problems with Single and Multiple Gene Copies: A Review

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Techniques for Multi-Genome Synteny Analysis to Overcome Assembly Limitations

Genome Rearrangement Problems with Single and Multiple Gene Copies: A Review

BIO GENETICS CHROMOSOME MUTATIONS

2 Genome evolution: gene fusion versus gene fission

Some Algorithmic Challenges in Genome-Wide Ortholog Assignment

Intro Gene regulation Synteny The End. Today. Gene regulation Synteny Good bye!

Graphs, permutations and sets in genome rearrangement

Inferring positional homologs with common intervals of sequences

Reconstructing contiguous regions of an ancestral genome

A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes

On the Nadeau-Taylor Theory of Conserved Chromosome Segments

Multi-Break Rearrangements and Breakpoint Re- Uses: From Circular to Linear Genomes

Comparative genomics: Overview & Tools + MUMmer algorithm

Comparing Genomes! Homologies and Families! Sequence Alignments!

Practical considerations of working with sequencing data

Molecular evolution - Part 1. Pawan Dhar BII

Isolating - A New Resampling Method for Gene Order Data

Genome Rearrangement. 1 Inversions. Rick Durrett 1

Chapter 11 Chromosome Mutations. Changes in chromosome number Chromosomal rearrangements Evolution of genomes

Assembly improvement: based on Ragout approach. student: Anna Lioznova scientific advisor: Son Pham

CONTENTS. P A R T I Genomes 1. P A R T II Gene Transcription and Regulation 109

Sex Linked Inheritance, Chromosome Mapping & Mutations

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Motivating the need for optimal sequence alignments...

NIH Public Access Author Manuscript Pac Symp Biocomput. Author manuscript; available in PMC 2009 October 6.

Orthology Part I: concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona

Presentation by Julie Hudson MAT5313

Computational approaches for functional genomics

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

Mathematics of Evolution and Phylogeny. Edited by Olivier Gascuel

A Framework for Orthology Assignment from Gene Rearrangement Data

An Improved Algorithm for Ancestral Gene Order Reconstruction

Regions with Duplications ABSTRACT

MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance

Martin Bader June 25, On Reversal and Transposition Medians

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Module: Sequence Alignment Theory and Applications Session: Introduction to Searching and Sequence Alignment

A New Genomic Evolutionary Model for Rearrangements, Duplications, and Losses that Applies across Eukaryotes and Prokaryotes

Tests for gene clustering

Chromosomal Rearrangement Inferred From Comparisons of 12 Drosophila Genomes

Effects of Gap Open and Gap Extension Penalties

Evolution s cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes

Biol478/ August

Jay Moore,, Graham King, James Lynn. Data integration for Brassica comparative genomics

Alignment Algorithms. Alignment Algorithms

Eric Tannier* - Chunfang Zheng - David Sankoff - *Corresponding author

Outline of lectures 23-26

Evolution of Tandemly Arrayed Genes in Multiple Species

Multiple genome rearrangement

WHERE DOES THE VARIATION COME FROM IN THE FIRST PLACE?

Example of Function Prediction

A Practical Algorithm for Ancestral Rearrangement Reconstruction

Whole-Genome Alignments and Polytopes for Comparative Genomics

Genomes and Their Evolution

Sequence analysis and comparison

Annotation and Nomenclature: A Zebrafish Example. Ingo Braasch, Julian Catchen and John Postlethwait

Genômica comparativa. João Carlos Setubal IQ-USP outubro /5/2012 J. C. Setubal

An Efficient Probabilistic Population-Based Descent for the Median Genome Problem

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Multiple Alignment of Genomic Sequences

Algorithms for phylogeny construction

Comparing Genomes with Duplications: a Computational Complexity Point of View

Comparative Genomics Background and Strategies. Nitya Sharma, Emily Rogers, Kanika Arora, Zhiming Zhao, Yun Gyeong Lee

Discovery of Genomic Structural Variations with Next-Generation Sequencing Data

Sequence Alignment (chapter 6)

Advances in Phylogeny Reconstruction from Gene Order and Content Data

Name Period. 3. How many rounds of DNA replication and cell division occur during meiosis?

Comparative Genomics. Chapter for Human Genetics - Principles and Approaches - 4 th Edition

EVOLUTIONARY DISTANCES

Phylogenies Scores for Exhaustive Maximum Likelihood and Parsimony Scores Searches

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

Transcription:

Genome Rearrangements In Man and Mouse Abhinav Tiwari Department of Bioengineering

Genome Rearrangement Scrambling of the order of the genome during evolution Operations on chromosomes Reversal Translocation Fusion Fission The genomic distance between multichromosomalgenomes is defined as the number of such rearrangements in the most parsimonious scenario

Examples Reversal 1 2 3 4 5 6 1 2-5 -4-3 6 Chromosome 1: Chromosome 2: 1 2 3 4 5 6 Translocation 1 2 6 4 5 3 Fusion Chromosome 1: Chromosome 2: 1 2 3 4 5 6 1 2 3 4 5 6 Fission

Why study genome rearrangements? Useful in studying evolution Less ambiguity in interpreting the mutations A larger scale of data which is more appropriate for studying evolution

Adjacencies and Breakpoints π = π 1 π 2 π 3 π n-1 π n A pair of elements π i and π i+ 1 are adjacent if π i+1 = π i +1 An adjacency - a pair of adjacent elements that are consecutive A breakpoint - a pair of adjacent elements that are not consecutive For example: π= 1 9 3 4 7 8 2 6 5

Shortcomings of earlier works Do not distinguish between micro- and macro-rearrangements Unreliable assignment of orthologs Conserved gene order can be disrupted by recent duplications and insertions Problem To obtain a meaningful estimate of the number of rearrangement events on the evolutionary path from mouse to human

Human and mouse synteny blocks Syntenyblocks are segments that can be converted into conserved segments by microrearrangements Human and mouse genomes share 281 synteny blocks

GRIMM-Synteny Algorithm Form an anchor graph whose vertex set is the set of anchors (bidirectional best local similarities called anchors). Connect vertices in the anchor graph by an edge if the distance between them is smaller than the gap size G. Determine the connected components of the anchor graph. Each connected component is called a cluster. Delete small clusters (shorter than the minimum cluster size C in length). Determine the cluster order and signs for each genome. Output the strips in the resulting cluster order as synteny blocks

An example: X-chromosome Dot plot of anchors

An example: X-chromosome Cluster of anchors

An example: X-chromosome Rectified anchors

An example: X-chromosome Synteny blocks

An example: X-chromosome Syntenyblocks as units of same size

A new way to construct Breakpoint Graph

Parsimonious rearrangement scenario Hannenhalli-Pevzneralgorithm uses breakpoint graph to construct the most parsimonious evolutionary scenario

Multichromosomalbreakpoint graph of the whole human and mouse genomes

Reversal distance Rd is at most ½ the number of breakpoints in the genome Inaccurate as breakpoints might be reused in the evolution Hannenhalliand Pevznertheorem estimates Rd = n+1-c+h A similar theorem holds for multichromosomal genomes Fast implementation of the HannenhalliPezneralgorithm available via GRIMM web server 245 rearrangements ( 149 inversions, 93 translocations, 3 fissions) 41 out of 281 syntenyblocks do not show any rearrangements, 10 are extremely rearranged

Summary New algorithm for constructing syntenyblocks Study arrangement of snyteny blocks in human and mouse Derive a most parsimonious human-mouse rearrangement scenario Provide evidence that intrachromosomalrearrangements are more frequent than interchromosomal rearrangements