Computational Genetics Winter 2013 Lecture 10. Eleazar Eskin University of California, Los Angeles

Similar documents
Genome Rearrangements In Man and Mouse. Abhinav Tiwari Department of Bioengineering

I519 Introduction to Bioinformatics, Genome Comparison. Yuzhen Ye School of Informatics & Computing, IUB

Algorithms for Bioinformatics

A PARSIMONY APPROACH TO ANALYSIS OF HUMAN SEGMENTAL DUPLICATIONS

I519 Introduction to Bioinformatics, Genome Comparison. Yuzhen Ye School of Informatics & Computing, IUB

Genomes and Their Evolution

Molecular evolution - Part 1. Pawan Dhar BII

Multiple Whole Genome Alignment

Genomes Comparision via de Bruijn graphs

Greedy Algorithms. CS 498 SS Saurabh Sinha

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Cell Division and Reproduction

Chromosomal rearrangements in mammalian genomes : characterising the breakpoints. Claire Lemaitre

A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes

AN EXACT SOLVER FOR THE DCJ MEDIAN PROBLEM

Linear models for the joint analysis of multiple. array-cgh profiles

EVOLUTIONARY DISTANCES

Genome Rearrangement Problems with Single and Multiple Gene Copies: A Review

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

Breakpoint Graphs and Ancestral Genome Reconstructions

The Fragile Breakage versus Random Breakage Models of Chromosome Evolution

Genome Rearrangement Problems with Single and Multiple Gene Copies: A Review

Computational Biology: Basics & Interesting Problems

growth, the replacement of damaged cells, and development from an embryo into an adult. These cell division occur by means of a process of mitosis

Stochastic processes and

Sparse canonical correlation analysis, with applications to genomic data

The combinatorics and algorithmics of genomic rearrangements have been the subject of much

Analysis of Gene Order Evolution beyond Single-Copy Genes

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Supplementary Information

Some Statistical Models and Algorithms for Change-Point Problems in Genomics

5.1 Cell Division and the Cell Cycle

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Principles of Genetics

Natural Selection. Differential survival and reproduction has been demonstrated in the wild and in the laboratory many times.

Perfect Sorting by Reversals and Deletions/Insertions

GS Analysis of Microarray Data

17 Non-collinear alignment Motivation A B C A B C A B C A B C D A C. This exposition is based on:

Texas Biology Standards Review. Houghton Mifflin Harcourt Publishing Company 26 A T

Name: Date: Period: Must-Know: Unit 6 (Cell Division) AP Biology, Mrs. Krouse. Topic #1: The Cell Cycle and Mitosis

Homeotic Genes and Body Patterns

Martin Bader June 25, On Reversal and Transposition Medians

Comparative genomics: Overview & Tools + MUMmer algorithm

BMI/CS 776 Lecture #20 Alignment of whole genomes. Colin Dewey (with slides adapted from those by Mark Craven)

Post-selection Inference for Changepoint Detection

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Intro Gene regulation Synteny The End. Today. Gene regulation Synteny Good bye!

Whole Genome Alignments and Synteny Maps

Sex accelerates adaptation

Natural Selection. a population there must be phenotypic variation. differential reproductive success of different phenotypes

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

Chromosomal Breakpoint Reuse in Genome Sequence Rearrangement ABSTRACT

ECOL/MCB 320 and 320H Genetics

TE content correlates positively with genome size

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Learning Objectives LO 3.7 The student can make predictions about natural phenomena occurring during the cell cycle. [See SP 6.4]

Genetic Engineering and Creative Design

On the complexity of unsigned translocation distance

On Reversal and Transposition Medians

CELL DIVISION MITOSIS & MEIOSIS

Charalampos (Babis) E. Tsourakakis WABI 2013, France WABI '13 1

GS Analysis of Microarray Data

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Special Topics on Genetics

Outline of lectures 23-26

1 Introduction. Abstract

Grade 11 Biology SBI3U 12

A Phylogenetic Network Construction due to Constrained Recombination

THE CELL CYCLE & MITOSIS. Asexual Reproduction: Production of genetically identical offspring from a single parent.

Reconstructing contiguous regions of an ancestral genome

Q2 (4.6) Put the following in order from biggest to smallest: Gene DNA Cell Chromosome Nucleus. Q8 (Biology) (4.6)

ADMM Fused Lasso for Copy Number Variation Detection in Human 3 March Genomes / 1

Lecture 12 - Meiosis

Whole-genome duplications and paleopolyploidy

Applications of genome alignment

Bioinformatics 2 - Lecture 4

JUNK DNA: EVIDENCE FOR EVOLUTION OR DESIGN?

Meiosis vs Mitosis. How many times did it go through prophase-metaphase-anaphase-telophase?

High-throughput sequence alignment. November 9, 2017

Research Statement on Statistics Jun Zhang

Group activities: Making animal model of human behaviors e.g. Wine preference model in mice

Chromosomes and Inheritance

Meiosis and Sexual Reproduction. Chapter 9

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution

Cellular Reproduction

Genomes and Their Evolution

Measuring TF-DNA interactions

Bioinformatics Exercises

Introduction to Bioinformatics

Discovery of Genomic Structural Variations with Next-Generation Sequencing Data

Meiosis and Sexual Life Cycles

Meiosis and Sexual Life Cycles

Meiosis & Sexual Reproduction

Biology. Chapter 12. Meiosis and Sexual Reproduction. Concepts and Applications 9e Starr Evers Starr. Cengage Learning 2015

Genome Rearrangement. 1 Inversions. Rick Durrett 1

Learning Objectives Chapter 8

Processes of Evolution

Transcription:

Computational Genetics Winter 2013 Lecture 10 Eleazar Eskin University of California, Los ngeles

Pair End Sequencing Lecture 10. February 20th, 2013 (Slides from Ben Raphael)

Chromosome Painting: Normal Cells

Chromosome Painting: Tumor Cells

Rearrangements in Tumors Change gene structure and regulatory wiring of the genome. Create bad novel fusion genes and break good old genes. Example: translocation in leukemia. Chromosome 9 promoter Chromosome 22 BL gene promoter BCR gene promoter BCR-BL oncogene n Gleevec TM (Novartis 2001) targets BCR-BL oncogene.

Complex Tumor Genomes 1) What are detailed architectures of tumor genomes? 2) What rearrangements/duplications produce these architectures and what is the order of these events? 3) What are the novel fusion genes and old broken genes?

Genome rearrangements Mouse (X chrom.) Unknown ancestor ~ 80 million years ago (X chrom.) n What are the the architectural blocks forming the existing genomes and how to find them? n What is the architecture of the ancestral genome? n What is the evolutionary scenario for transforming one genome into the other?

History of Chromosome X Rat Consortium, Nature, 2004

Inversions 1 2 3 9 10 8 4 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 7 6 5 n Blocks represent conserved genes. n In the course of evolution or in a clinical context, blocks 1,2,,10 could be misread as: 1, 2, 3, -8, -7, -6, -5, -4, 9, 10.

Inversions 1 2 3 9 10 8 4 1, 2, 3, -8, -7, -6, -5, -4, 9, 10 7 6 5 n n n n Blocks represent conserved genes. In the course of evolution or in a clinical context, blocks 1,,10 could be misread as 1, 2, 3, -8, -7, -6, -5, -4, 9, 10. Evolution: occurred one-two times every million years. Cancer: may occur every month.

Inversions 1 2 3 9 10 8 4 1, 2, 3, -8, -7, -6, -5, -4, 9, 10 7 6 5 The inversion introduced two breakpoints (disruptions in order).

Measuring Structural Changes in Tumors: Cytogenetics Directly visualize (fluorescently) labeled chromosomes. Chromosome banding, mfish, SKY Weakness: Physical location of chromosomal junctions not revealed. Low resolution. No/little information about copy number changes.

Paired End Sequencing (PE) C. Collins et al. (UCSF Cancer Center) Tumor DN 1) Pieces of tumor genome: clones (100-250kb). DN x y 2) Sequence ends of clones (500bp). 3) Map end sequences to human genome. Each clone corresponds to pair of end sequences (PE pair) (x,y). Typical Next Generation Sequencing read lengths are shorter.

PE Pairs Order PE pair such that x < y. PE pair (x,y) is valid if x,y on same chromosome. and l y x L, min (max) size of clone. x, y have opposite, convergent orientations invalid, otherwise. Results from rearrangement or experimental noise. L x y x 1 x x 3 x 4 y 1 y x 5 y y y 2 2 5 4 3

Tumor Genome Reconstruction Puzzle Reconstruct tumor genome x 1 x 2 B C D E genome (known) Unknown sequence of rearrangements Tumor genome -C -D B E (unknown) Map PE pairs to human genome. x 3 x 4 y 1 y x 5 y 2 5 y 4 y 3 Location of PE pairs in human genome. (known)

Tumor Genome Reconstruction E B Tumor -D -C B C D E

Tumor Genome Reconstruction E B Tumor -D -C B C D E

Tumor Genome Reconstruction E B (x 3,y 3 ) Tumor (x 2,y -D 2 ) -C (x 4,y 4 ) (x 1,y 1 ) B C D E x 1 x 2 x 3 x 4 y 1 y 2 y 4 y 3

PE Plot E (x 3,y 3 ) (x 4,y 4 ) D C B (x 1,y 1 ) (x 2,y 2 ) 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E D C B 2D Representation of PE Data Each point is PE pair. Can we reconstruct the tumor genome from the positions of the PE pairs? B C D E

E E D -D C -C B B Reconstructed Tumor Genome B C D E -C -D B E

Real data noisy and incomplete!

Reconstruction of Tumor History n Use knowledge of known rearrangement mechanisms e.g. inversions, translocations, etc. n Find simplest explanation for data, given these mechanisms. n Motivation: Sorting by Reversals

n G = [0,M], unichromosomal genome. n Reversal ρ s,t (x)= x, if x < s or x > t, PE Sorting Problem t (x s), otherwise. B C G x 1 s y 1 t x 2 y 2 ρ -B x 1 y G = ρg 1 x 2 y 2 Given: PE pairs (x 1, y 1 ),, (x n, y n ) Find: Minimum number of reversals ρ s1,t1,, ρ sn, tn such that if ρ = ρ s1,t1 ρ sn, tn then (ρ x 1, ρ y 1 ),, (ρ x n, ρ y n ) are valid PE pairs.

B C x 1 s x x y 1 t 2 3 y 3 y 2 -C -B x 1 y 1 y 3 x 3 x 2 y 2 ρ t s Sequence of reversals. s t ll ES pairs valid.

Breast Cancer MCF7 Cell Line chromosomes MCF7 chromosomes 5 inversions 15 translocations Raphael et al. 2003.

Complications with MCF7: Chromosomes 1,3,17, 20 33/70 clusters Total length: 31Mb

s t C -B s t C B inversion u B u w D B C D v E w C D v E duplication/ transposition u B w C v E???? Rearrangement Signatures Tumor s t -B s t -C B D C D translocation

Complex Tumor Genomes

Structure of Duplications in Tumors? Duplicated segments may co-localize (Guan et al. Nat.Gen.1994) genome Tumor genome n Mechanisms not well understood.

Tumor mplisomes