Gel Electrophoresis. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 1

Similar documents
Theory of Evolution Charles Darwin

Theory of Evolution. Charles Darwin

Constructing Evolutionary/Phylogenetic Trees

Constructing Evolutionary/Phylogenetic Trees

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Evolutionary Tree Analysis. Overview

Phylogenetic trees 07/10/13

Algorithms in Bioinformatics

EVOLUTIONARY DISTANCES

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Phylogenetic inference

CS5238 Combinatorial methods in bioinformatics 2003/2004 Semester 1. Lecture 8: Phylogenetic Tree Reconstruction: Distance Based - October 10, 2003

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

Phylogenetic analyses. Kirsi Kostamo

Phylogeny Tree Algorithms

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Phylogenetic inference: from sequences to trees

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Phylogeny: building the tree of life

Michael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

Bio nformatics. Lecture 16. Saad Mneimneh

BINF6201/8201. Molecular phylogenetic methods

CSCI1950 Z Computa4onal Methods for Biology Lecture 5

Phylogenetic Tree Reconstruction

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Molecular Phylogenetics (part 1 of 2) Computational Biology Course João André Carriço

Properties of Life. Levels of Organization. Levels of Organization. Levels of Organization. Levels of Organization. The Science of Biology.

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Protein Structure Analysis and Verification. Course S Basics for Biosystems of the Cell exercise work. Maija Nevala, BIO, 67485U 16.1.

Phylogeny: traditional and Bayesian approaches

PHYLOGENY AND SYSTEMATICS

The Science of Biology. Chapter 1

Inferring phylogeny. Today s topics. Milestones of molecular evolution studies Contributions to molecular evolution

DNA Phylogeny. Signals and Systems in Biology Kushal EE, IIT Delhi

LECTURE-13. Peptide Mass Fingerprinting HANDOUT. Mass spectrometry is an indispensable tool for qualitative and quantitative analysis of

Processes of Evolution

How to read and make phylogenetic trees Zuzana Starostová

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

The Science of Biology. Chapter 1


Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

How should we organize the diversity of animal life?

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

A (short) introduction to phylogenetics

C.DARWIN ( )

Lecture 6 Phylogenetic Inference

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Computational Methods for Mass Spectrometry Proteomics

Multiple Sequence Alignment. Sequences

Phylogeny. November 7, 2017

Phylogenetics: Building Phylogenetic Trees

Additive distances. w(e), where P ij is the path in T from i to j. Then the matrix [D ij ] is said to be additive.

BIOL 1010 Introduction to Biology: The Evolution and Diversity of Life. Spring 2011 Sections A & B

Plan: Evolutionary trees, characters. Perfect phylogeny Methods: NJ, parsimony, max likelihood, Quartet method

What is Phylogenetics

Consistency Index (CI)

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University

The Generalized Neighbor Joining method

A Phylogenetic Network Construction due to Constrained Recombination

Inferring Phylogenetic Trees. Distance Approaches. Representing distances. in rooted and unrooted trees. The distance approach to phylogenies

Phylogenetics. BIOL 7711 Computational Bioscience

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

Inferring Molecular Phylogeny

Dr. Amira A. AL-Hosary

Phylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Estimating Evolutionary Trees. Phylogenetic Methods

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

I. Short Answer Questions DO ALL QUESTIONS

Chapter 16: Evolutionary Theory

Evolution. Species Changing over time

Supplementary Information

Estimating Phylogenies (Evolutionary Trees) II. Biol4230 Thurs, March 2, 2017 Bill Pearson Jordan 6-057

8/23/2014. Phylogeny and the Tree of Life

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

Biology 211 (2) Week 1 KEY!

Evolution. Species Changing over time

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics

Mass spectrometry has been used a lot in biology since the late 1950 s. However it really came into play in the late 1980 s once methods were

Cryodetectors in Life Science Mass Spectrometry. Damian Twerenbold

EQ: How are genetic variations caused and how do they lead to natural selection?

Phylogeny and Molecular Evolution. Introduction

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

Evolution. Just a few points

Analytical Technologies in Biotechnology Prof. Dr. Ashwani K. Sharma Department of Biotechnology Indian Institute of Technology, Roorkee

Lab 2A--Life on Earth

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Reading for Lecture 13 Release v10

Seuqence Analysis '17--lecture 10. Trees types of trees Newick notation UPGMA Fitch Margoliash Distance vs Parsimony

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Lecture 8 Multiple Alignment and Phylogeny

Chapter 27: Evolutionary Genetics

Big Idea #1: The process of evolution drives the diversity and unity of life

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Transcription:

Gel Electrophoresis Used to measure the lengths of DNA fragments. When voltage is applied to DNA, different size fragments migrate to different distances (smaller ones travel farther). 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 1

Gel Pictures 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 2

Gel Electrophoresis: Measure sizes of fragments The phosphate backbone makes DNA a highly negatively charged molecule. Thus DNA can be fractionated according to its size. Gel: allow hot 1 % solution of purifed agarose to cool and solidify/polymerize (like Jello). DNA sample added to wells at the top of a gel and voltage is applied. Larger fragments migrate through the pores slower. Proteins can be separated in much the same way, only acrylamide is used as the crosslinking agent. Varying concentration of agarose makes different pore sizes & results. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 3

Gel Electrophoresis 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 4

Gel Electrophoresis 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 5

10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 6

Sequencing a Fragment Using Gels Isolate the desired DNA fragment. Using the starving method obtain all fragments that end in A, C, G, T Run gel with 4 lanes and read the sequence 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 7

Application of Gels: Sequencing GCCAGGTGAGCCTTTGCA A C G T 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 8

2D-Gels 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 9

10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 10

2D-Gels First Dimension Methodology of a 2D Gel: Denatured cell extract layered on a glass tube filled with polyacrylamide saturated with solution of ampholytes, a mixture of polyanionic[(-) charged] and polycationic [(+) charged] molecules. When placed in an electric field, the ampholytes separate and form continuous gradient based on net charge. Highly polyanionic ampholytes will collect at one end of tube, highly polycationic ampholytes will collect at other end. Gradient of ampholytes establishes ph gradient. Charged proteins migrate through gradient until they reach their pi, or isoelectric point, the ph at which the net charge of the protein is zero. This resolves proteins that differ by only one charge. Entering the Second Dimension: Proteins that were separated on IEF gel are next separated in the second dimension based on their molecular weights. The IEF gel is extruded from tube and placed lengthwise in alignment with second polyacrylamide gel slab saturated with SDS. When an electric field is imposed, the proteins migrate from IEF gel into SDS slab gel and then separate according to mass. Sequential resolution of proteins by their charge and mass can give excellent separation of cellular proteins. As many as 1000 proteins can be resolved simultaneously. *Some information was taken from Lodish et al. Molecular Cell Biology. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 11

2D-gels Comparing Proteomes For Differences in Protein Expression Comparing Different Sample Types For Changes in Protein Levels 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 12

Mass Spectrometry 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 13

Mass Spectrometry Mass measurements By Time-of-Flight Pulses of light from laser ionizes protein that is absorbed on metal target. Electric field accelerates molecules in sample towards detector. The time to the detector is inversely proportional to the mass of the molecule. Simple conversion to mass gives the molecular weights of proteins and peptides. Using Peptide Masses to Identify Proteins: One powerful use of mass spectrometers is to identify a protein from its peptide mass fingerprint. A peptide mass fingerprint is a compilation of the molecular weights of peptides generated by a specific protease. The molecular weights of the parent protein prior to protease treatment and the subsequent proteolytic fragments are used to search genome databases for any similarly sized protein with identical or similar peptide mass maps. The increasing availability of genome sequences combined with this approach has almost eliminated the need to chemically sequence a protein to determine its amino acid sequence. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 14

Sequencing 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 15

Shotgun Sequencing From http://www.tulane.edu/~biochem/lecture/723/humgen.html 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 16

Shotgun Sequencing From http://www.tulane.edu/~biochem/lecture/723/humgen.html 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 17

Human Genome Project Play the Sequencing Video: Download Windows file from http://www.cs.fiu.edu/~giri/teach/6936/papers/sequence.exe Then run it on your PC. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 18

Assembly: Simple Example ACCGT, CGTGC, TTAC, TACCGT Total length = ~10» --ACCGT--» ----CGTGC» TTAC-----» -TACCGT» TTACCGTGC 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 19

Assembly: Complications Errors in input sequence fragments (~3%) Indels or substitutions Contamination by host DNA Chimeric fragments (joining of non-contiguous fragments) Unknown orientation Repeats (long repeats) Fragment contained in a repeat Repeat copies not exact copies Inherently ambiguous assemblies possible Inverted repeats Inadequate Coverage 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 20

Assembly: Complications 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 21

Assembly: Complications 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 22

Miscellaneous Contig: A continuously covered region in the assembly. Other sequencing methods: Sequencing by Hybridization (SBH) Dual end sequencing Chromosome Walking (see page 5-6 of Pevzner s text). 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 23

SBH Suppose that the only length 4 fragments that hybridize to S are: TAGG, GGCA, CAAA, GCAA, ATAG, AGGC. Then what is S, if it is of length ~9? TAGG GGCA CAAA GCAA ATAG AGGC Hamiltonian Path Problem 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 24

SBH TAG GGC CAA GCA ATA AGG AAA Eulerian Path Problem 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 25

Assembly Software Parallel EST alignment engine (http://corba.ebi.ac.uk/est") with a CORBA interface to alignment database. Can perform ad hoc assemblies. Can act as foundation for CORBA-based EST assembly and editing package. [Parsons, EBI] Software using multiple alternative sequence assembly "engines" writing to a common format file [Staden, Cambridge] (http://www.mrc-lmb.cam.ac.uk/pubseq/index.html). Phrap,(http://bozeman.genome.washington.edu/phrap.docs/phrap.html) Assembler (TIGR) for EST and Microbial whole-genome assembly (http://www.tigr.org/softlab/) FAK2 and FAKtory (http://www.cs.arizona.edu/people/gene/) [Myers] GCG (http://www.gcg.com) Falcon [Gryan, Harvard] fast (rascal.med.harvard.edu/gryan/falcon/) SPACE, SPASS [Lawrence Berkeley Labs] (http://wwwhgc.lbl.gov/inf/space.html) CAP 2 [Huang] (http://www.tigem.it/assembly/capdoc.html) 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 26

Theory of Evolution Charles Darwin 1858-59: Origin of Species 5 year voyage of H.M.S. Beagle (1831-36) Populations have variations. Natural Selection & Survival of the fittest: nature selects best adapted varieties to survive and to reproduce. Speciation arises by splitting of one population into subpopulations. Gregor Mendel and his work (1856-63) on inheritance. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 27

10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 28

Dominant View of Evolution All existing organisms are derived from a common ancestor and that new species arise by splitting of a population into subpopulations that do not cross-breed. Organization: Directed Rooted Tree; Existing species: Leaves; Common ancestor species (divergence event): Internal node; Length of an edge: Time. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 29

Phylogeny 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 30

Constructing Evolutionary/Phylogenetic Trees 2 broad categories: Distance-based methods Ultrametric Additive: UPGMA Transformed Distance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood Bayesian Methods 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 31

Ultrametric An ultrametric tree: decreasing internal node labels distance between two nodes is label of least common ancestor. An ultrametric distance matrix: Symmetric matrix such that for every i, j, k, there is tie for maximum of D(i,j), D(j,k), D(i,k) D ij, D ik D jk i j k 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 32

Ultrametric: Assumptions Molecular Clock Hypothesis, Zuckerkandl & Pauling, 1962: Accepted point mutations in amino acid sequence of a protein occurs at a constant rate. Varies from protein to protein Varies from one part of a protein to another 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 33

Ultrametric Data Sources Lab-based methods: hybridization Take denatured DNA of the 2 taxa and let them hybridize. Then measure energy to separate. Sequence-based methods: distance 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 34

Ultrametric: Example A B C D E F G H A 0 4 3 4 5 4 3 4 B C D E F G H E B,D,F,H C,G 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 35 A 3 5 4

Ultrametric: Example A B C D E F G H A 0 4 3 4 5 4 3 4 B 0 4 2 5 1 4 4 C D E F G H 5 4 A C,G B F E H D 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 36 3 2 1

Ultrametric: Distances Computed A B C D E F G H A 0 4 3 4 5 4 3 4 B 0 4 2 5 1 4 4 C 2 D E F G H 5 4 A C,G B F E H D 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 37 3 2 1

Additive-Distance Trees Additive distance trees are edge-weighted trees, with distance between leaf nodes are exactly equal to length of path between nodes. A B C D A 0 3 7 9 B 0 6 8 A 2 3 4 D C 0 6 D 0 B 1 2 C 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 38

Unrooted Trees on 4 Taxa A C B D A C A B D B C D 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 39

Four-Point Condition If the true tree is as shown below, then 1. d AB + d CD < d AC + d BD, and 2. d AB + d CD < d AD + d BC A C B D 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 40

Unweighted pair-group method with arithmetic means (UPGMA) A B C AB C B d AB C d (AB)C C d AC d BC D d (AB)D d CD D d AD d BD d CD d (AB)C = (d AC + d BC ) /2 d AB /2 A B 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 41

Transformed Distance Method UPGMA makes errors when rate constancy among lineages does not hold. Remedy: introduce an outgroup & make n corrections DkO D ' = Now apply UPGMA ij D ij D 2 io D jo + k = 1 n 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 42

Saitou & Nei: Neighbor-Joining Method Start with a star topology. Find the pair to separate such that the total length of the tree is minimized. The pair is then replaced by its arithmetic mean, and the process is repeated. S 12 = D 2 12 + 1 2( n 2) n ( D + D + 1 1k 2k k = 3 ( n ) 2) D 3 i j n ij 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 43

Neighbor-Joining 1 2 n n 3 3 2 1 = + + + = n j i ij n k k k D n D D n D S 3 3 2 1 12 12 2) ( 1 ) ( 2) 2( 1 2 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 44

Constructing Evolutionary/Phylogenetic Trees 2 broad categories: Distance-based methods Ultrametric Additive: UPGMA Transformed Distance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood Bayesian Methods 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 45

Character-based Methods Input: characters, morphological features, sequences, etc. Output: phylogenetic tree that provides the history of what features changed. [Perfect Phylogeny Problem] one leaf/object, 1 edge per character, path changed traits 1 2 3 4 5 A 1 1 0 0 0 3 2 B 0 0 1 0 0 4 1 C 1 1 0 0 1 D B E 5 D 0 0 1 1 0 E 0 1 0 0 0 A C 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 46

Example Perfect phylogeny does not always exist. 1 2 3 4 5 A 1 1 0 0 0 B 0 0 1 0 1 C 1 1 0 0 1 D 0 0 1 1 0 E 0 1 0 0 1 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 47

Maximum Parsimony Minimize the total number of mutations implied by the evolutionary history 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 48