Lecture 15: Programming Example: TASEP

Similar documents
Videos. Bozeman, transcription and translation: Crashcourse: Transcription and Translation -

1. In most cases, genes code for and it is that

Practical Bioinformatics

BME 5742 Biosystems Modeling and Control

NSCI Basic Properties of Life and The Biochemistry of Life on Earth

Lesson Overview. Ribosomes and Protein Synthesis 13.2

Supplementary Information for

GCD3033:Cell Biology. Transcription

Lecture 7: Simple genetic circuits I

Molecular Biology - Translation of RNA to make Protein *

Computational Biology: Basics & Interesting Problems

SUPPLEMENTARY DATA - 1 -

A Simple Protein Synthesis Model

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

Section 7. Junaid Malek, M.D.

PROTEIN SYNTHESIS INTRO

Chapter

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

RNA & PROTEIN SYNTHESIS. Making Proteins Using Directions From DNA

More Protein Synthesis and a Model for Protein Transcription Error Rates

Gene Switches Teacher Information

Name Period The Control of Gene Expression in Prokaryotes Notes

CCHS 2015_2016 Biology Fall Semester Exam Review

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Translation Part 2 of Protein Synthesis

The Making of the Fittest: Evolving Switches, Evolving Bodies

Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:

Crick s early Hypothesis Revisited

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

From Gene to Protein

UNIT 5. Protein Synthesis 11/22/16

Chapter 17. From Gene to Protein. Biology Kevin Dees

4. Why not make all enzymes all the time (even if not needed)? Enzyme synthesis uses a lot of energy.

Advanced topics in bioinformatics

Gene Regulation and Expression

Nucleus. The nucleus is a membrane bound organelle that store, protect and express most of the genetic information(dna) found in the cell.

Lecture 5. How DNA governs protein synthesis. Primary goal: How does sequence of A,G,T, and C specify the sequence of amino acids in a protein?

Laith AL-Mustafa. Protein synthesis. Nabil Bashir 10\28\ First

Translation. Genetic code

SUPPORTING INFORMATION FOR. SEquence-Enabled Reassembly of β-lactamase (SEER-LAC): a Sensitive Method for the Detection of Double-Stranded DNA

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Controlling Gene Expression

From DNA to protein, i.e. the central dogma

ومن أحياها Translation 2. Translation 2. DONE BY :Nisreen Obeidat

Organic Chemistry Option II: Chemical Biology

CCHS 2016_2017 Biology Fall Semester Exam Review

Topic 4 - #14 The Lactose Operon

Computational Cell Biology Lecture 4

Types of RNA. 1. Messenger RNA(mRNA): 1. Represents only 5% of the total RNA in the cell.

Clay Carter. Department of Biology. QuickTime and a TIFF (Uncompressed) decompressor are needed to see this picture.

GENETICS - CLUTCH CH.12 GENE REGULATION IN PROKARYOTES.

TRANSCRIPTION VS TRANSLATION FILE

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications

Lecture 25: Protein Synthesis Key learning goals: Be able to explain the main stuctural features of ribosomes, and know (roughly) how many DNA and

RNA Processing: Eukaryotic mrnas

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

Old FINAL EXAM BIO409/509 NAME. Please number your answers and write them on the attached, lined paper.

13.4 Gene Regulation and Expression

Biology 112 Practice Midterm Questions

Lecture 5: Processes and Timescales: Rates for the fundamental processes 5.1

Unit Test on Cell Biology please read carefully and double-check your work! LO: Describe and explain the Central Dogma. SLE: Meet NGSS.

What Kind Of Molecules Carry Protein Assembly Instructions From The Nucleus To The Cytoplasm

Supporting Information for. Initial Biochemical and Functional Evaluation of Murine Calprotectin Reveals Ca(II)-

Supplemental Table 1. Primers used for cloning and PCR amplification in this study

Chapter 6.2. p

Biology 2018 Final Review. Miller and Levine

Genetics 304 Lecture 6

ومن أحياها Translation 1. Translation 1. DONE BY :Maen Faoury

Regulation of Gene Expression *

15.2 Prokaryotic Transcription *

CHAPTER 3. Cell Structure and Genetic Control. Chapter 3 Outline

Control of Gene Expression in Prokaryotes

Turns sunlight, water & carbon dioxide (CO 2 ) into sugar & oxygen through photosynthesis

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

Reading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday

Multiple Choice Review- Eukaryotic Gene Expression

Name: Monster Synthesis Activity

Chapters 12&13 Notes: DNA, RNA & Protein Synthesis

2011 The Simple Homeschool Simple Days Unit Studies Cells

Supplemental data. Pommerrenig et al. (2011). Plant Cell /tpc

Encoding of Amino Acids and Proteins from a Communications and Information Theoretic Perspective

O 3 O 4 O 5. q 3. q 4. Transition

Chapter 15 Active Reading Guide Regulation of Gene Expression

Evolutionary Analysis of Viral Genomes

Honors Biology Reading Guide Chapter 11

CS-E5880 Modeling biological networks Gene regulatory networks

From gene to protein. Premedical biology

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Practice Test on Cell Biology (the REAL test is on Friday the 17th) KEY LO: Describe and explain the Central Dogma. SLE: Meet NGSS.

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Degeneracy. Two types of degeneracy:

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

CCHS 2014_2015 Biology Fall Semester Exam Review

Characterization of Pathogenic Genes through Condensed Matrix Method, Case Study through Bacterial Zeta Toxin

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

GENE REGULATION AND PROBLEMS OF DEVELOPMENT

Transcription:

Carl Kingsford, 0-0, Fall 0 Lecture : Programming Example: TASEP The goal for this lecture is to implement a reasonably large program from scratch. The task we will program is to simulate ribosomes moving along a mrna transcript during the process of translation. The TASEP model The TASEP model is a model used to simulate systems such as traffic, queues, and ribosomes. The model L uses a -dimensional grid of a fixed, finite length. Particles (such as cars) enter at the left end of the grid. They move, at rates that depend on their location, to the right until the fall off the grid. Only one particle can occupy any grid position at any time. A particle cannot move forward until the space it would move into is empty. For example: Here, the solid circles represent positions occupied by items. The item at position can't move until the item at position moves. TASEP stands for the features of this model: Totally Asymmetric: particles can only move forward Simple Exclusion: only one particle can be at each grid point Process: it describes dynamics Translation DNA encodes for genes, which are subregions of the DNA molecule. These subregions are transcribed into messenger RNA (mrna) molecules during a process called transcription which is the first step toward making a protein corresponding to the gene.

Following transcription, molecular machiens called ribosomes walk down the mrna molecule translating it into a protein. The genetic code is a mapping between three-letter-long "words" of RNA called codons to amino acids. A ribosome binds to the mrna at the start of the transcript, moves down the transcript one codon at a time (and never backs up). The speed at which the ribosome moves is likely somewhat dependent on the type of codon it is translating. At the end of the transcript ribosomes fall off. A ribosome binding to the start of the transcript is called initiation. Many ribosomes can be translating a mrna molecule at the same time, but no two ribosomes can occupy the same place on the mrna. The picture is this: where the red blobs are the ribosomes translating the mrna (purple dashed line). The chains of circles attached to the ribosomes are the proteins being created. Translation looks a lot like the idealized TASEP model. We will simulate ribosome motion using this model. The simulator We will simulate for a given number of time steps. t Input data: The codon-specific rates will be given in a file formatted as below:

7 init 00 TTT TTC 0 TTA 70 TTG TCT 0 TCC The first column gives the codon, the second column gives the number of time steps that a ribosome must wait at a given codon. The special "codon" init is the time between initiation events. Denote by rate["ttt"] the delay for codon type TTT. A ribosome must wait rate[x] timesteps before moving if it is sitting at a codon of type x. The sequence of the mrna will be provided in a standard format called FASTA, which looks like this: 7 8 >YAL00C TFC SGDID:S00000000, Chr I from 00-79,- 097, Genome Release - ATGGTACTGACGATTTATCCTGACGAACTCGTACAAATAGTGTCTGATAAAATTGCTTCA AATAAGGGAAAAATCACTTTGAATCAGCTGTGGGATATATCTGGTAAATATTTTGATTTG TCTGATAAAAAAGTTAAACAGTTCGTGCTTTCATGCGTGATATTGAAAAAGGACATTGAG GTGTATTGTGATGGTGCTATAACAACTAAAAATGTGACTGATATTATAGGCGACGCTAAT CATTCATACTCGGTTGGGATTACTGAGGACAGCCTATGGACATTATTAACGGGATACACA AAAAAGGAGTCAACTATTGGAAATTCTGCATTTGAACTACTTCTCGAAGTTGCCAAATCA GGAGAAAAAGGGATCAATACTATGGATTTGGCGCAGGTAACTGGGCAAGATCCTAGAAGT Output: We would like to know a profile of where ribosomes were sitting over the course of the simulation. To that end, we will produce output that looks like that:

7 8 9 0 0 ATG 9980 ( 0) GTA 9978 ( ) CTG 9970 ( ) ACG 9977 ( ) ATT 9989 ( ) TAT 998000 ( 0) CCT 99808 ( 0) 7 GAC 9990 ( 0) 8 GAA 998970 ( 0) 9 CTC 99890 ( 0) 0 GTA 998790 ( ) CAA 99888 ( 90) What happens at a timestep of the simulation: At each time step, all the ribosomes that can move should move. A ribosome at codon-type x can move if it's waited rate[x] timesteps and if there is no ribosome at the next codon. If there is a chain of ribosomes that are ready to move at a timestep, then they all should move: In the above example, the ribosomes at positions,, and should all move at time 0 one space to the right. Strategy for the Code We will following the following steps to write the code:. Write code to read in the sequence and the rates --- starting with reading your data is a good way to make early progress.. We will then decide on a data model. Something like:

type Transcript []bool // t[i] == true if posn i has ribosome type Ribosome struct { posn int timecanmove int }. We will then switch to a top-down design approach where we write the outer loop of the simulation. Something like: for curtime := ; curtime < maxtime; curtime++ { // find a ribosomes to move // move them // initiate a ribosome if we can // update our profile of where the ribosomes are } The code we came up with is available separately. Ribo-seq The experimental protocol called ribo-seq measures the positions of ribosomes that are translating mrna. It works by:. Attaching the ribosomes where they are onto the mrna. Degrading all the mrna that is not protected by a ribosome. Removing the ribosomes from the mrna they are stuck to. Sequencing the mrna that was protected by the ribosomes. This gives you sequences of the the ``ribosome footprints''. Mapping these sequences to the know sequences of the genes The above approach produces profiles of ribosome locations:

This profile is similar to that produced by our simulation (except that the above figure shows that sometimes ribosomes are captured in between codons --- the large peaks are the codon peaks and the smaller peaks are ribosomes that look like they were in between codons when they were frozen). Exercises Extend our simulator to: draw a graph of ribosome positions to not waste time "simulating" for timesteps in which nothing moves automatically find rates that match an input profile (hard) Summary Translation can be simulated using the TASEP model This is a moderately sophistcated program (that needn't use anything we haven't already seen in depth)

How fast ribosomes move and their dynamics is an active research area Glossary transcription: the process of producing an mrna template from the genome. translation: the process of constructing a protein from an mrna template. ribosome: a molecular machine that reads an mrna template and produces a protein from it initiation: the process by which a ribosome bind to an mrna and starts translating ribo-seq: experimental technique to measure ribosome locations codon: a -letter code for an amino acid genetic code: the complete set of mapping from -letter codes to amino acids