Prof. Christian MICHEL

1 CIRCULAR CODES IN GENES AND GENOMES
Prof. Christian MICHEL, Theoretical Bioinformatics, ICube, University of Strasbourg, CNRS, France
c.michel@unistra.fr

2 Biological recall: DNA
Alphabet: $A_4 = \{A, C, G, T\}$
Double helix
Complementary pairing: A-T and C-G
Antiparallel

3 Biological recall: complementary trinucleotide
Complementary map: $C$
Complementary nucleotide: $C(A) = T$ and $C(T) = A$, $C(C) = G$ and $C(G) = C$
Complementary trinucleotide: for $w_0 = l_0 l_1 l_2$ with $l_0 l_1 l_2 \in A_4^3$, $C(w_0) = C(l_2)C(l_1)C(l_0)$, e.g. $C(ACG) = CGT$
Extension to a complementary trinucleotide set
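The complementary map can be sketched in a few lines of Python. The function names `complement_word` and `complement_set` are illustrative choices, not names used in the slides.

```python
# Complement of each nucleotide, as defined on the slide.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def complement_word(w):
    """Return C(w): complement every letter and reverse the order."""
    return "".join(COMPLEMENT[letter] for letter in reversed(w))

def complement_set(words):
    """Extend C to a set of trinucleotides."""
    return {complement_word(w) for w in words}

print(complement_word("ACG"))  # CGT
```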

4 Biological recall: permuted trinucleotide
Permutation map: $P$
Permuted trinucleotide: for $w_0 = l_0 l_1 l_2$ with $l_0 l_1 l_2 \in A_4^3$, $P(w_0) = w_1 = l_1 l_2 l_0$ and $P(P(w_0)) = P(w_1) = w_2 = l_2 l_0 l_1$, e.g. $P(ACG) = CGA$ and $P(P(ACG)) = P(CGA) = GAC$
Extension to a permuted trinucleotide set
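A minimal sketch of the permutation map, with the illustrative names `permute` and `permute_set`:

```python
def permute(w):
    """Return P(w): move the first letter to the end (l0 l1 l2 -> l1 l2 l0)."""
    return w[1:] + w[:1]

def permute_set(words):
    """Extend P to a set of trinucleotides."""
    return {permute(w) for w in words}

print(permute("ACG"), permute(permute("ACG")))  # CGA GAC
```

Applying `permute` three times to a trinucleotide returns the original word, so each trinucleotide other than AAA, CCC, GGG and TTT belongs to a class of exactly three permuted trinucleotides.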

5 Biological recall: 3 frames in genes
Frame 0: Reading frame established by a start codon {ATG, GTG, TTG}
Frame 1: Frame 0 shifted by 1 nucleotide in the 5'-3' direction
Frame 2: Frame 0 shifted by 2 nucleotides in the 5'-3' direction
Example sequence with the 3 frames marked: A T G A C G G T A C G A T T G...
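The three frames can be illustrated on the example sequence of this slide. The helper `read_frame` is a hypothetical name; it keeps only complete trinucleotides.

```python
def read_frame(seq, frame):
    """Split seq into complete trinucleotides, starting `frame` letters after frame 0."""
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]

seq = "ATGACGGTACGATTG"
for frame in (0, 1, 2):
    print(frame, read_frame(seq, frame))
```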

6 Result 1: The distribution of the 64 trinucleotides in the 3 frames of genes (prokaryotes, eukaryotes) is not uniform: 3 sets of trinucleotides are identified
Trinucleotide frequencies per frame (Arquès, Michel, 1996)
Correlation functions per frame (Arquès, Michel, 1997)
Frame permuted trinucleotide frequencies (Frey, Michel, 2003, 2006)
Covering function (Gonzalez, Giannerini, Rosa, 2011)

7 Trinucleotide frequencies per frame
[Figure: trinucleotide frequencies in frame 0, frame 1 and frame 2, with the frames marked on the sequence A T G A C G G T A C G A T T G...]

8 Identification of 3 sets of trinucleotides per frame in prokaryotes and eukaryotes

9 Result 2 (Arquès, Michel, 1996, 1997): Mathematical properties of $X_0$, $X_1$ and $X_2$
Complementary property $C$
$C(X_0) = X_0$: $X_0$ is self-complementary
$C(X_1) = X_2$ and $C(X_2) = X_1$: $X_1$ and $X_2$ are complementary to each other
Permutation property $P$
$P(X_0) = X_1$ and $P(X_1) = X_2$: $X_1$ and $X_2$ are deduced from $X_0$ by permutation
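These properties can be checked mechanically once the 20 trinucleotides of $X_0$ are written down. The transcript does not list them (the figure is lost), so the set below is an assumption: it is the list usually reported for Arquès and Michel (1996). The names `C` and `P` mirror the slide notation.

```python
# Assumed list of the 20 trinucleotides of X0 (not present in this transcript).
X0 = set("AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
         "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split())

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def C(words):
    """Complementary map applied to a set of trinucleotides."""
    return {"".join(COMPLEMENT[l] for l in reversed(w)) for w in words}

def P(words):
    """Permutation map applied to a set of trinucleotides."""
    return {w[1:] + w[:1] for w in words}

X1 = P(X0)
X2 = P(X1)

print(C(X0) == X0)                # self-complementary
print(C(X1) == X2, C(X2) == X1)   # complementary to each other
print(len(X0 | X1 | X2))          # number of trinucleotides covered by the 3 sets
```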

10 Result 2 (Arquès, Michel, 1996, 1997): Mathematical properties of $X_0$, $X_1$ and $X_2$

11 Result 3 (Arquès, Michel, 1996, 1997): $X_0$, $X_1$ and $X_2$ are trinucleotide circular codes
$X_0$ is able to retrieve the reading frame 0
$X_1$ is able to retrieve the frame 1
$X_2$ is able to retrieve the frame 2

12 Code, comma-free code
$Y = \{A, GC, AGC\}$ is not a code, as $A \cdot GC = AGC$
$A_4^3 = \{AAA, \ldots, TTT\}$ (genetic code) is a code
$A_4^3$ is not a comma-free code: A CGA CG = ACG ACG
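Both notions can be tested with short functions. This is a sketch under the usual definitions: a set is a code when every word has at most one factorization, and a trinucleotide set is comma-free when none of its words appears in a shifted frame of two concatenated words. The function names are illustrative.

```python
from itertools import product

def count_factorizations(word, code):
    """Count the ways to write `word` as a concatenation of words of `code`."""
    ways = [1] + [0] * len(word)          # ways[i]: factorizations of word[:i]
    for i in range(1, len(word) + 1):
        for x in code:
            if i >= len(x) and word[i - len(x):i] == x:
                ways[i] += ways[i - len(x)]
    return ways[-1]

def is_comma_free(code):
    """True if no word of `code` occurs at shift 1 or 2 inside x + y, for x, y in `code`."""
    code = set(code)
    for x, y in product(code, repeat=2):
        xy = x + y
        if xy[1:4] in code or xy[2:5] in code:
            return False
    return True

A43 = {"".join(t) for t in product("ACGT", repeat=3)}
print(count_factorizations("AGC", {"A", "GC", "AGC"}))  # 2, so Y is not a code
print(is_comma_free(A43))                               # False
```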

13 Circular code
For words of length 3 over a 4-letter alphabet (trinucleotides), the maximal size of a circular code is 20 words

14 Circular code: proof
Flower automaton (Lassez, 1976; Berstel, Perrin, 1985; Arquès, Michel, 1996, 1997)
Necklaces 5LDCN (Letter Diletter Continued Necklace) (Pirillo, 2003) and nLDCCN (Letter Diletter Continued Closed Necklace) with $n \in \{2, 3, 4, 5\}$ (Michel, Pirillo, 2010)
Result 4 (Lacan, Michel, 2001): Proof that the probabilistic model based on the nucleotide frequencies (Koch, Lehmann, 1997) is incomplete for constructing circular codes; in particular, it cannot generate $X_0$ (cannot generate the trinucleotides )

15 Circular code
The decomposition of any word of a circular code $Y$ written on a circle is unique
Is $Y = \{GCG, CGC\}$ a circular code? The word GCGCGC written on a circle has 2 decompositions, $w = GCG \cdot CGC$ and $w = CGC \cdot GCG$: $Y$ is not a circular code
$Y = \{GGC, CGG\}$ is a circular code
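The definition can be turned into a brute-force check for trinucleotide sets. The sketch below is a bounded test, not the proof techniques cited on the previous slide: it concatenates up to `max_words` words, closes the result into a circle, and looks for a second decomposition starting 1 or 2 letters later. A result of `True` only means that no counterexample was found up to that length. The name `is_circular_up_to` and the second example set are illustrative.

```python
from itertools import product

def is_circular_up_to(code, max_words=3):
    """Bounded circularity test for a set of trinucleotides."""
    members = set(code)
    words = sorted(members)
    for n in range(1, max_words + 1):
        for combo in product(words, repeat=n):
            w = "".join(combo)
            for shift in (1, 2):
                r = w[shift:] + w[:shift]   # same circle, read from another start
                if all(r[i:i + 3] in members for i in range(0, len(r), 3)):
                    return False            # a second decomposition exists
    return True

print(is_circular_up_to({"GCG", "CGC"}))  # False: GCG.CGC and CGC.GCG
print(is_circular_up_to({"AAC", "GTT"}))  # True: no second decomposition found
```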

16 Circular code
Generation of a word from the circular code $X_0$

17 Circular code
Generation of a word from the circular code $X_0$:
GAA,GAG,GTA,GTA,ACC,AAT,GTA,CTC,TAC,TTC,ACC,ATC
Then, the trinucleotides in frame 1 mainly belong to $X_1$ (the figure marks the exceptions: one trinucleotide of $X_2$ and one of $X_0$):
G,AAG,AGG,TAG,TAA,CCA,ATG,TAC,TCT,ACT,TCA,CCA,TC
Then, the trinucleotides in frame 2 mainly belong to $X_2$ (the figure marks the exceptions: two trinucleotides of $X_0$ and one of $X_1$):
GA,AGA,GGT,AGT,AAC,CAA,TGT,ACT,CTA,CTT,CAC,CAT,C
In frame 1: 75.4 % of $X_1$, 11.9 % of $X_0$ and 12.7 % of $X_2$
In frame 2: 75.4 % of $X_2$, 11.9 % of $X_0$ and 12.7 % of $X_1$
In a comma-free code $Y_0$, the trinucleotides of $Y_0$ do not occur in the shifted frames 1 and 2
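The frame behaviour of this generated word can be reproduced with a short count. The list of $X_0$ is again an assumption (it is not given in this transcript), and `frame_counts` is a hypothetical helper that ignores the incomplete trinucleotides at both ends.

```python
from collections import Counter

# Assumed list of the 20 trinucleotides of X0 (not present in this transcript).
X0 = set("AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
         "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split())
X1 = {w[1:] + w[:1] for w in X0}   # X1 = P(X0)
X2 = {w[1:] + w[:1] for w in X1}   # X2 = P(X1)

def frame_counts(seq, frame):
    """Count the complete trinucleotides of `seq` in `frame` by the set they belong to."""
    counts = Counter()
    for i in range(frame, len(seq) - 2, 3):
        w = seq[i:i + 3]
        label = "X0" if w in X0 else "X1" if w in X1 else "X2" if w in X2 else "other"
        counts[label] += 1
    return counts

word = "".join("GAA GAG GTA GTA ACC AAT GTA CTC TAC TTC ACC ATC".split())
for frame in (0, 1, 2):
    print(frame, dict(frame_counts(word, frame)))
```

On this short word, frame 1 contains 9 trinucleotides of $X_1$ out of 11 and frame 2 contains 8 trinucleotides of $X_2$ out of 11, which matches the exceptions marked on the slide.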

18 Result 5 (Arquès, Michel, 1996, 1997): $X_0$ is a $C^3$ self-complementary trinucleotide circular code
$X_0$, $X_1 = P(X_0)$ and $X_2 = P(X_1)$ are maximal (20) trinucleotide circular codes
$C(X_0) = X_0$, $C(X_1) = X_2$ and $C(X_2) = X_1$
Remark: if $X_0$ is a circular code then $X_1 = P(X_0)$ and $X_2 = P(X_1)$ are not necessarily circular codes
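The $C^3$ property and the remark can both be illustrated with a bounded brute-force test. Everything here is a sketch: the list of $X_0$ is assumed (it is not in this transcript), the test only searches circular words of up to `max_words` trinucleotides, and the small set `Y` is an illustrative example chosen for this note, not one taken from the slides.

```python
from itertools import product

def is_circular_up_to(code, max_words=3):
    """Bounded test: look for a second decomposition on the circle at shift 1 or 2."""
    members = set(code)
    words = sorted(members)
    for n in range(1, max_words + 1):
        for combo in product(words, repeat=n):
            w = "".join(combo)
            for shift in (1, 2):
                r = w[shift:] + w[:shift]
                if all(r[i:i + 3] in members for i in range(0, len(r), 3)):
                    return False
    return True

def P(words):
    """Permutation map applied to a set of trinucleotides."""
    return {w[1:] + w[:1] for w in words}

def is_c3_up_to(code, max_words=3):
    """Bounded C^3 test: the code and its two permuted sets all pass the test."""
    x1 = P(code)
    x2 = P(x1)
    return all(is_circular_up_to(x, max_words) for x in (code, x1, x2))

# Assumed list of the 20 trinucleotides of X0.
X0 = set("AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
         "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split())
Y = {"GGC", "CCG"}   # illustrative: P(Y) = {GCG, CGC}

print(is_c3_up_to(X0))                                # True
print(is_circular_up_to(Y), is_circular_up_to(P(Y)))  # True False
```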

19 Result 6 (Michel, Pirillo, Pirillo, 2008): Growth function of comma-free codes
Result 7 (Michel, Pirillo, Pirillo, 2008): Growth function of $C^3$ self-complementary comma-free codes
Result 8 (Michel, Pirillo, 2010): Growth function of circular codes

20 Result 9 (Arquès, Michel, 1996, 1997): Classes of maximal (20) circular codes
Number of potential circular codes: $3^{20}$
Number of circular codes:
Number of $C^3$ codes:
Number of $C^3$ self-complementary codes: 216
Occurrence probability of $X_0$ in genes: $216 / 3^{20}$
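The two expressions on this slide can be evaluated directly. The count $3^{20}$ arises because the 60 trinucleotides other than AAA, CCC, GGG and TTT fall into 20 classes of 3 permuted trinucleotides, and a maximal candidate takes one trinucleotide from each class. The variable names are illustrative.

```python
from fractions import Fraction

potential = 3 ** 20                      # one trinucleotide chosen in each of the 20 classes
probability = Fraction(216, potential)   # 216 C^3 self-complementary codes among them

print(potential)                         # 3486784401
print(f"{float(probability):.2e}")       # 6.19e-08
```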

21 Result 10 (Michel, Pirillo, 2011; Michel, Pirillo, Pirillo, 2012): Hierarchy of maximal (20) circular codes
Strong: comma-free codes
Weak

22 Result 11 (Bussoli, Michel, Pirillo, 2011, 2012): Self-complementary maximal (20) circular codes

23 Result 12 (Benard, Michel, 2013): Transversion II on the three positions of any subset of trinucleotides of the circular code $X_0$ yields no circular code

24 Result 13 (Benard, Michel, 2013): Transversion I on the 2nd position of any subset of trinucleotides of the circular code $X_0$ yields circular codes which are always $C^3$

25 Result 14 (Michel, Pirillo, 2013): Dinucleotide circular codes

26 Result 15 (Michel, Pirillo, 2013): List of the 24 dinucleotide circular codes
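The count of 24 can be reproduced by enumeration, assuming that the slide refers to the maximal dinucleotide circular codes (6 dinucleotides, one from each pair of permuted dinucleotides such as AC and CA). The sketch checks circular words of up to 3 dinucleotides. This is enough for these candidates because any second decomposition implies a cycle among the 4 letters, and with one dinucleotide per pair such a cycle always contains a cycle of length 3. The function name is illustrative.

```python
from itertools import product

def is_circular_dinucleotide(code, max_words=3):
    """Bounded circularity test for a set of dinucleotides (shift by 1 letter)."""
    members = set(code)
    words = sorted(members)
    for n in range(1, max_words + 1):
        for combo in product(words, repeat=n):
            w = "".join(combo)
            r = w[1:] + w[:1]
            if all(r[i:i + 2] in members for i in range(0, len(r), 2)):
                return False
    return True

# The 6 pairs of permuted dinucleotides (AA, CC, GG and TT are excluded).
pairs = [(a + b, b + a) for i, a in enumerate("ACGT") for b in "ACGT"[i + 1:]]
candidates = [set(choice) for choice in product(*pairs)]
circular = [c for c in candidates if is_circular_dinucleotide(c)]

print(len(candidates), len(circular))  # 64 24
```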

27 Result 16 (Arquès, Michel, 1996, 1997): The circular code $X_0$ codes 12 amino acids
Result 17 (Michel, Pirillo, 2013):
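Result 16 can be checked by translating the trinucleotides of $X_0$ with the standard genetic code. The list of $X_0$ is an assumption (it is not given in this transcript); the amino acid assignments are those of the standard genetic code, written on the DNA alphabet.

```python
# Assumed trinucleotides of X0 with their amino acids in the standard genetic code.
X0_CODONS = {
    "AAC": "Asn", "AAT": "Asn", "ACC": "Thr", "ATC": "Ile", "ATT": "Ile",
    "CAG": "Gln", "CTC": "Leu", "CTG": "Leu", "GAA": "Glu", "GAC": "Asp",
    "GAG": "Glu", "GAT": "Asp", "GCC": "Ala", "GGC": "Gly", "GGT": "Gly",
    "GTA": "Val", "GTC": "Val", "GTT": "Val", "TAC": "Tyr", "TTC": "Phe",
}

amino_acids = sorted(set(X0_CODONS.values()))
print(len(amino_acids), amino_acids)
```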

28 Result 18 (Arquès, Michel, 1996, 1997): The comma-free code $RNY = \{RRY, RYY\}$ deduced from $X_0$
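The comma-free property of RNY is easy to verify by enumeration. The sketch assumes the usual IUPAC reading of the letters: R is a purine (A or G), Y is a pyrimidine (C or T) and N is any nucleotide. The function name is illustrative.

```python
from itertools import product

R, Y, N = "AG", "CT", "ACGT"   # IUPAC classes: purine, pyrimidine, any nucleotide

RNY = {"".join(t) for t in product(R, N, Y)}
RRY = {"".join(t) for t in product(R, R, Y)}
RYY = {"".join(t) for t in product(R, Y, Y)}

def is_comma_free(code):
    """True if no word of `code` occurs at shift 1 or 2 inside x + y, for x, y in `code`."""
    code = set(code)
    for x, y in product(code, repeat=2):
        xy = x + y
        if xy[1:4] in code or xy[2:5] in code:
            return False
    return True

print(len(RNY), RNY == RRY | RYY, is_comma_free(RNY))  # 16 True True
```

In a concatenation of two RNY words, a shifted trinucleotide either ends with a purine (shift 1) or starts with a pyrimidine (shift 2), so it cannot have the form RNY.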

29 Result 19 (Arquès, Lacan, Michel, 2002): Identification of genes in genomes with statistical functions based on the circular code $X_0$

30 Result 19 (Arquès, Lacan, Michel, 2002): Identification of genes in genomes with statistical functions based on the circular code $X_0$

31 Frameshift genes

32 Result 20 (Ahmed, Frey, Michel, 2007): Loss of the signal of the circular code $X_0$ at the frameshift site of frameshift genes

33 Result 21 (Ahmed, Michel, 2011): Shift of the signal of the circular code $X_0$ at the frameshift site of frameshift genes (+1)

34 Result 21 (Ahmed, Michel, 2011): Shift of the signal of the circular code $X_0$ at the frameshift site of frameshift genes (-1)
Statistical tests based on the circular code $X_0$ can be used to describe frameshift genes (Seligmann, 2012)

35 Result 22 (Ahmed, Michel, 2008): Signal of the circular code $X_0$ in the plant miRNAs

36 Result 23 (Michel, 2013): $X_0$ circular code motifs in transfer RNAs

37 Result 23 (Michel, 2013): $X_0$ circular code motifs in tRNAs of prokaryotes

38 Result 23 (Michel, 2013): $X_0$ circular code motifs in tRNAs of eukaryotes

39 Result 24 (Arquès, Fallot, Marsan, Michel, 1999; Bahi, Michel, 2008): Asymmetry between the circular codes $X_1$ and $X_2$ in genes

40 Result 25 (Michel, 2013): Asymmetry between the circular codes $X_1$ and $X_2$ in the 3 regions of tRNAs of prokaryotes and eukaryotes

41 Result 26 (Michel, 2012): A possible translation code based on the circular code $X_0$: tRNA-Phe

42 Result 26 (Michel, 2012): A possible translation code based on the circular code $X_0$: 16S rRNA

43 Result 26 (Michel, 2012): A possible translation code based on the circular code $X_0$

44 Result 26 (Michel, 2012): A possible translation code based on the circular code $X_0$

45 Result 26 (Michel, 2012): A possible translation code based on the circular code $X_0$

46 References


More information

Interpolated Markov Models for Gene Finding. BMI/CS 776 Spring 2015 Colin Dewey

Interpolated Markov Models for Gene Finding. BMI/CS 776  Spring 2015 Colin Dewey Interpolated Markov Models for Gene Finding BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2015 Colin Dewey cdewey@biostat.wisc.edu Goals for Lecture the key concepts to understand are the following the

More information

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications 1 GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications 2 DNA Promoter Gene A Gene B Termination Signal Transcription

More information

Mitochondrial Genome Annotation

Mitochondrial Genome Annotation Protein Genes 1,2 1 Institute of Bioinformatics University of Leipzig 2 Department of Bioinformatics Lebanese University TBI Bled 2015 Outline Introduction Mitochondrial DNA Problem Tools Training Annotation

More information

On the optimality of the standard genetic code: the role of stop codons

On the optimality of the standard genetic code: the role of stop codons On the optimality of the standard genetic code: the role of stop codons Sergey Naumenko 1*, Andrew Podlazov 1, Mikhail Burtsev 1,2, George Malinetsky 1 1 Department of Non-linear Dynamics, Keldysh Institute

More information

Midterm Review Guide. Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer.

Midterm Review Guide. Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer. Midterm Review Guide Name: Unit 1 : Biochemistry: 1. Give the ph values for an acid and a base. 2. What do buffers do? 3. Define monomer and polymer. 4. Fill in the Organic Compounds chart : Elements Monomer

More information

something about srna in archaea

something about srna in archaea something about srna in archaea or: Processed Small RNAs in Archaea and BHB Elements Sarah Berkemer Bioinformatics Vienzig Archaea? Sarah Berkemer (Bioinformatics Vienzig) BHB elements in Archaea 2 / 23

More information

Is inversion symmetry of chromosomes a law of nature?

Is inversion symmetry of chromosomes a law of nature? Is inversion symmetry of chromosomes a law of nature? David Horn TAU Safra bioinformatics retreat, 28/6/2018 Lecture based on Inversion symmetry of DNA k-mer counts: validity and deviations. Shporer S,

More information

Practical Bioinformatics

Practical Bioinformatics 5/2/2017 Dictionaries d i c t i o n a r y = { A : T, T : A, G : C, C : G } d i c t i o n a r y [ G ] d i c t i o n a r y [ N ] = N d i c t i o n a r y. h a s k e y ( C ) Dictionaries g e n e t i c C o

More information

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers Quiz 1 Advanced Topics in RNA and DNA DNA Microarrays Aptamers 2 Quantifying mrna levels to asses protein expression 3 The DNA Microarray Experiment 4 Application of DNA Microarrays 5 Some applications

More information

Name: Test date: Per:

Name: Test date: Per: Name: Test date: Per: Cell Cycle/Protein Synthesis Unit 1 Study Guide Always remember to use your notes/lectures first, then book, then other sources to help you find the right answers. Be as thorough

More information

Interphase & Cell Division

Interphase & Cell Division 1 Interphase & Cell Division 2 G1 = cell grows and carries out its normal job. S phase = DNA is copied (replicated/duplicated) G2 = Cell prepares for division 3 During mitosis, the nuclear membrane breaks

More information

Introduction to Polymer Physics

Introduction to Polymer Physics Introduction to Polymer Physics Enrico Carlon, KU Leuven, Belgium February-May, 2016 Enrico Carlon, KU Leuven, Belgium Introduction to Polymer Physics February-May, 2016 1 / 28 Polymers in Chemistry and

More information

From gene to protein. Premedical biology

From gene to protein. Premedical biology From gene to protein Premedical biology Central dogma of Biology, Molecular Biology, Genetics transcription replication reverse transcription translation DNA RNA Protein RNA chemically similar to DNA,

More information

Translation. A ribosome, mrna, and trna.

Translation. A ribosome, mrna, and trna. Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno

More information

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

Lecture 18 June 2 nd, Gene Expression Regulation Mutations Lecture 18 June 2 nd, 2016 Gene Expression Regulation Mutations From Gene to Protein Central Dogma Replication DNA RNA PROTEIN Transcription Translation RNA Viruses: genome is RNA Reverse Transcriptase

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models)

Regulatory Sequence Analysis. Sequence models (Bernoulli and Markov models) Regulatory Sequence Analysis Sequence models (Bernoulli and Markov models) 1 Why do we need random models? Any pattern discovery relies on an underlying model to estimate the random expectation. This model

More information

Biology Final Review

Biology Final Review Biology Final Review Complete this review on your own paper and staple your answers to this paper. Each section is worth certain number of points. You can earn up to 10 points total on the semester exam.

More information

Biology I Fall Semester Exam Review 2014

Biology I Fall Semester Exam Review 2014 Biology I Fall Semester Exam Review 2014 Biomolecules and Enzymes (Chapter 2) 8 questions Macromolecules, Biomolecules, Organic Compunds Elements *From the Periodic Table of Elements Subunits Monomers,

More information

Number of questions TEK (Learning Target) Biomolecules & Enzymes

Number of questions TEK (Learning Target) Biomolecules & Enzymes Unit Biomolecules & Enzymes Number of questions TEK (Learning Target) on Exam 8 questions 9A I can compare and contrast the structure and function of biomolecules. 9C I know the role of enzymes and how

More information

1/22/13. Example: CpG Island. Question 2: Finding CpG Islands

1/22/13. Example: CpG Island. Question 2: Finding CpG Islands I529: Machine Learning in Bioinformatics (Spring 203 Hidden Markov Models Yuzhen Ye School of Informatics and Computing Indiana Univerty, Bloomington Spring 203 Outline Review of Markov chain & CpG island

More information

LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS

LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS Why were ratios important in Mendel s work? A. They showed that heredity does not follow a set pattern. B. They showed that some traits are never passed on. C. They

More information

Biology 2018 Final Review. Miller and Levine

Biology 2018 Final Review. Miller and Levine Biology 2018 Final Review Miller and Levine bones blood cells elements All living things are made up of. cells If a cell of an organism contains a nucleus, the organism is a(n). eukaryote prokaryote plant

More information

Introduction to Hidden Markov Models for Gene Prediction ECE-S690

Introduction to Hidden Markov Models for Gene Prediction ECE-S690 Introduction to Hidden Markov Models for Gene Prediction ECE-S690 Outline Markov Models The Hidden Part How can we use this for gene prediction? Learning Models Want to recognize patterns (e.g. sequence

More information

13.4 Gene Regulation and Expression

13.4 Gene Regulation and Expression 13.4 Gene Regulation and Expression Lesson Objectives Describe gene regulation in prokaryotes. Explain how most eukaryotic genes are regulated. Relate gene regulation to development in multicellular organisms.

More information

Chapter 9 DNA recognition by eukaryotic transcription factors

Chapter 9 DNA recognition by eukaryotic transcription factors Chapter 9 DNA recognition by eukaryotic transcription factors TRANSCRIPTION 101 Eukaryotic RNA polymerases RNA polymerase RNA polymerase I RNA polymerase II RNA polymerase III RNA polymerase IV Function

More information

Translation Part 2 of Protein Synthesis

Translation Part 2 of Protein Synthesis Translation Part 2 of Protein Synthesis IN: How is transcription like making a jello mold? (be specific) What process does this diagram represent? A. Mutation B. Replication C.Transcription D.Translation

More information

Biology Semester 1 Study Guide

Biology Semester 1 Study Guide Name Per Date Biology Semester 1 Study Guide The following Gizmos meet the standards assessed by the Biology EOC and should be reviewed during the first semester: 1. Rabbit Population by Season Gizmo 2.

More information

Degeneracy. Two types of degeneracy:

Degeneracy. Two types of degeneracy: Degeneracy The occurrence of more than one codon for an amino acid (AA). Most differ in only the 3 rd (3 ) base, with the 1 st and 2 nd being most important for distinguishing the AA. Two types of degeneracy:

More information

Probabilistic modeling and molecular phylogeny

Probabilistic modeling and molecular phylogeny Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU) What is a model? Mathematical

More information

PROTEIN SYNTHESIS INTRO

PROTEIN SYNTHESIS INTRO MR. POMERANTZ Page 1 of 6 Protein synthesis Intro. Use the text book to help properly answer the following questions 1. RNA differs from DNA in that RNA a. is single-stranded. c. contains the nitrogen

More information

Algorithmics and Bioinformatics

Algorithmics and Bioinformatics Algorithmics and Bioinformatics Gregory Kucherov and Philippe Gambette LIGM/CNRS Université Paris-Est Marne-la-Vallée, France Schedule Course webpage: https://wikimpri.dptinfo.ens-cachan.fr/doku.php?id=cours:c-1-32

More information

ROBI POLIKAR. ECE 402/504 Lecture Hidden Markov Models IGNAL PROCESSING & PATTERN RECOGNITION ROWAN UNIVERSITY

ROBI POLIKAR. ECE 402/504 Lecture Hidden Markov Models IGNAL PROCESSING & PATTERN RECOGNITION ROWAN UNIVERSITY BIOINFORMATICS Lecture 11-12 Hidden Markov Models ROBI POLIKAR 2011, All Rights Reserved, Robi Polikar. IGNAL PROCESSING & PATTERN RECOGNITION LABORATORY @ ROWAN UNIVERSITY These lecture notes are prepared

More information

2. What was the Avery-MacLeod-McCarty experiment and why was it significant? 3. What was the Hershey-Chase experiment and why was it significant?

2. What was the Avery-MacLeod-McCarty experiment and why was it significant? 3. What was the Hershey-Chase experiment and why was it significant? Name Date Period AP Exam Review Part 6: Molecular Genetics I. DNA and RNA Basics A. History of finding out what DNA really is 1. What was Griffith s experiment and why was it significant? 1 2. What was

More information

In Genomes, Two Types of Genes

In Genomes, Two Types of Genes In Genomes, Two Types of Genes Protein-coding: [Start codon] [codon 1] [codon 2] [ ] [Stop codon] + DNA codons translated to amino acids to form a protein Non-coding RNAs (NcRNAs) No consistent patterns

More information

DNA Structure and Function

DNA Structure and Function DNA Structure and Function Nucleotide Structure 1. 5-C sugar RNA ribose DNA deoxyribose 2. Nitrogenous Base N attaches to 1 C of sugar Double or single ring Four Bases Adenine, Guanine, Thymine, Cytosine

More information

Genetic Code, Attributive Mappings and Stochastic Matrices

Genetic Code, Attributive Mappings and Stochastic Matrices Genetic Code, Attributive Mappings and Stochastic Matrices Matthew He Division of Math, Science and Technology Nova Southeastern University Ft. Lauderdale, FL 33314, USA Email: hem@nova.edu Abstract: In

More information