RNA Secondary Structure Prediction
|
|
- Bertram Higgins
- 5 years ago
- Views:
Transcription
1 RN Secondary Structure Prediction Perry Hooker S 531: dvanced lgorithms Prof. Mike Rosulek University of Montana December 10, 2010 Introduction Ribonucleic acid (RN) is a macromolecule that is essential to living biological systems. In addition to its role in the central dogma of molecular biology as an agent of information transfer (Fig. 1), certain specific RN molecules are not translated into proteins and instead operate functionally in a number of different capacities. In fact, recent evidence suggests that most of the genomes of mammals and other complex organisms is transcribed into noncoding RN (ncrn) [8]. Examples of ncrn molecules include ribosomal RN, a core component of the cellular protein manufacturing mechanism, small nuclear RN, which is involved in RN splicing and regulation of transcription factors, and micro RN, which is responsible for translational gene repression. The function of a given ncrn depends on the molecule s structure in Transcription Translation DN RN Protein Reverse Transcription Figure 1: The central dogma of molecular biology, showing information flow. 1
2 3-dimensional space, or tertiary structure. 1 In RN, tertiary structure is derived from secondary structure, which is defined as the set of all base-pair interactions among an RN s component nucleotides (Figs. 2, 3). 2 Primary structure refers to the sequential ordering of an RN s nucleotides. urrently, the most well-established experimental method for determining RN secondary structure involves working backward from a tertiary structure obtained via X-ray crystallography. However, experimental methods for determining RN structure are generally expensive and time-consuming. Since all of the information in a strand of RN is captured by the molecule s primary structure, the ability to determine the function of an RN molecule given only the sequence of its nucleotides is theoretically possible. Free Energy Minimization Modern techniques for RN secondary structure prediction are based on the concept of free-energy minimization. These algorithms depend on two principles: First, that the structure adopted by an RN molecule in solution is that which minimizes the total Gibbs free energy G of the solution [16], and second, that the free energy for a given secondary structure can be estimated for any sequence [7, 15]. Gibbs Free Energy Gibbs free energy is a measure of the process-initiating work obtainable from a thermodynamic system at constant temperature and pressure, and is typically expressed in terms of enthalpy H, entropy S, and absolute temperature (Eq. 1). G = H T S (1) Reactions which increase the total entropy of the universe occur spontaneously and are characterized by negative free energy values. The tendency of systems (e.g., strands of RN in solution) to adopt states which minimize G is a consequence of the Second Law of Thermodynamics. 1 The tertiary structures of DN and RN are typically quite different: while DN molecules usually exist as fully base-paired double helices, most RN is single-stranded and hence adopts highly complex 3D forms. The genesis of these complex structures is further aided by the increased ability of RN to form hydrogen bonds due to the extra hydroxyl group in the RN s ribose sugar backbone. 2 These interactions include the canonical Watson-rick base pairs & G U, as well as alternate bonding patterns like wobble bases or Hoogsteen base pairs. 2
3 Estimating G for RN free energy estimate for a given secondary structure depends on the number and type of nucleotides involved in base-pairing interactions, as well as the resultant arrangements of those nucleotides which are not base-paired. These arrangements or motifs fall into two broad categories: nested loops (Fig. 2) and pseudoknots (Fig. 3). Formally, given an RN sequence S and a set of base pairs S which define a secondary structure, nested loops satisfy i j, i j S, either i < i < j < j or i < j < i < j. By contrast, i j and i j form a pseudoknot if i < i < j < j. Base pairs reduce the total free energy of a secondary structure, and regions of contiguous base pairs (or stacked bases) further reduce the structure s total free energy. Loop regions, on the other hand, increase the total free energy of a secondary structure; different loop motifs make different contributions to the final free energy estimate (Table 1) for a given secondary structure. G estimates for various motifs are based on optical melting experiments [7]. lgorithmic Techniques Nussinov [9] and Zuker [19] produced the first polynomial-time dynamic programming algorithms for RN secondary structure prediction in the late 1970s. Today, most RN structure prediction algorithms still use some form of dynamic programming, though other approaches employing genetic algorithms have been implemented with some success [5]. Minimum free-energy secondary structures which do not include pseudoknots can be found using dynamic programming in O(N 3 ) time, where N is the number of nucleotides in the sequence [9, 19, 10, 18]. By contrast, the task of determining RN secondary structures which include arbitrary pseudoknots has been shown to be NP-complete [6, 1], though there have been successful attempts to predict pseudoknotted structures in polynomial time using a limited set of all possible pseudoknot types [14]. Dynamic Programming Without Pseudoknots The first dynamic programming algorithm for predicting RN secondary structure sought only to maximize the number of nested base-pair interactions in a sequence S composed of nucleotides S 1... S n. This maximization problem can be approached by examining subsequences S ij of length p, and 3
4 Hairpin loop G U G 30 G Bulge loop G 20 G G U G U G Interior loops U 40 G Interior loop G } U G Stacked bases 10 G U 3 5 U U G G G U Multiloop G G G G G G Exterior loop G G G U U Hairpin loop U 60 Figure 2: Secondary structure of trn-phe from Haloarcula marismortui, a single-celled archaeon found in the Dead Sea, showing the different types of nested loops and stacked base pairs. Table 1 shows the G values for the various motifs. Figure generated with mfold [20]. 4
5 Structural element δg Information External loop ss bases & 1 closing helices. Stack External closing pair is G 1-73 Stack External closing pair is 2 -G 72 Stack External closing pair is 3 -G 71 Stack External closing pair is G 4-70 Stack External closing pair is 5 -G 69 Helix base pairs. Multi-loop 1.90 External closing pair is 6 -G 68 4 ss bases & 3 closing helices. Stack External closing pair is 50 -G 66 Stack External closing pair is 51 -G 65 Stack External closing pair is 52 -G 64 Stack External closing pair is G Helix base pairs. Hairpin loop 4.40 losing pair is G Stack External closing pair is U 8 -G 47 Stack External closing pair is 9 -U 46 Stack External closing pair is G Stack External closing pair is 11 -G 44 Stack External closing pair is U Helix base pairs. Bulge loop 1.40 External closing pair is 13 -G 42 Stack External closing pair is G Helix base pairs. Bulge loop 1.60 External closing pair is 16 -U 40 Stack External closing pair is 17 -G 38 Helix base pairs. Bulge loop 3.80 External closing pair is U Stack External closing pair is G 19 -U 34 Helix base pairs. Interior loop External closing pair is G Stack External closing pair is G Helix base pairs. Hairpin loop 4.40 losing pair is 24 -U 29 Table 1: Free energy values for the structure in Figure 2. The total G for the structure is From mfold [20]. 5
6 U U U G G G U Figure 3: simple pseudoknot. incrementally increasing p at each step. t each iteration, the maximization algorithm tests the ability of each nucleotide S k : i k < j to pair with S j, and then recursively calculates the best score in the subsequences to either side of S k. The key observation is that at each successive step of the algorithm, the values of the subsequences S i(k 1) and S (k+1)j have already been computed in an earlier iteration. The general recursive formula for computing the maximum number of nested, paired bases in a sequence from i to j is given in Equation 2 [9]. M(i, j) = max i k<j L min { M(i, k 1) + M(k + 1, j 1) + 1 M(i, j 1) Here, L min is the minimum number of bases that must separate two nucleotides involved in a base-pairing interaction. 3 Refinements to this process use more detailed free-energy approximations [19]. During execution, these refined algorithms maintain two matrices of free-energy scores for a subsequence S ij - the matrix V (i, j) (Eq. 3) represents the best possible score with i and j paired to one another, and W (i, j) (Eq. 4) is the best score regardless of whether i and j are paired. The explicit calculation of energy scores on substructures in which i and j are paired enables these algorithms to incorporate energy scores for whole motifs as well as individual base-pairs. sample of the thermodynamic stacking and destabilizing energies used to calculate a free energy estimate for a given secondary structure can be found in Table 1. The terms, eh, es, and el found in Equation 3 are the energy functions for hairpin loops, stacked base pairs, and internal loops & bulges, respectively. 3 Steric constraints limit the minimum loop size to three nucleotides, and most loops include 4-8 bases. (2) 6
7 Figure 4: graphical representation of the terms used in V (i, j). Dashed lines represent a region of arbitrary secondary structure. Dotted lines separate distinct components of each term, and roughly correspond to the + operator in each expression. () The structure represented by the first term, which handles hairpin loops. (B) The relationship between stacked base pairs. () Represents internal loops; the condition i i + j j > 2 ensures that there is at least one unpaired base between either i and i or j and j. (D) generalized multiloop structure. eh(i, j) es(i, j, i + 1, j 1) + V (i + 1, j 1) V (i, j) = min min {el(i, j, i, j ) + V (i, j )} i<i <j <j i i+j j >2 min {W (i + 1, i+1<i <j i 1) + W (i, j 1)} V (i, j), W (i + 1, j), W (i, j) = min W (i, j 1), min {W (i, i i <j i ) + W (i + 1, j)} (3) (4) Figure 4 is a graphical representation of the terms used to compute V (i, j). Figures 5 & 6 show graphical representations of the recursions used to form V (i, j) and W (i, j). The algorithms described here use only a single RN sequence as input. State-of-the-art systems for RN secondary structure prediction incorporate information derived from multiple sequences. These methods use the assumption that similar RN sequences generally have similar functional roles [4], and will therefore adopt similar secondary structures even if they differ slightly in primary structure and/or are found in different organisms. Such functionally similar sequences are deemed homologous. Homologous 7
8 sequences can be identified using covariance models or other expert techniques; structure predictions which make use of this additional information are generally termed consensus models. Figure 5: Graphical representation of the general recursion for V (i, j). wavy line between bases i and j indicates i is base-paired to j [14]. Figure 6: Graphical representation of the recursion for W (i, j). dotted line from i to j means the relation between bases is unknown [14]. Dynamic Programming With Pseudoknots The dynamic programming algorithms for RN secondary structure prediction reviewed thus far depend on the notion that the free energy of an RN sequence can be expressed as an optimal combination of the free energies of motifs formed from non-overlapping subsequences of the full sequence (Fig. 7). These algorithms are efficient because at each iteration they re-use previously calculated energies of smaller motifs formed from shorter contiguous subsequences. These standard dynamic-programming algorithms thus fail to predict pseudoknots because pseudoknotted motifs are structures formed from overlapping sequences of RN (Fig. 8). During the execution of such a standard algorithm, at any given stage a minimum free energy pseudoknotted structure may exist which incorporates base-pairing interactions with nucleotides that have already been implicitly examined by earlier iterations of the algorithm. This suggests that the general class of pseudoknotted structures does not exhibit the optimal substructure property required for a successful application of dynamic programming. However, RN pseudoknots occur relatively rarely in vivo [17]. Furthermore, the set of pseudoknot configurations that have actually been observed in nature is limited. Rivas and Eddy [14] developed a dynamic programming algorithm capable of predicting all known types of RN pseudoknots in O(N 6 ) time and O(N 4 ) space; their algorithm recursively calculates pseudoknotted configurations by explicitly considering certain classes of pseudoknots along with the general class of nested loops. Theoretical bounds 8
9 Figure 7: linear representation of the trn secondary structure shown in Figure 2. rcs represent base-pair interactions. Note that every arc connecting base i to base j defines a contiguous sequence of bases Sij such that k : i < k < j, if k is paired to l then i < l < j. Generated with VRN [3]. Figure 8: linear representation of the pseudoknot in Figure 3, showing the non-nested base-pair interactions characteristic of pseudoknots. 9
10 have since been improved to O(N 4 ) time and O(N 2 ) space [12]. Heuristic approaches offer further improvement on the accuracy and efficiency of pseudoknot prediction [13], as do updates to the free energy models used for determining structure energetics [2]. Pseudoknots & omputational omplexity The problem of RN secondary structure prediction with generalized pesudoknots has been shown to be NP-complete under simplified energy models via reductions from longest common subsequence [1] and a version of 3ST where each literal occurs at most twice (2LIT3ST) [11] [6]. Both reductions use large, carefully-constructed RN sequences and energy functions. In Lyngsø and Pedersen [6], an instance φ of 2LIT3ST with c clauses and v variables is encoded as a strand s φ of infinite-alphabet 4 RN, and they show that φ has a satisfying assignment if and only if the energy E of s φ s optimal secondary structure under a specified model is strictly greater than a certain value. Their proof is outlined below: Translate each clause i = (l 1 l 2 l 3 ) into a clause substring: i = 1,1 (l 1 ) 1 1,1 1,2 (l 2 ) 1 1,1 1,2 (l 3 ) 1 1,2 Translate each variable x i which occurs twice positively and twice negatively in φ into a variable substring: V i = V 1 (x 1 ) 2 (x 1 ) 1 V 1 ( x 1 ) 2 ( x 1 ) 1 V 1 reate the sequence s φ = c V 1 V 2... V v ompute the energy E of the optimal secondary structure under the energy model E(S) = E(i j,i + 1, j 1) i j S where 1 if X and Y are complementary and do not form a pesudoknot E(X i Y j, V i+1, W j 1 ) =. with a neighboring base 0 else 4 The standard RN alphabed has four bases: Guanine, denine, ytosine, & Uracil. 10
11 If E > (3c + v) then φ is not satisfiable. The proof then demonstrates the equivalence of an infinite RN alphabet and the standard four-letter RN alphabet. Though the energy functions on which these reductions are based do not perfectly emulate real-world conditions, complexity results should remain valid as long as the natural (complex) model can be reduced to the simple model by fixing parameters [6]. References [1] Tatsuya kutsu. Dynamic programming algorithms for RN secondary structure prediction with pseudoknots. Discrete pplied Mathematics, 104(1-3):45 62, [2] Mirela S. ndronescu, ristina Pop, and nne E. ondon. Improved free energy parameters for rna pseudoknotted secondary structure prediction. RN, 16(1):26 42, [3] Kevin Darty, lain Denise, and Yann Ponty. Varna: Interactive drawing and editing of the RN secondary structure., [4] S R Eddy and R Durbin. RN sequence analysis using covariance models. Nucleic cids Research, 22(11): , Jun [5] P Gultyaev, F H van Batenburg, and W Pleij. The computer simulation of RN folding pathways using a genetic algorithm. J Mol Biol, 250(1):37 51, Jun [6] Rune B. Lyngsø and hristian N. S. Pedersen. RN pseudoknot prediction in energy based models. Journal of omputational Biology, 7:2000, [7] David H. Mathews and Michael Zuker. RN secondary structure prediction. John Wiley & Sons, Ltd, [8] John S. Mattick and Igor V. Makunin. Non-coding rna. Human Molecular Genetics, 15(suppl 1):R17 R29, 15 pril [9] R Nussinov and B Jacobson. Fast algorithm for predicting the secondary structure of single-stranded RN. Proceedings of the National cademy of Sciences of the US, 77(11): , Nov
12 [10] Ruth Nussinov, George Pieczenik, Jerrold R. Griggs, and Daniel J. Kleitman. lgorithms for loop matchings. SIM Journal on pplied Mathematics, 35(1):68 82, [11] hristos H. Papadimitriou. omputational complexity. [12] Jens Reeder and Robert Giegerich. Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics, [13] Jihong Ren, Baharak Rastegari, nne ondon, and Holger H Hoos. Hotknots: heuristic prediction of RN secondary structures including pseudoknots. RN, 11(10): , Oct [14] E Rivas and S R Eddy. dynamic programming algorithm for RN structure prediction including pseudoknots. Journal of Molecular Biology, 285(5): , Feb [15] I Jr Tinoco, P N Borer, B Dengler, M D Levin, O Uhlenbeck, D M rothers, and J Bralla. Improved estimation of secondary structure in ribonucleic acids. Nature: New Biology, 246(150):40 41, Nov [16] I Jr Tinoco, O Uhlenbeck, and M D Levine. Estimation of secondary structure in ribonucleic acids. Nature, 230(5293): , pr [17]. Xayaphoummine, T. Bucher, F. Thalmann, and H. Isambert. Prediction and statistics of pseudoknots in rna structures using exactly clustered stochastic simulations. Proceedings of the National cademy of Sciences of the United States of merica, 100(26): , Dec. 23, [18] M. Zuker, D.H. Mathews, and D.H. Turner. lgorithms and Thermodynamics for RN Secondary Structure Prediction: Practical Guide. NTO SI Series. Kluwer cademic Publishers, [19] M Zuker and P Stiegler. Optimal computer folding of large RN sequences using thermodynamics and auxiliary information. Nucleic cids Research, 9(1): , Jan [20] Michael Zuker. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic cids Research, 31(13): ,
Algorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri RNA Structure Prediction Secondary
More informationproteins are the basic building blocks and active players in the cell, and
12 RN Secondary Structure Sources for this lecture: R. Durbin, S. Eddy,. Krogh und. Mitchison, Biological sequence analysis, ambridge, 1998 J. Setubal & J. Meidanis, Introduction to computational molecular
More information98 Algorithms in Bioinformatics I, WS 06, ZBIT, D. Huson, December 6, 2006
98 Algorithms in Bioinformatics I, WS 06, ZBIT, D. Huson, December 6, 2006 8.3.1 Simple energy minimization Maximizing the number of base pairs as described above does not lead to good structure predictions.
More informationCombinatorial approaches to RNA folding Part I: Basics
Combinatorial approaches to RNA folding Part I: Basics Matthew Macauley Department of Mathematical Sciences Clemson University http://www.math.clemson.edu/~macaule/ Math 4500, Spring 2015 M. Macauley (Clemson)
More informationComputational Approaches for determination of Most Probable RNA Secondary Structure Using Different Thermodynamics Parameters
Computational Approaches for determination of Most Probable RNA Secondary Structure Using Different Thermodynamics Parameters 1 Binod Kumar, Assistant Professor, Computer Sc. Dept, ISTAR, Vallabh Vidyanagar,
More informationImpact Of The Energy Model On The Complexity Of RNA Folding With Pseudoknots
Impact Of The Energy Model On The omplexity Of RN Folding With Pseudoknots Saad Sheikh, Rolf Backofen Yann Ponty, niversity of Florida, ainesville, S lbert Ludwigs niversity, Freiburg, ermany LIX, NRS/Ecole
More informationCombinatorial approaches to RNA folding Part II: Energy minimization via dynamic programming
ombinatorial approaches to RNA folding Part II: Energy minimization via dynamic programming Matthew Macauley Department of Mathematical Sciences lemson niversity http://www.math.clemson.edu/~macaule/ Math
More informationRNA secondary structure prediction. Farhat Habib
RNA secondary structure prediction Farhat Habib RNA RNA is similar to DNA chemically. It is usually only a single strand. T(hyamine) is replaced by U(racil) Some forms of RNA can form secondary structures
More informationDNA/RNA Structure Prediction
C E N T R E F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 12 DNA/RNA Structure Prediction Epigenectics Epigenomics:
More informationA Novel Statistical Model for the Secondary Structure of RNA
ISBN 978-1-8466-93-3 Proceedings of the 5th International ongress on Mathematical Biology (IMB11) Vol. 3 Nanjing, P. R. hina, June 3-5, 11 Novel Statistical Model for the Secondary Structure of RN Liu
More informationRNA Basics. RNA bases A,C,G,U Canonical Base Pairs A-U G-C G-U. Bases can only pair with one other base. wobble pairing. 23 Hydrogen Bonds more stable
RNA STRUCTURE RNA Basics RNA bases A,C,G,U Canonical Base Pairs A-U G-C G-U wobble pairing Bases can only pair with one other base. 23 Hydrogen Bonds more stable RNA Basics transfer RNA (trna) messenger
More informationRNA-Strukturvorhersage Strukturelle Bioinformatik WS16/17
RNA-Strukturvorhersage Strukturelle Bioinformatik WS16/17 Dr. Stefan Simm, 01.11.2016 simm@bio.uni-frankfurt.de RNA secondary structures a. hairpin loop b. stem c. bulge loop d. interior loop e. multi
More informationBachelor Thesis. RNA Secondary Structure Prediction
Bachelor Thesis RNA Secondary Structure Prediction Sophie Schneiderbauer soschnei@uos.de Cognitive Science, University Of Osnabrück First supervisor: Prof. Dr. Volker Sperschneider Second supervisor: Prof.
More informationRapid Dynamic Programming Algorithms for RNA Secondary Structure
ADVANCES IN APPLIED MATHEMATICS 7,455-464 I f Rapid Dynamic Programming Algorithms for RNA Secondary Structure MICHAEL S. WATERMAN* Depurtments of Muthemutics und of Biologicul Sciences, Universitk of
More informationDYNAMIC PROGRAMMING ALGORITHMS FOR RNA STRUCTURE PREDICTION WITH BINDING SITES
DYNAMIC PROGRAMMING ALGORITHMS FOR RNA STRUCTURE PREDICTION WITH BINDING SITES UNYANEE POOLSAP, YUKI KATO, TATSUYA AKUTSU Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho,
More informationDANNY BARASH ABSTRACT
JOURNAL OF COMPUTATIONAL BIOLOGY Volume 11, Number 6, 2004 Mary Ann Liebert, Inc. Pp. 1169 1174 Spectral Decomposition for the Search and Analysis of RNA Secondary Structure DANNY BARASH ABSTRACT Scales
More informationBIOINFORMATICS. Prediction of RNA secondary structure based on helical regions distribution
BIOINFORMATICS Prediction of RNA secondary structure based on helical regions distribution Abstract Motivation: RNAs play an important role in many biological processes and knowing their structure is important
More informationPredicting RNA Secondary Structure
7.91 / 7.36 / BE.490 Lecture #6 Mar. 11, 2004 Predicting RNA Secondary Structure Chris Burge Review of Markov Models & DNA Evolution CpG Island HMM The Viterbi Algorithm Real World HMMs Markov Models for
More informationRNA Folding and Interaction Prediction: A Survey
RNA Folding and Interaction Prediction: A Survey Syed Ali Ahmed Graduate Center, City University of New York New York, NY November 19, 2015 Abstract The problem of computationally predicting the structure
More informationThe Double Helix. CSE 417: Algorithms and Computational Complexity! The Central Dogma of Molecular Biology! DNA! RNA! Protein! Protein!
The Double Helix SE 417: lgorithms and omputational omplexity! Winter 29! W. L. Ruzzo! Dynamic Programming, II" RN Folding! http://www.rcsb.org/pdb/explore.do?structureid=1t! Los lamos Science The entral
More informationComputational approaches for RNA energy parameter estimation
omputational approaches for RNA energy parameter estimation by Mirela Ştefania Andronescu M.Sc., The University of British olumbia, 2003 B.Sc., Bucharest Academy of Economic Studies, 1999 A THESIS SUBMITTED
More informationCONTRAfold: RNA Secondary Structure Prediction without Physics-Based Models
Supplementary Material for CONTRAfold: RNA Secondary Structure Prediction without Physics-Based Models Chuong B Do, Daniel A Woods, and Serafim Batzoglou Stanford University, Stanford, CA 94305, USA, {chuongdo,danwoods,serafim}@csstanfordedu,
More informationRNA and Protein Structure Prediction
RNA and Protein Structure Prediction Bioinformatics: Issues and Algorithms CSE 308-408 Spring 2007 Lecture 18-1- Outline Multi-Dimensional Nature of Life RNA Secondary Structure Prediction Protein Structure
More informationComputational Biology: Basics & Interesting Problems
Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information
More informationRNA Abstract Shape Analysis
ourse: iegerich RN bstract nalysis omplete shape iegerich enter of Biotechnology Bielefeld niversity robert@techfak.ni-bielefeld.de ourse on omputational RN Biology, Tübingen, March 2006 iegerich ourse:
More informationRecitaLon CB Lecture #10 RNA Secondary Structure
RecitaLon 3-19 CB Lecture #10 RNA Secondary Structure 1 Announcements 2 Exam 1 grades and answer key will be posted Friday a=ernoon We will try to make exams available for pickup Friday a=ernoon (probably
More informationBIOINF 4120 Bioinforma2cs 2 - Structures and Systems -
BIOINF 4120 Bioinforma2cs 2 - Structures and Systems - Oliver Kohlbacher Summer 2014 3. RNA Structure Part II Overview RNA Folding Free energy as a criterion Folding free energy of RNA Zuker- SCegler algorithm
More informationGrand Plan. RNA very basic structure 3D structure Secondary structure / predictions The RNA world
Grand Plan RNA very basic structure 3D structure Secondary structure / predictions The RNA world very quick Andrew Torda, April 2017 Andrew Torda 10/04/2017 [ 1 ] Roles of molecules RNA DNA proteins genetic
More informationComputational RNA Secondary Structure Design:
omputational RN Secondary Structure Design: Empirical omplexity and Improved Methods by Rosalía guirre-hernández B.Sc., niversidad Nacional utónoma de México, 996 M. Eng., niversidad Nacional utónoma de
More informationSemi-Supervised CONTRAfold for RNA Secondary Structure Prediction: A Maximum Entropy Approach
Wright State University CORE Scholar Browse all Theses and Dissertations Theses and Dissertations 2011 Semi-Supervised CONTRAfold for RNA Secondary Structure Prediction: A Maximum Entropy Approach Jianping
More informationA Method for Aligning RNA Secondary Structures
Method for ligning RN Secondary Structures Jason T. L. Wang New Jersey Institute of Technology J Liu, JTL Wang, J Hu and B Tian, BM Bioinformatics, 2005 1 Outline Introduction Structural alignment of RN
More informationUsing SetPSO to determine RNA secondary structure
Using SetPSO to determine RNA secondary structure by Charles Marais Neethling Submitted in partial fulfilment of the requirements for the degree of Master of Science (Computer Science) in the Faculty of
More informationLecture 12. DNA/RNA Structure Prediction. Epigenectics Epigenomics: Gene Expression
C N F O N G A V B O N F O M A C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 12 DNA/NA Structure Prediction pigenectics pigenomics: Gene xpression ranscription factors
More informationMoments of the Boltzmann distribution for RNA secondary structures
Bulletin of Mathematical Biology 67 (2005) 1031 1047 www.elsevier.com/locate/ybulm Moments of the Boltzmann distribution for RNA secondary structures István Miklós a, Irmtraud M. Meyer b,,borbála Nagy
More informationGCD3033:Cell Biology. Transcription
Transcription Transcription: DNA to RNA A) production of complementary strand of DNA B) RNA types C) transcription start/stop signals D) Initiation of eukaryotic gene expression E) transcription factors
More informationCopyright (c) 2007 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained
Copyright (c) 2007 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending an email to pubs-permissions@ieee.org.
More informationRNA Folding Algorithms. Michal Ziv-Ukelson Ben Gurion University of the Negev
RNA Folding Algorithms Michal Ziv-Ukelson Ben Gurion University of the Negev The RNA Folding Problem: Given an RNA sequence, predict its energetically most stable structure (minimal free energy). AUCCCCGUAUCGAUC
More informationRNA Folding Algorithms. Michal Ziv-Ukelson Ben Gurion University of the Negev
RNA Folding Algorithms Michal Ziv-Ukelson Ben Gurion University of the Negev The RNA Folding Problem: Given an RNA sequence, predict its energetically most stable structure (minimal free energy). AUCCCCGUAUCGAUC
More informationIntroduction to Molecular and Cell Biology
Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the molecular basis of disease? What
More information2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology
2012 Univ. 1301 Aguilera Lecture Introduction to Molecular and Cell Biology Molecular biology seeks to understand the physical and chemical basis of life. and helps us answer the following? What is the
More informationarxiv: v1 [q-bio.bm] 21 Oct 2010
TT2NE: A novel algorithm to predict RNA secondary structures with pseudoknots Michaël Bon and Henri Orland Institut de Physique Théorique, CEA Saclay, CNRS URA 2306, arxiv:1010.4490v1 [q-bio.bm] 21 Oct
More informationExtending the hypergraph analogy for RNA dynamic programming
Extending the hypergraph analogy for RN dynamic programming Yann Ponty Balaji Raman édric Saule Polytechnique/NRS/INRI MIB France Yann Ponty, Balaji Raman, édric Saule RN Folding RN = Biopolymer composed
More informationBCB 444/544 Fall 07 Dobbs 1
BCB 444/544 Required Reading (before lecture) Lecture 25 Mon Oct 15 - Lecture 23 Protein Tertiary Structure Prediction Chp 15 - pp 214-230 More RNA Structure Wed Oct 17 & Thurs Oct 18 - Lecture 24 & Lab
More informationRNA Secondary Structure. CSE 417 W.L. Ruzzo
RN Secondary Structure SE 417 W.L. Ruzzo The Double Helix Los lamos Science The entral Dogma of Molecular Biology DN RN Protein gene Protein DN (chromosome) cell RN (messenger) Non-coding RN Messenger
More informationGenome 559 Wi RNA Function, Search, Discovery
Genome 559 Wi 2009 RN Function, Search, Discovery The Message Cells make lots of RN noncoding RN Functionally important, functionally diverse Structurally complex New tools required alignment, discovery,
More informationSparse RNA Folding Revisited: Space-Efficient Minimum Free Energy Prediction
Sparse RNA Folding Revisited: Space-Efficient Minimum Free Energy Prediction Sebastian Will 1 and Hosna Jabbari 2 1 Bioinformatics/IZBI, University Leipzig, swill@csail.mit.edu 2 Ingenuity Lab, National
More informationFrom gene to protein. Premedical biology
From gene to protein Premedical biology Central dogma of Biology, Molecular Biology, Genetics transcription replication reverse transcription translation DNA RNA Protein RNA chemically similar to DNA,
More informationBiomolecules. Energetics in biology. Biomolecules inside the cell
Biomolecules Energetics in biology Biomolecules inside the cell Energetics in biology The production of energy, its storage, and its use are central to the economy of the cell. Energy may be defined as
More informationarxiv: v3 [math.co] 17 Jan 2018
An infinite class of unsaturated rooted trees corresponding to designable RNA secondary structures arxiv:1709.08088v3 [math.co] 17 Jan 2018 Jonathan Jedwab Tara Petrie Samuel Simon 23 September 2017 (revised
More informationRNA$2 nd $structure$predic0on
RNA$2 nd $structure$predic0on Recall Nucleic Acids - RNA and DNA The carrier of genetic information - The blueprints of proteins Nucleotides Bases Adenine (A) Guanine (G) Cytosine(C) Thymine(T) Uracil
More informationChapter 1. Topic: Overview of basic principles
Chapter 1 Topic: Overview of basic principles Four major themes of biochemistry I. What are living organism made from? II. How do organism acquire and use energy? III. How does an organism maintain its
More informationStructure-Based Comparison of Biomolecules
Structure-Based Comparison of Biomolecules Benedikt Christoph Wolters Seminar Bioinformatics Algorithms RWTH AACHEN 07/17/2015 Outline 1 Introduction and Motivation Protein Structure Hierarchy Protein
More informationRNA Search and! Motif Discovery" Genome 541! Intro to Computational! Molecular Biology"
RNA Search and! Motif Discovery" Genome 541! Intro to Computational! Molecular Biology" Day 1" Many biologically interesting roles for RNA" RNA secondary structure prediction" 3 4 Approaches to Structure
More informationLab III: Computational Biology and RNA Structure Prediction. Biochemistry 208 David Mathews Department of Biochemistry & Biophysics
Lab III: Computational Biology and RNA Structure Prediction Biochemistry 208 David Mathews Department of Biochemistry & Biophysics Contact Info: David_Mathews@urmc.rochester.edu Phone: x51734 Office: 3-8816
More informationConserved RNA Structures. Ivo L. Hofacker. Institut for Theoretical Chemistry, University Vienna.
onserved RN Structures Ivo L. Hofacker Institut for Theoretical hemistry, University Vienna http://www.tbi.univie.ac.at/~ivo/ Bled, January 2002 Energy Directed Folding Predict structures from sequence
More informationThe Multistrand Simulator: Stochastic Simulation of the Kinetics of Multiple Interacting DNA Strands
The Multistrand Simulator: Stochastic Simulation of the Kinetics of Multiple Interacting DNA Strands Joseph Schaeffer, Caltech (slides by John Reif) Entropy: Is a measure of disorder or randomness of a
More informationDNA THE CODE OF LIFE 05 JULY 2014
LIFE SIENES N THE OE OF LIFE 05 JULY 2014 Lesson escription In this lesson we nswer questions on: o N, RN and Protein synthesis o The processes of mitosis and meiosis o omparison of the processes of meiosis
More informationCS681: Advanced Topics in Computational Biology
CS681: Advanced Topics in Computational Biology Can Alkan EA224 calkan@cs.bilkent.edu.tr Week 10 Lecture 1 http://www.cs.bilkent.edu.tr/~calkan/teaching/cs681/ RNA folding Prediction of secondary structure
More informationBIOINFORMATICS. Fast evaluation of internal loops in RNA secondary structure prediction. Abstract. Introduction
BIOINFORMATICS Fast evaluation of internal loops in RNA secondary structure prediction Abstract Motivation: Though not as abundant in known biological processes as proteins, RNA molecules serve as more
More informationD Dobbs ISU - BCB 444/544X 1
11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses
More informationPrediction of RNA secondary structure including kissing hairpin motifs
Prediction of RNA secondary structure including kissing hairpin motifs Corinna Theis, Stefan Janssen, and Robert Giegerich Faculty of Technology, Bielefeld University 33501 Bielefeld, Germany robert@techfak.uni-bielefeld.de
More informationHuman Biology. The Chemistry of Living Things. Concepts and Current Issues. All Matter Consists of Elements Made of Atoms
2 The Chemistry of Living Things PowerPoint Lecture Slide Presentation Robert J. Sullivan, Marist College Michael D. Johnson Human Biology Concepts and Current Issues THIRD EDITION Copyright 2006 Pearson
More informationBIOINF 4120 Bioinforma2cs 2 - Structures and Systems -
BIIF 4120 Bioinforma2cs 2 - Structures and Systems - liver Kohlbacher SS 2011 2. RA Structure Part I verview RA Types of RA and their biological func@on Two- dimensional structure Three- dimensional structure
More informationStructural biomathematics: an overview of molecular simulations and protein structure prediction
: an overview of molecular simulations and protein structure prediction Figure: Parc de Recerca Biomèdica de Barcelona (PRBB). Contents 1 A Glance at Structural Biology 2 3 1 A Glance at Structural Biology
More informationA tutorial on RNA folding methods and resources
A tutorial on RNA folding methods and resources Alain Denise, LRI/IGM, Université Paris-Sud with invaluable help from Yann Ponty, CNRS/Ecole Polytechnique 1 Master BIBS 2014-2015 Goals To help your work
More informationDNA Structure. Voet & Voet: Chapter 29 Pages Slide 1
DNA Structure Voet & Voet: Chapter 29 Pages 1107-1122 Slide 1 Review The four DNA bases and their atom names The four common -D-ribose conformations All B-DNA ribose adopt the C2' endo conformation All
More informationThe wonderful world of RNA informatics
December 9, 2012 Course Goals Familiarize you with the challenges involved in RNA informatics. Introduce commonly used tools, and provide an intuition for how they work. Give you the background and confidence
More informationBerg Tymoczko Stryer Biochemistry Sixth Edition Chapter 1:
Berg Tymoczko Stryer Biochemistry Sixth Edition Chapter 1: Biochemistry: An Evolving Science Tips on note taking... Remember copies of my lectures are available on my webpage If you forget to print them
More informationPosition-specific scoring matrices (PSSM)
Regulatory Sequence nalysis Position-specific scoring matrices (PSSM) Jacques van Helden Jacques.van-Helden@univ-amu.fr Université d ix-marseille, France Technological dvances for Genomics and Clinics
More informationIn Genomes, Two Types of Genes
In Genomes, Two Types of Genes Protein-coding: [Start codon] [codon 1] [codon 2] [ ] [Stop codon] + DNA codons translated to amino acids to form a protein Non-coding RNAs (NcRNAs) No consistent patterns
More informationChapter 1. A Method to Predict the 3D Structure of an RNA Scaffold. Xiaojun Xu and Shi-Jie Chen. Abstract. 1 Introduction
Chapter 1 Abstract The ever increasing discoveries of noncoding RNA functions draw a strong demand for RNA structure determination from the sequence. In recently years, computational studies for RNA structures,
More informationThe Ensemble of RNA Structures Example: some good structures of the RNA sequence
The Ensemble of RNA Structures Example: some good structures of the RNA sequence GGGGGUAUAGCUCAGGGGUAGAGCAUUUGACUGCAGAUCAAGAGGUCCCUGGUUCAAAUCCAGGUGCCCCCU free energy in kcal/mol (((((((..((((...))))...((((...))))(((((...)))))))))))).
More informationRNA Protein Interaction
Introduction RN binding in detail structural analysis Examples RN Protein Interaction xel Wintsche February 16, 2009 xel Wintsche RN Protein Interaction Introduction RN binding in detail structural analysis
More informationAlgorithms in Computational Biology (236522) spring 2008 Lecture #1
Algorithms in Computational Biology (236522) spring 2008 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: 15:30-16:30/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office hours:??
More informationA Combinatorial Framework for Multiple RNA Interaction Prediction
City University of New York (CUNY) CUNY Academic Works Dissertations, Theses, and Capstone Projects Graduate Center 9-2017 A Combinatorial Framework for Multiple RNA Interaction Prediction Syed Ali Ahmed
More informationQuantitative modeling of RNA single-molecule experiments. Ralf Bundschuh Department of Physics, Ohio State University
Quantitative modeling of RN single-molecule experiments Ralf Bundschuh Department of Physics, Ohio State niversity ollaborators: lrich erland, LM München Terence Hwa, San Diego Outline: Single-molecule
More informationClassified Dynamic Programming
Bled, Feb. 2009 Motivation Our topic: Programming methodology A trade-off in dynamic programming between search space design and evaluation of candidates A trade-off between modifying your code and adding
More informationarxiv: v1 [q-bio.bm] 25 Jul 2012
CoFold: thermodynamic RNA structure prediction with a kinetic twist arxiv:1207.6013v1 [q-bio.bm] 25 Jul 2012 Jeff R. Proctor and Irmtraud M. Meyer Centre for High-Throughput Biology & Department of Computer
More informationJunction-Explorer Help File
Junction-Explorer Help File Dongrong Wen, Christian Laing, Jason T. L. Wang and Tamar Schlick Overview RNA junctions are important structural elements of three or more helices in the organization of the
More informationComputing the partition function and sampling for saturated secondary structures of RNA, with respect to the Turner energy model
Computing the partition function and sampling for saturated secondary structures of RNA, with respect to the Turner energy model J. Waldispühl 1,3 P. Clote 1,2, 1 Department of Biology, Higgins 355, Boston
More informationMath 8803/4803, Spring 2008: Discrete Mathematical Biology
Math 8803/4803, Spring 2008: Discrete Mathematical Biology Prof. hristine Heitsch School of Mathematics eorgia Institute of Technology Lecture 12 February 4, 2008 Levels of RN structure Selective base
More informationVideos. Bozeman, transcription and translation: https://youtu.be/h3b9arupxzg Crashcourse: Transcription and Translation - https://youtu.
Translation Translation Videos Bozeman, transcription and translation: https://youtu.be/h3b9arupxzg Crashcourse: Transcription and Translation - https://youtu.be/itsb2sqr-r0 Translation Translation The
More informationChapter 9 DNA recognition by eukaryotic transcription factors
Chapter 9 DNA recognition by eukaryotic transcription factors TRANSCRIPTION 101 Eukaryotic RNA polymerases RNA polymerase RNA polymerase I RNA polymerase II RNA polymerase III RNA polymerase IV Function
More informationMultiple Choice Review- Eukaryotic Gene Expression
Multiple Choice Review- Eukaryotic Gene Expression 1. Which of the following is the Central Dogma of cell biology? a. DNA Nucleic Acid Protein Amino Acid b. Prokaryote Bacteria - Eukaryote c. Atom Molecule
More informationUsing distance geomtry to generate structures
Using distance geomtry to generate structures David A. Case Genomic systems and structures, Spring, 2009 Converting distances to structures Metric Matrix Distance Geometry To describe a molecule in terms
More informationCOMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University
COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018
More informationSupplementary Material
Supplementary Material Sm-I Formal Description of the Sampling Process In the sequel, given an RNA molecule r consisting of n nucleotides, we denote the corresponding sequence fragment from position i
More informationLecture 5: September Time Complexity Analysis of Local Alignment
CSCI1810: Computational Molecular Biology Fall 2017 Lecture 5: September 21 Lecturer: Sorin Istrail Scribe: Cyrus Cousins Note: LaTeX template courtesy of UC Berkeley EECS dept. Disclaimer: These notes
More informationBME 5742 Biosystems Modeling and Control
BME 5742 Biosystems Modeling and Control Lecture 24 Unregulated Gene Expression Model Dr. Zvi Roth (FAU) 1 The genetic material inside a cell, encoded in its DNA, governs the response of a cell to various
More informationRanjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013
Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA
More informationSparse RNA folding revisited: space efficient minimum free energy structure prediction
DOI 10.1186/s13015-016-0071-y Algorithms for Molecular Biology RESEARCH ARTICLE Sparse RNA folding revisited: space efficient minimum free energy structure prediction Sebastian Will 1* and Hosna Jabbari
More informationA graph kernel approach to the identification and characterisation of structured non-coding RNAs using multiple sequence alignment information
graph kernel approach to the identification and characterisation of structured noncoding RNs using multiple sequence alignment information Mariam lshaikh lbert Ludwigs niversity Freiburg, Department of
More informationMolecular Biology - Translation of RNA to make Protein *
OpenStax-CNX module: m49485 1 Molecular Biology - Translation of RNA to make Protein * Jerey Mahr Based on Translation by OpenStax This work is produced by OpenStax-CNX and licensed under the Creative
More informationChapter 1. DNA is made from the building blocks adenine, guanine, cytosine, and. Answer: d
Chapter 1 1. Matching Questions DNA is made from the building blocks adenine, guanine, cytosine, and. Answer: d 2. Matching Questions : Unbranched polymer that, when folded into its three-dimensional shape,
More information13 Comparative RNA analysis
13 Comparative RNA analysis Sources for this lecture: R. Durbin, S. Eddy, A. Krogh und G. Mitchison, Biological sequence analysis, Cambridge, 1998 D.W. Mount. Bioinformatics: Sequences and Genome analysis,
More informationAlgorithmic Aspects of RNA Secondary Structures
Algorithmic Aspects of RNA Secondary Structures Stéphane Vialette CNRS & LIGM, Université Paris-Est Marne-la-Vallée, France 214-115 S. Vialette (CNRS & LIGM) RNA Secondary Structures 214-215 1 / 124 Introduction
More informationSugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following
Name: Score: / Quiz 2 on Lectures 3 &4 Part 1 Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following foods is not a significant source of
More informationAnalytical Study of Hexapod mirnas using Phylogenetic Methods
Analytical Study of Hexapod mirnas using Phylogenetic Methods A.K. Mishra and H.Chandrasekharan Unit of Simulation & Informatics, Indian Agricultural Research Institute, New Delhi, India akmishra@iari.res.in,
More informationMATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME
MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:
More informationMotif Prediction in Amino Acid Interaction Networks
Motif Prediction in Amino Acid Interaction Networks Omar GACI and Stefan BALEV Abstract In this paper we represent a protein as a graph where the vertices are amino acids and the edges are interactions
More information