A Novel Statistical Model for the Secondary Structure of RNA

Size: px
Start display at page:

Download "A Novel Statistical Model for the Secondary Structure of RNA"

Transcription

1 ISBN Proceedings of the 5th International ongress on Mathematical Biology (IMB11) Vol. 3 Nanjing, P. R. hina, June 3-5, 11 Novel Statistical Model for the Secondary Structure of RN Liu Hong 1, 1 Zhou Pei-Yuan enter for pplied Mathematics, Tsinghua niversity, Beijing, P.R. hina, 184 bstract. In this paper, a novel statistical model for the secondary structures of RN is constructed, which is not only applicable to homo-rn chains, but also can be easily extended to the hetero-rn sequences by considering Watson-rick base pairings. The main approach used here is the operator transfer matrix, whose calculation rules are simple, and can be automatically finished. Through the model, we found the conformational entropy of homo-rn is highest, and grows linearly with chain length; but the hairpin/coil transition of homo-rn is less cooperative than natural hetero-rn sequences. s a typical example, we also verified the native secondary structures of microrn PSK is thermally stable with respect to model parameters, which we believe is a direct consequence of nature's evolution. Keywords: statistical model, operator matrix, secondary structure of RN. 1. Introduction RN is one of the most important single-stranded biomolecules in cell. It serves as a key linkage between DN and proteins, which transcripts the genetic information of DN, and then translates into different types of functional proteins. RN is constituted of four different types of nucleotides, denine(), racil(), ylosine() and uanine(). In most cases, denine and racil form two hydrogen bonds between them; while ylosine and uanine form three hydrogen bonds. This rule is called the Watson-rick base pairing[1]. Other types of hydrogen bonding pairs are seldom seen. To fulfill its biological functions, RN must adopt some specific three dimensional structure, whose overall topology is usually characterized by a shape of inverted letter L ; while its scaffold is provided by the secondary structure elements --- hairpins, hairpin loops, interior loops and bugles. These secondary structures are thermally stable independent of the tertiary structure, and determine a large fraction of final RN structure[1]. In past several decades, plenty of studies have been dedicated to the prediction of three-dimensional structure of RN, especially the secondary structure. One of the first attempts was made by Nussinov and co-workers[,3]. They built a simple nearest-neighbour energy model, and used the method of dynamic programming to try to maximize the total number of base-pairs. The energy parameters they adopted were derived from empirical calorimetric experiments[4,5], as well as known structures[6]. Later Zuker and Stiegler[7] proposed a slightly refined approach to incorporate stacking effect. In 1999, Rivas and Eddy[8] published new algorithms to handle the pseudoknots, which must be excluded from previous standard recursions. n alternative computational approach was to sample the RN structures from Boltzmann ensemble[9,1]. Besides many computational studies, statistical mechanical models for RN have attracted general interest[11-16], which can offer the probabilities of all chemically allowed conformations, no matter how small their populations are. famous example is the helix/coil transition theory[17], which was specifically designed for the helical structures of biopolymers. However this theory cannot be applied to RN directly, since RN's secondary structure is mainly made up by β-hairpins rather than α-helix. nother systematical study was done by hen and Dill[18,19], in which they constructed a losed raph ount Matrix to describe the complex topological structure of RN. major drawback of hen's model is the neglect of Watson-rick base pairings[1], which constrains the possible applications to natural hetero-rns. orresponding author. Tel.: address: hong-l4@mails.tsinghua.edu.cn. 473

2 In current paper, we have extended the transfer matrix method used in classical helix/coil transition theory, and applied it to the study of RN's secondary structures. The construction of operator transfer matrix and corresponding calculation rules for homo- and hetero-rns was illustrated in the methodology part. In the application part, we compared the conformational entropy and hairpin/coil transition for different RN sequences. We also studied the free energy surface of a typical microrn -- PSK, which was proved thermally stable against all parameters.. Basic Model.1. Polymer graph Fig. 1: (a) Sample secondary structure of RN. (b) orresponding polymer graph and nucleotide coding of RN. In Fig. 1(a), a typical RN structure is illustrated, with nucleotides numbered from 1 to 33 along the chain. s we can see, if we neglect tertiary topology and only focus on secondary structures, all chemically allowed conformations of RN would just depend on the state of each nucleotide. ccording to different types of secondary structures elements, the states of nucleotides can be simply divided into three different classes: hairpin residues (form hydrogen bonds with another nucleotide), loop residues (in the hairpin loops, interior loops or bugles, form no hydrogen bonds) and coil residues (at the two ends of a RN chain, without hydrogen bonds). The hydrogen bonds between hairpin residues are very strong. In fact, they are the dominant effect in maintaining the stability of secondary structures in RN. The coil residues and loop residues are similar, which do not form any hydrogen bonds with other nucleotides. However, the coil residues at two ends of the RN chain are totally free; while the torsion angles of loop residues are geometrically constrained. This leads to a minor energy difference between coil residues and loop residues. The secondary structure of RN can be systematically expressed through a polymer graph (see Fig. 1(b)), which can be constructed according to following steps[18,19]: (1) If two nucleotides form a pair through hydrogen bonding, a link between them will be added. The residue at the left of a link will be denoted as a, and the one at the right as b. () The loop residues(l ) distribute between hairpin residues and form no links. Here we neglect the difference between hairpin loops, interior loops and bugles for simplicity. (3) The coil residues stay at the two ends of a RN chain and without links too. The ones at the beginning of a chain are written as c, while those at the ending as c 1. Based on the polymer graph, we will construct the transfer matrix for the calculation of partition function... Statistical weights and operators The transfer matrix, which has been extensively studied in the helix/coil transition theory[17], is a powerful method to generate the possible conformations of chain molecules. However, its original version is limited to α-helix, and cannot guarantee the correct generation of all secondary structures of RN. s a result, we need to extend the numerical transfer matrix to an operator one, which is constituted by two parts: the numerical statistical weights and the operators. The statistical weight shows the relative probability of each state that a nucleotide can adopt, which is listed in Table. 1. Value 1 is arbitrarily assigned to coil residues, since only the relative ratio is effective; the parameter t is given to loop residues, accounting for geometrical constraints on their possible torsion angles; while the parameter is for hairpin residues, representing the hydrogen bonding between two nucleotides. 474

3 The operators are represented by the capitals in Table. 1, and correspond to the states of nucleotides one to one. To guarantee the correct generation of all chemically allowed conformations of RN, we argue that parameters (,t, ) can communicate with operators; but operators (, B, ) cannot change order with each other. They must be calculated from left to right according to following rules:,, BB BB, B. (1) Rules in Eq. 1 can be easily deduced from the general consideration of hydrogen bonding pairs in RN. Since coil residues do not form any hydrogen bond, we can shorten the operator chain as,, BB BB. dditionally, two hairpin residues in state a and b can form a hydrogen bond between them, thus we design the simplification rule B. In fact, it can be proved that rules in Eq. 1 are both self-consistent and sufficient to guarantee the correct generation of all chemically allowed conformations of RN. Table 1: States, statistical weights and operators of nucleotides in RN state weight operator description c 1 coil residue, at the beginning of a chain a hairpin residue, on the left of a link l t loop residue, between hairpin residues b B hairpin residue, on the right of a link c 1 1 coil residue, at the end of a chai.3. Operator transfer matrix Now we can construct the operator transfer matrix for homo-rn in the same way as in the traditional helix/coil transition theory[17]. In general, the left-most vertical column of the matrix stands for each state of nucleotide i ; while the top horizontal column represents the states of nucleotide i 1. Then if the state of nucleotide i 1 is not allowed to appear after that of nucleotide i in RN, its corresponding value is set as zero; otherwise it will be the product of statistical weight and operator of nucleotide i 1 as shown in Table. 1. In this way, we construct the operator transfer matrix as M t t t Here we introduce an additional parameter to take in the edging effect of a loop. B B. () s a consequence, the partition function for a RN chain with n nucleotides can be written as n B, 1 Z 1 VM, (3) where V (1 ) is a 1 5 vector, and ( 1 1) is a 5 1 vector. dramatic difference from traditional calculation of partition function is that here we need to deal with some specific operator chains generated through the multiplication of operator transfer matrix, which can be simplified according to the rules listed in Eq. 1. For each operator chain, if all operators except are cancelled in the end, its corresponding statistical weights will appear in the final partition function; otherwise its contribution will be zero. This procedure can be automatically done through the symbolic calculation of computers. Once the partition function obtained, a variety of important statistical quantities that characterize the secondary structure of RN can be derived[], i.e. the average number of hairpin residues n b ln Z ln, loop residues ln Z lnt, coil residues nc n nb nl, and the average number of loops capacity, which is given as n l T k l ln Z ln etc. nother notable quantity is the specific heat 475

4 where ln ln ln ln Z ln Z ln Z v k B t, (ln ) (ln ) (ln ) (4) t denotes the statistical ensemble average. In above calculation, we assume model parameters (,t, ) are independent of the temperature, which is valid only when the temperature variation is not very large..4. Extension to hetero-rn bove argument is valid only for homo-rn chains. To account for the natural hetero-rn sequence, we need to correlate the statistical weights and operators with the different types of nucleotides, i.e. denine(), racil(), ylosine() and uanine() respectively. ccordingly, the calculation rules of operators will be extended as,,,,,,,,,,,,,,,,,,,,,,,,,, B B B B, (5) B,, B B B, (6),, B B B B. (7) Note the rules in Eq. 6 account for Watson-rick base pairs; while rules in Eq. 7 are for mismatched nucleotide pairs. 3. pplications 3.1. onformational entropy onformational entropy ( S c ) makes significant contributions to the stability of RN structures. It is an important thermal quantity, and directly related with the total number of possible RN conformation through S c ln ( n), where n stands for the chain length. In Fig., we studied the length dependence of conformational entropy for three different types of RN sequences. s we can see there is clearly a linear relationship between the conformational entropy and the chain length. For homo-rn, least square linear fitting gives S c n ; for hetero-rn sequence, we show two different cases. One is 1 randomly generated hetero-rn sequences, which gives an n / n / average S c n ; the other is the most stable RN sequence ( ), which is chosen to form most Watson-rick base pairs. In this case, linear fitting shows S c n.,,,, Fig. : omparison of conformational entropy for different RN sequences. Red squares stand for numerical data of n / n / homo-rn based on Eq. 3; blue triangles for the data of hetero-rn sequence, which is chosen to form most hydrogen bonds; and purple circles for average values of 1 randomly generated hetero-rn sequences. Therefore when the chain length is equal, homo-rn sequence gives the most possible conformations, as any two nonconsecutive nucleotides in homo-rn may form a hydrogen bonding pair. The possible conformations of hetero-rn sequence are far fewer, especially the randomly generated RN sequences, which produce the fewest conformations due to quite limited possible Watson-rick base pairs. For natural RN sequences, their conformation entropy stays in between of the n / n / sequence and the 476

5 randomly generated ones. Thus their conformational space is neither too large nor too small, a compromise between fast folding and biological evolution. direct consequence of the linear relationship between conformational entropy and chain length is the total number of possible RN conformations will grow exponentially with the chain length. In fact, this phenomenon is not a special case, but another form of Levinthal's problem in RN[1]. s a result, the exact enumeration of all secondary structures of RN is a NP-hard problem. rough estimation shows only the partition function of RN sequence with n 5 is computationally acceptable at present. 3.. Hairpin/coil transition in RN The formation of RN's secondary structure is a highly cooperative process. Here we compare the hairpin/coil transition curves for different RN sequences, which include a -nucleotides homo-rn sequence, a 1 1 sequence and a hetero-rn sequence by removing the first three nucleotides from the microrn PSK (denoted as PSK ). The parameter values in our calculation are taken from the experimental data by Turner[4,5], which 1 1 shows the bonding energies are around. 9 ~ 1.3kcal mol K for base pairs, and ~.4kcal mol K for base pairs at 31 K. Thus according to the Boltzmann relationship, we can infer and For the edging effect of a loop region, since we here did not distinguish the different types of hairpin loops, interior loops and 1 1 bugles, we will take an average value 4kcal mol K, which gives. 15. The value of t can be gotten from the energy changes with respect to the loop length, which vary from. 1 to 1 1.4kcal mol K. Thus t In later computations, we set t (corresponding to.kcal mol K ) for simplicity. Fig. 3: omparison of hairpin/coil transition in different RN sequences with chain length n. The dependence of average number of hairpin residues, loop residues and loops on parameter are shown in subplots (a-c); while the dependence of specific heat capacity on parameters, t and are shown in subplots (d-f) separately. s we can see in Fig. 3(a,b,c), the transitional behaviors of homo-rn and 1 1 are quite similar. With the increase of hydrogen bonding effect, the average numbers of hairpin residues, loop residues and loops for the homo-rn and sequence 1 1 grow quickly from zero to their maximum values ( n b 18, nl, kl 1 ), then stay constantly at the most stable structure. While the curves of PSK are relatively different: below 1, no apparent hairpin structures could be detected; from to 4, most of the native hydrogen bonding pairs are formed until 477

6 n b 8, n 7, k 1, which just corresponds to the native secondary structure of PSK (see Fig. l l 4(a)). Here we did not show results for different parameters t and, however their qualitative behaviours are quite clear. In general, the influence of parameters t and on the average number of hairpin residues, loop residues and loops are quite limited, but the sharpness of transition curves will sensitively depend on t and. The smaller t and are, the sharper the transition curves will be, which also means higher cooperativity of the hairpin/coil transition of RN. The folding cooperativity and thermal stability of RN sequences could be directly learned from the study of specific heat capacity. From Fig. 3(d,e,f), we found the homo-rn and sequence 1 1 have relatively the smallest specific heat capacity. Since these two sequences can form most hydrogen bonding pairs, their thermal variations with parameters, t and are very small, expect for a sharp peak at 1, which corresponds to the conformational transition from coil to hairpin structure. On the contrary, the specific heat capacity of PSK varies more largely with model parameters, which we believe is mainly due to many alternative conformations produced by Watson-rick base pairings in hetero-rn. To be more exactly, there are two peaks and one valley on the v curve for PSK in Fig. 3(d). The first peak corresponds to the hairpin/coil transition; the second one is for the repacking of RN structure; while the valley shows the thermal stability of native secondary structure of PSK. In Fig. 3(e,f), the specific heat capacity of PSK increases almost monotonously with parameters t and. The larger t and are, the easier loops can be formed. s a consequence, structure changes will also become easier, which leads to a larger value of the specific heat capacity Thermal stability of PSK Fig. 4: (a) Native secondary structure of PSK. (b) Population distribution of the native secondary structure of PSK with respect to parameters and. Here we set.15, t t t t. 7. In previous section, we have made some general comparisons between homo-rn and hetero-rn sequences on the hairpin/coil transition. Now we turn to study a specific example -- microrn PSK, whose natural sequence is given as, including 3-nucleotides. Its native secondary structure is shown in Fig. 4(a), with eight coil residues, four consecutive pairs of hairpin residues and a hairpin-loop composed of seven loop residues. In Fig. 4(b), we can find this native secondary structure is the dominant conformation in a reasonable parameter region with and However, the population of native secondary structure in this region is only about 3 4%, which means there are many other different conformations simultaneously existing in the system. Therefore the native RN structure must be understood from a statistical ensemble point. The thermal stability of native secondary structure of PSK (see Fig. 4(a)) can be learned from its specific heat capacity. In Fig.5, we plot the changes of specific heat capacity of PSK with respect to each model parameter. In Fig. 5(a), when parameters change from. 7 to. 88, the local minimum of v moves continuously from 6. 9 to 5. ; in Fig. 5(b), when change from 5. 5 to 7. 1, the local minimum of v moves from. 1 to Both match very well with our 478

7 above showed reasonable parameter region for natural RN, with and In Fig. 5(c), for reasonable values of and, their corresponding specific heat capacities at t. 7 do not deviate largely from the values of nearby local minimum. In Fig. 5(d), when. 88 and 7. 1, the local minimum of v lays at.5, not far from. 15 the value we chosen; when. 7 and 5.5, the value of v at. 15 is also close to local minimum. ll these results strongly support the fact that native secondary structure of PSK is thermally stable with respect to all model parameters. Or in other words, the nucleotide sequence of PSK is optimized in the sense of thermal stability, which we believe is a direct consequence of natural selection during evolution. Fig. 5: (a) Dependence of the specific heat capacity v of PSK on model parameters (a) ; (b) ; (c) t ; and (d). In all calculations, we set the non-variable parameters as,, and.15, t t t t onclusion and Discussion In current study, based on the operator transfer matrix, a novel statistical mechanical model has been constructed for the study of RN structures. Through numerical calculation, we found the hairpin/coil transition in homo-rn seems less cooperative than natural RN sequences. We further explored the thermal stability of microrn PSK with respect to different model parameters, which convinced the fact that the nucleotide sequence of natural RN has been optimized during nature's evolution. major neglect in current model is the tertiary structure, which is believed essential for biological functions of natural RN. lthough the secondary structure is thermally stable independently, a comprehensive description of RN structure still requires the knowledge of tertiary structure. This problem may be solved through a further generalization of the operator transfer matrix. nother notable point is when extending the homo-rn model to hetero-rn sequences, we have introduced some specific matching rules (see Eqs. 5-7) for Watson-rick base pairs. Despite their generality for natural RN, there still exist several exceptions, like and pairs etc[]. The interactions between these non-watson-rick base pairs are weaker than the Watson-rick base pairs. Their natural occurrence is also relatively low. Our current model can be easily extended to include these cases, just by modifying the calculation rules of operators in Eqs This work is supported in part by the Tsinghua niversity Initiative Scientific Research Program ( ). 479

8 5. References [1].L. Lehninger, et al. Principles of Biochemistry. W.H. Freeman, 5. [] R. Nussinov,. Piecznik, J.R. rigg and D.J. Kleitman. lgorithms for loop matchings, SIM Journal on pplied Mathematics [3] R. Nussinov and.b. Jacobson. Fast algorithm for predicting the secondary structure of single-stranded RN, PNS. 198, 77: [4] [5] D.H. Mathews, et al. Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RN secondary structure, PNS. 4, 11: [6].B. Do, D.. Woods and S. Batzoglou. ONTRfold: RN secondary structure prediction without physicsbased models, Bioinformatics. 6, :9-98. [7] M. Zuker and P. Stiegler. Optimal computer folding of large RN sequences using thermodynamics and auxiliary information, Nucleic cids Res. 1981, 9: [8] E. Rivas and S.R. Eddy. dynamic programming algorithm for RN structure prediction including pseudoknots, J. Mol. Biol. 1999, 85: [9] J.S. Mcaskill. The equilibrium partition function and base pair binding probabilities for RN secondary structure, Biopolymers. 199, 9: [1] Y. Ding and.e. Lawrence. statistical sampling algorithm for RN secondary structure prediction, Nucleic cids Res. 3, 31: [11] P.. Bevilacqua and J.M. Blose. Structures, kinetics, thermodynamics, and biological functions of RN hairpins, nnu. Rev. Phys. hem. 8, 59: [1] S.J. hen and K.. Dill. RN folding energy landscapes, PNS., 97: [13] E. Tostesen, S.J. hen and K.. Dill. RN folding transitions and cooperativity, J. Phys. hem. 1, 15: [14] W.B. Zhang and S.J. hen. RN hairpin-folding kinetics, PNS,, 99: [15] W.B. Zhang and S.J. hen. Exploring the complex folding kinetics of RN hairpins: I. eneral folding kinetics analysis, Biophysical Journal, 6, 9: [16] W.B. Zhang and S.J. hen. Exploring the complex folding kinetics of RN hairpins: II. Effect of sequence, length, and misfolded states, Biophysical Journal, 6, 9: [17] D. Poland and H.. Scheraga. Theory of Helix-oil Transitions in Biopolymers. cademic Press, 197. [18] S.J. hen and K.. Dill. Statistical thermodynamics of double-stranded polymer molecules, J. hem. Phys. 1995, 13: [19] S.J. hen and K.. Dill. Theory for the conformational changes of double-stranded chain molecules, J. hem. Phys. 1998, 19: [] Kerson Huang. Statistical Mechanics. Wiley, [1]. Levinthal. re there pathways far protein folding? J. hem. Phys. Phys. Biol. 1968, 65: []. Nagaswamy, N. Voss, Z. Zhang and.e. Fox. Database of non-canonical base pairs found in known RN structures, Nucleic cids Res., 8:

proteins are the basic building blocks and active players in the cell, and

proteins are the basic building blocks and active players in the cell, and 12 RN Secondary Structure Sources for this lecture: R. Durbin, S. Eddy,. Krogh und. Mitchison, Biological sequence analysis, ambridge, 1998 J. Setubal & J. Meidanis, Introduction to computational molecular

More information

RNA Secondary Structure Prediction

RNA Secondary Structure Prediction RN Secondary Structure Prediction Perry Hooker S 531: dvanced lgorithms Prof. Mike Rosulek University of Montana December 10, 2010 Introduction Ribonucleic acid (RN) is a macromolecule that is essential

More information

Biophysical Model Building

Biophysical Model Building Biophysical Model Building Step 1: Come up with a hypothesis about how a system works How many binding sites? Is there cooperativity? Step 2: Translate the qualitative hypotheses into an observable mathematical

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

DNA/RNA Structure Prediction

DNA/RNA Structure Prediction C E N T R E F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 12 DNA/RNA Structure Prediction Epigenectics Epigenomics:

More information

98 Algorithms in Bioinformatics I, WS 06, ZBIT, D. Huson, December 6, 2006

98 Algorithms in Bioinformatics I, WS 06, ZBIT, D. Huson, December 6, 2006 98 Algorithms in Bioinformatics I, WS 06, ZBIT, D. Huson, December 6, 2006 8.3.1 Simple energy minimization Maximizing the number of base pairs as described above does not lead to good structure predictions.

More information

2.4 DNA structure. S(l) bl + c log l + d, with c 1.8k B. (2.72)

2.4 DNA structure. S(l) bl + c log l + d, with c 1.8k B. (2.72) 2.4 DNA structure DNA molecules come in a wide range of length scales, from roughly 50,000 monomers in a λ-phage, 6 0 9 for human, to 9 0 0 nucleotides in the lily. The latter would be around thirty meters

More information

Predicting free energy landscapes for complexes of double-stranded chain molecules

Predicting free energy landscapes for complexes of double-stranded chain molecules JOURNAL OF CHEMICAL PHYSICS VOLUME 114, NUMBER 9 1 MARCH 2001 Predicting free energy landscapes for complexes of double-stranded chain molecules Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

Quantitative modeling of RNA single-molecule experiments. Ralf Bundschuh Department of Physics, Ohio State University

Quantitative modeling of RNA single-molecule experiments. Ralf Bundschuh Department of Physics, Ohio State University Quantitative modeling of RN single-molecule experiments Ralf Bundschuh Department of Physics, Ohio State niversity ollaborators: lrich erland, LM München Terence Hwa, San Diego Outline: Single-molecule

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

S(l) bl + c log l + d, with c 1.8k B. (2.71)

S(l) bl + c log l + d, with c 1.8k B. (2.71) 2.4 DNA structure DNA molecules come in a wide range of length scales, from roughly 50,000 monomers in a λ-phage, 6 0 9 for human, to 9 0 0 nucleotides in the lily. The latter would be around thirty meters

More information

Impact Of The Energy Model On The Complexity Of RNA Folding With Pseudoknots

Impact Of The Energy Model On The Complexity Of RNA Folding With Pseudoknots Impact Of The Energy Model On The omplexity Of RN Folding With Pseudoknots Saad Sheikh, Rolf Backofen Yann Ponty, niversity of Florida, ainesville, S lbert Ludwigs niversity, Freiburg, ermany LIX, NRS/Ecole

More information

arxiv: v1 [cond-mat.soft] 22 Oct 2007

arxiv: v1 [cond-mat.soft] 22 Oct 2007 Conformational Transitions of Heteropolymers arxiv:0710.4095v1 [cond-mat.soft] 22 Oct 2007 Michael Bachmann and Wolfhard Janke Institut für Theoretische Physik, Universität Leipzig, Augustusplatz 10/11,

More information

Protein folding. Today s Outline

Protein folding. Today s Outline Protein folding Today s Outline Review of previous sessions Thermodynamics of folding and unfolding Determinants of folding Techniques for measuring folding The folding process The folding problem: Prediction

More information

A Method for Aligning RNA Secondary Structures

A Method for Aligning RNA Secondary Structures Method for ligning RN Secondary Structures Jason T. L. Wang New Jersey Institute of Technology J Liu, JTL Wang, J Hu and B Tian, BM Bioinformatics, 2005 1 Outline Introduction Structural alignment of RN

More information

Combinatorial approaches to RNA folding Part I: Basics

Combinatorial approaches to RNA folding Part I: Basics Combinatorial approaches to RNA folding Part I: Basics Matthew Macauley Department of Mathematical Sciences Clemson University http://www.math.clemson.edu/~macaule/ Math 4500, Spring 2015 M. Macauley (Clemson)

More information

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila 1 Guessing the upper bound free-energy difference between native-like structures Jorge A. Vila IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700- San Luis, Argentina Use

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

Computational Approaches for determination of Most Probable RNA Secondary Structure Using Different Thermodynamics Parameters

Computational Approaches for determination of Most Probable RNA Secondary Structure Using Different Thermodynamics Parameters Computational Approaches for determination of Most Probable RNA Secondary Structure Using Different Thermodynamics Parameters 1 Binod Kumar, Assistant Professor, Computer Sc. Dept, ISTAR, Vallabh Vidyanagar,

More information

RNA-Strukturvorhersage Strukturelle Bioinformatik WS16/17

RNA-Strukturvorhersage Strukturelle Bioinformatik WS16/17 RNA-Strukturvorhersage Strukturelle Bioinformatik WS16/17 Dr. Stefan Simm, 01.11.2016 simm@bio.uni-frankfurt.de RNA secondary structures a. hairpin loop b. stem c. bulge loop d. interior loop e. multi

More information

Supersecondary Structures (structural motifs)

Supersecondary Structures (structural motifs) Supersecondary Structures (structural motifs) Various Sources Slide 1 Supersecondary Structures (Motifs) Supersecondary Structures (Motifs): : Combinations of secondary structures in specific geometric

More information

Secondary structure stability, beta-sheet formation & stability

Secondary structure stability, beta-sheet formation & stability Protein Physics 2016 Lecture 6, February 5 Secondary structure stability, beta-sheet formation & stability Magnus Andersson magnus.andersson@scilifelab.se Theoretical & Computational Biophysics Recap of

More information

Scaling Law for the Radius of Gyration of Proteins and Its Dependence on Hydrophobicity

Scaling Law for the Radius of Gyration of Proteins and Its Dependence on Hydrophobicity Scaling Law for the Radius of Gyration of Proteins and Its Dependence on Hydrophobicity LIU HONG, JINZHI LEI Zhou Pei-Yuan Center for Applied Mathematics, Tsinghua University, Beijing, People s Republic

More information

RNA Abstract Shape Analysis

RNA Abstract Shape Analysis ourse: iegerich RN bstract nalysis omplete shape iegerich enter of Biotechnology Bielefeld niversity robert@techfak.ni-bielefeld.de ourse on omputational RN Biology, Tübingen, March 2006 iegerich ourse:

More information

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday. Short Announcements 1 st Quiz today: 15 minutes Homework 3: Due next Wednesday. Next Lecture, on Visualizing Molecular Dynamics (VMD) by Klaus Schulten Today s Lecture: Protein Folding, Misfolding, Aggregation

More information

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Naoto Morikawa (nmorika@genocript.com) October 7, 2006. Abstract A protein is a sequence

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Taking the advice of Lord Kelvin, the Father of Thermodynamics, I describe the protein molecule and other

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Protein folding. α-helix. Lecture 21. An α-helix is a simple helix having on average 10 residues (3 turns of the helix)

Protein folding. α-helix. Lecture 21. An α-helix is a simple helix having on average 10 residues (3 turns of the helix) Computat onal Biology Lecture 21 Protein folding The goal is to determine the three-dimensional structure of a protein based on its amino acid sequence Assumption: amino acid sequence completely and uniquely

More information

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics Proteins polymer molecules, folded in complex structures Konstantin Popov Department of Biochemistry and Biophysics Outline General aspects of polymer theory Size and persistent length of ideal linear

More information

Chemistry 565 / 665 BIOPHYSICAL CHEMISTRY. - Spring

Chemistry 565 / 665 BIOPHYSICAL CHEMISTRY. - Spring Chemistry 565 / 665 BIOPHYSICAL CHEMISTRY - Spring 2003 - LECTURE: LECTURER: OFFICE HOURS: 9:55 10:45 a.m. MTRF, B383 Chemistry Prof. Silvia Cavagnero Office: 8108 New Chemistry Building (will be 5341

More information

RNA secondary structure prediction. Farhat Habib

RNA secondary structure prediction. Farhat Habib RNA secondary structure prediction Farhat Habib RNA RNA is similar to DNA chemically. It is usually only a single strand. T(hyamine) is replaced by U(racil) Some forms of RNA can form secondary structures

More information

Lecture 12. DNA/RNA Structure Prediction. Epigenectics Epigenomics: Gene Expression

Lecture 12. DNA/RNA Structure Prediction. Epigenectics Epigenomics: Gene Expression C N F O N G A V B O N F O M A C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 12 DNA/NA Structure Prediction pigenectics pigenomics: Gene xpression ranscription factors

More information

Presenter: She Zhang

Presenter: She Zhang Presenter: She Zhang Introduction Dr. David Baker Introduction Why design proteins de novo? It is not clear how non-covalent interactions favor one specific native structure over many other non-native

More information

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University Thermodynamics Entropy and its Applications Lecture 11 NC State University System and surroundings Up to this point we have considered the system, but we have not concerned ourselves with the relationship

More information

Bio nformatics. Lecture 23. Saad Mneimneh

Bio nformatics. Lecture 23. Saad Mneimneh Bio nformatics Lecture 23 Protein folding The goal is to determine the three-dimensional structure of a protein based on its amino acid sequence Assumption: amino acid sequence completely and uniquely

More information

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018

More information

The Double Helix. CSE 417: Algorithms and Computational Complexity! The Central Dogma of Molecular Biology! DNA! RNA! Protein! Protein!

The Double Helix. CSE 417: Algorithms and Computational Complexity! The Central Dogma of Molecular Biology! DNA! RNA! Protein! Protein! The Double Helix SE 417: lgorithms and omputational omplexity! Winter 29! W. L. Ruzzo! Dynamic Programming, II" RN Folding! http://www.rcsb.org/pdb/explore.do?structureid=1t! Los lamos Science The entral

More information

Motif Prediction in Amino Acid Interaction Networks

Motif Prediction in Amino Acid Interaction Networks Motif Prediction in Amino Acid Interaction Networks Omar GACI and Stefan BALEV Abstract In this paper we represent a protein as a graph where the vertices are amino acids and the edges are interactions

More information

Frontiers in Physics 27-29, Sept Self Avoiding Growth Walks and Protein Folding

Frontiers in Physics 27-29, Sept Self Avoiding Growth Walks and Protein Folding Frontiers in Physics 27-29, Sept. 2012 Self Avoiding Growth Walks and Protein Folding K P N Murthy, K Manasa, and K V K Srinath School of Physics, School of Life Sciences University of Hyderabad September

More information

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions Van der Waals Interactions

More information

Protein Folding Prof. Eugene Shakhnovich

Protein Folding Prof. Eugene Shakhnovich Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)

More information

Protein Folding. I. Characteristics of proteins. C α

Protein Folding. I. Characteristics of proteins. C α I. Characteristics of proteins Protein Folding 1. Proteins are one of the most important molecules of life. They perform numerous functions, from storing oxygen in tissues or transporting it in a blood

More information

Biomolecules: lecture 10

Biomolecules: lecture 10 Biomolecules: lecture 10 - understanding in detail how protein 3D structures form - realize that protein molecules are not static wire models but instead dynamic, where in principle every atom moves (yet

More information

Lab III: Computational Biology and RNA Structure Prediction. Biochemistry 208 David Mathews Department of Biochemistry & Biophysics

Lab III: Computational Biology and RNA Structure Prediction. Biochemistry 208 David Mathews Department of Biochemistry & Biophysics Lab III: Computational Biology and RNA Structure Prediction Biochemistry 208 David Mathews Department of Biochemistry & Biophysics Contact Info: David_Mathews@urmc.rochester.edu Phone: x51734 Office: 3-8816

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

Principles of Physical Biochemistry

Principles of Physical Biochemistry Principles of Physical Biochemistry Kensal E. van Hold e W. Curtis Johnso n P. Shing Ho Preface x i PART 1 MACROMOLECULAR STRUCTURE AND DYNAMICS 1 1 Biological Macromolecules 2 1.1 General Principles

More information

Modeling Background; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 8

Modeling Background; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 8 Modeling Background; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 8 Depending on thermodynamic and solvent conditions, the interrelationships between thermodynamic stability of

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

Biophysics II. Key points to be covered. Molecule and chemical bonding. Life: based on materials. Molecule and chemical bonding

Biophysics II. Key points to be covered. Molecule and chemical bonding. Life: based on materials. Molecule and chemical bonding Biophysics II Life: based on materials By A/Prof. Xiang Yang Liu Biophysics & Micro/nanostructures Lab Department of Physics, NUS Organelle, formed by a variety of molecules: protein, DNA, RNA, polysaccharides

More information

EVALUATION OF RNA SECONDARY STRUCTURE MOTIFS USING REGRESSION ANALYSIS

EVALUATION OF RNA SECONDARY STRUCTURE MOTIFS USING REGRESSION ANALYSIS EVLTION OF RN SEONDRY STRTRE MOTIFS SIN RERESSION NLYSIS Mohammad nwar School of Information Technology and Engineering, niversity of Ottawa e-mail: manwar@site.uottawa.ca bstract Recent experimental evidences

More information

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 Protein Structure levels or organization Primary structure: sequence of amino acids (from

More information

Master equation approach to finding the rate-limiting steps in biopolymer folding

Master equation approach to finding the rate-limiting steps in biopolymer folding JOURNAL OF CHEMICAL PHYSICS VOLUME 118, NUMBER 7 15 FEBRUARY 2003 Master equation approach to finding the rate-limiting steps in biopolymer folding Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

Protein Folding Challenge and Theoretical Computer Science

Protein Folding Challenge and Theoretical Computer Science Protein Folding Challenge and Theoretical Computer Science Somenath Biswas Department of Computer Science and Engineering, Indian Institute of Technology Kanpur. (Extended Abstract) September 2011 Almost

More information

Matter and Substances Section 3-1

Matter and Substances Section 3-1 Matter and Substances Section 3-1 Key Idea: All matter is made up of atoms. An atom has a positively charges core surrounded by a negatively charged region. An atom is the smallest unit of matter that

More information

= (-22) = +2kJ /mol

= (-22) = +2kJ /mol Lecture 8: Thermodynamics & Protein Stability Assigned reading in Campbell: Chapter 4.4-4.6 Key Terms: DG = -RT lnk eq = DH - TDS Transition Curve, Melting Curve, Tm DH calculation DS calculation van der

More information

3D HP Protein Folding Problem using Ant Algorithm

3D HP Protein Folding Problem using Ant Algorithm 3D HP Protein Folding Problem using Ant Algorithm Fidanova S. Institute of Parallel Processing BAS 25A Acad. G. Bonchev Str., 1113 Sofia, Bulgaria Phone: +359 2 979 66 42 E-mail: stefka@parallel.bas.bg

More information

BIOINFORMATICS. Fast evaluation of internal loops in RNA secondary structure prediction. Abstract. Introduction

BIOINFORMATICS. Fast evaluation of internal loops in RNA secondary structure prediction. Abstract. Introduction BIOINFORMATICS Fast evaluation of internal loops in RNA secondary structure prediction Abstract Motivation: Though not as abundant in known biological processes as proteins, RNA molecules serve as more

More information

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1 Zhou Pei-Yuan Centre for Applied Mathematics, Tsinghua University November 2013 F. Piazza Center for Molecular Biophysics and University of Orléans, France Selected topic in Physical Biology Lecture 1

More information

Structure-Based Comparison of Biomolecules

Structure-Based Comparison of Biomolecules Structure-Based Comparison of Biomolecules Benedikt Christoph Wolters Seminar Bioinformatics Algorithms RWTH AACHEN 07/17/2015 Outline 1 Introduction and Motivation Protein Structure Hierarchy Protein

More information

3. Solutions W = N!/(N A!N B!) (3.1) Using Stirling s approximation ln(n!) = NlnN N: ΔS mix = k (N A lnn + N B lnn N A lnn A N B lnn B ) (3.

3. Solutions W = N!/(N A!N B!) (3.1) Using Stirling s approximation ln(n!) = NlnN N: ΔS mix = k (N A lnn + N B lnn N A lnn A N B lnn B ) (3. 3. Solutions Many biological processes occur between molecules in aqueous solution. In addition, many protein and nucleic acid molecules adopt three-dimensional structure ( fold ) in aqueous solution.

More information

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Lattice protein models

Lattice protein models Lattice protein models Marc R. Roussel epartment of Chemistry and Biochemistry University of Lethbridge March 5, 2009 1 Model and assumptions The ideas developed in the last few lectures can be applied

More information

Protein Structure Prediction Using Multiple Artificial Neural Network Classifier *

Protein Structure Prediction Using Multiple Artificial Neural Network Classifier * Protein Structure Prediction Using Multiple Artificial Neural Network Classifier * Hemashree Bordoloi and Kandarpa Kumar Sarma Abstract. Protein secondary structure prediction is the method of extracting

More information

A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics

A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics JOURNAL OF CHEMICAL PHYSICS VOLUME 115, NUMBER 3 15 JULY 2001 A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics Dominik Gront Department of

More information

DANNY BARASH ABSTRACT

DANNY BARASH ABSTRACT JOURNAL OF COMPUTATIONAL BIOLOGY Volume 11, Number 6, 2004 Mary Ann Liebert, Inc. Pp. 1169 1174 Spectral Decomposition for the Search and Analysis of RNA Secondary Structure DANNY BARASH ABSTRACT Scales

More information

Molecular Driving Forces

Molecular Driving Forces Molecular Driving Forces Statistical Thermodynamics in Chemistry and Biology SUBGfittingen 7 At 216 513 073 / / Ken A. Dill Sarina Bromberg With the assistance of Dirk Stigter on the Electrostatics chapters

More information

Chemistry 431. Lecture 27 The Ensemble Partition Function Statistical Thermodynamics. NC State University

Chemistry 431. Lecture 27 The Ensemble Partition Function Statistical Thermodynamics. NC State University Chemistry 431 Lecture 27 The Ensemble Partition Function Statistical Thermodynamics NC State University Representation of an Ensemble N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T

More information

The role of form in molecular biology. Conceptual parallelism with developmental and evolutionary biology

The role of form in molecular biology. Conceptual parallelism with developmental and evolutionary biology The role of form in molecular biology Conceptual parallelism with developmental and evolutionary biology Laura Nuño de la Rosa (UCM & IHPST) KLI Brown Bag talks 2008 The irreducibility of biological form

More information

Algorithms in Bioinformatics

Algorithms in Bioinformatics Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri RNA Structure Prediction Secondary

More information

STRUCTURAL BIOINFORMATICS I. Fall 2015

STRUCTURAL BIOINFORMATICS I. Fall 2015 STRUCTURAL BIOINFORMATICS I Fall 2015 Info Course Number - Classification: Biology 5411 Class Schedule: Monday 5:30-7:50 PM, SERC Room 456 (4 th floor) Instructors: Vincenzo Carnevale - SERC, Room 704C;

More information

Journal of Mathematical Analysis and Applications

Journal of Mathematical Analysis and Applications J. Math. Anal. Appl. 383 (011) 00 07 Contents lists available at ScienceDirect Journal of Mathematical Analysis and Applications www.elsevier.com/locate/jmaa Asymptotic enumeration of some RNA secondary

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v1 2 Feb 94 cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San

More information

Chapter 1. A Method to Predict the 3D Structure of an RNA Scaffold. Xiaojun Xu and Shi-Jie Chen. Abstract. 1 Introduction

Chapter 1. A Method to Predict the 3D Structure of an RNA Scaffold. Xiaojun Xu and Shi-Jie Chen. Abstract. 1 Introduction Chapter 1 Abstract The ever increasing discoveries of noncoding RNA functions draw a strong demand for RNA structure determination from the sequence. In recently years, computational studies for RNA structures,

More information

Quiz 2 Morphology of Complex Materials

Quiz 2 Morphology of Complex Materials 071003 Quiz 2 Morphology of Complex Materials 1) Explain the following terms: (for states comment on biological activity and relative size of the structure) a) Native State b) Unfolded State c) Denatured

More information

Introduction to Polymer Physics

Introduction to Polymer Physics Introduction to Polymer Physics Enrico Carlon, KU Leuven, Belgium February-May, 2016 Enrico Carlon, KU Leuven, Belgium Introduction to Polymer Physics February-May, 2016 1 / 28 Polymers in Chemistry and

More information

Protein Structure Analysis with Sequential Monte Carlo Method. Jinfeng Zhang Computational Biology Lab Department of Statistics Harvard University

Protein Structure Analysis with Sequential Monte Carlo Method. Jinfeng Zhang Computational Biology Lab Department of Statistics Harvard University Protein Structure Analysis with Sequential Monte Carlo Method Jinfeng Zhang Computational Biology Lab Department of Statistics Harvard University Introduction Structure Function & Interaction Protein structure

More information

Rapid Dynamic Programming Algorithms for RNA Secondary Structure

Rapid Dynamic Programming Algorithms for RNA Secondary Structure ADVANCES IN APPLIED MATHEMATICS 7,455-464 I f Rapid Dynamic Programming Algorithms for RNA Secondary Structure MICHAEL S. WATERMAN* Depurtments of Muthemutics und of Biologicul Sciences, Universitk of

More information

Pymol Practial Guide

Pymol Practial Guide Pymol Practial Guide Pymol is a powerful visualizor very convenient to work with protein molecules. Its interface may seem complex at first, but you will see that with a little practice is simple and powerful

More information

CONTRAfold: RNA Secondary Structure Prediction without Physics-Based Models

CONTRAfold: RNA Secondary Structure Prediction without Physics-Based Models Supplementary Material for CONTRAfold: RNA Secondary Structure Prediction without Physics-Based Models Chuong B Do, Daniel A Woods, and Serafim Batzoglou Stanford University, Stanford, CA 94305, USA, {chuongdo,danwoods,serafim}@csstanfordedu,

More information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013 Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA

More information

BIOINFORMATICS. Prediction of RNA secondary structure based on helical regions distribution

BIOINFORMATICS. Prediction of RNA secondary structure based on helical regions distribution BIOINFORMATICS Prediction of RNA secondary structure based on helical regions distribution Abstract Motivation: RNAs play an important role in many biological processes and knowing their structure is important

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Figure S1. Secondary structure of CAP (in the camp 2 -bound state) 10. α-helices are shown as cylinders and β- strands as arrows. Labeling of secondary structure is indicated. CDB, DBD and the hinge are

More information

Extending the hypergraph analogy for RNA dynamic programming

Extending the hypergraph analogy for RNA dynamic programming Extending the hypergraph analogy for RN dynamic programming Yann Ponty Balaji Raman édric Saule Polytechnique/NRS/INRI MIB France Yann Ponty, Balaji Raman, édric Saule RN Folding RN = Biopolymer composed

More information

Contents. xiii. Preface v

Contents. xiii. Preface v Contents Preface Chapter 1 Biological Macromolecules 1.1 General PrincipIes 1.1.1 Macrornolecules 1.2 1.1.2 Configuration and Conformation Molecular lnteractions in Macromolecular Structures 1.2.1 Weak

More information

Predicting RNA Secondary Structure

Predicting RNA Secondary Structure 7.91 / 7.36 / BE.490 Lecture #6 Mar. 11, 2004 Predicting RNA Secondary Structure Chris Burge Review of Markov Models & DNA Evolution CpG Island HMM The Viterbi Algorithm Real World HMMs Markov Models for

More information

D Dobbs ISU - BCB 444/544X 1

D Dobbs ISU - BCB 444/544X 1 11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses

More information

Protein Folding experiments and theory

Protein Folding experiments and theory Protein Folding experiments and theory 1, 2,and 3 Protein Structure Fig. 3-16 from Lehninger Biochemistry, 4 th ed. The 3D structure is not encoded at the single aa level Hydrogen Bonding Shared H atom

More information

Statistical Thermodynamics of DNA Denaturation. processes

Statistical Thermodynamics of DNA Denaturation. processes Statistical Thermodynamics of DNA Denaturation processes Course: Statistical Thermodynamics Advisor: Prof. Raissa D Souza Spring quarter 2012 By: Sebastian Frey 1 Nomenclature F Free Energy [J] x Coordinate

More information

Small RNA in rice genome

Small RNA in rice genome Vol. 45 No. 5 SCIENCE IN CHINA (Series C) October 2002 Small RNA in rice genome WANG Kai ( 1, ZHU Xiaopeng ( 2, ZHONG Lan ( 1,3 & CHEN Runsheng ( 1,2 1. Beijing Genomics Institute/Center of Genomics and

More information

Grand Plan. RNA very basic structure 3D structure Secondary structure / predictions The RNA world

Grand Plan. RNA very basic structure 3D structure Secondary structure / predictions The RNA world Grand Plan RNA very basic structure 3D structure Secondary structure / predictions The RNA world very quick Andrew Torda, April 2017 Andrew Torda 10/04/2017 [ 1 ] Roles of molecules RNA DNA proteins genetic

More information

Basics of protein structure

Basics of protein structure Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu

More information

BIOCHEMISTRY NOTES - UNIT 2-

BIOCHEMISTRY NOTES - UNIT 2- BIOCHEMISTRY NOTES - UNIT 2- ATOMS - the basic unit of matter. Contains subatomic particles o (+ charge) o (no charge/neutral) o (- charge) Protons and neutrons have about the same mass. Electrons are

More information

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Biophysical Journal, Volume 98 Supporting Material Molecular dynamics simulations of anti-aggregation effect of ibuprofen Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Supplemental

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information