arxiv:cond-mat/ v1 [cond-mat.stat-mech] 21 Dec 2004

Similar documents
Non-exponential decay of base-pair opening fluctuations in DNA

Chaotic Modeling and Simulation (CMSIM) 2: , 2017

DISORDER AND FLUCTUATIONS IN NONLINEAR EXCITATIONS IN DNA

Introduction to Polymer Physics

Stacking and Hydrogen Bonding. DNA Cooperativity at Melting.

S(l) bl + c log l + d, with c 1.8k B. (2.71)

2.4 DNA structure. S(l) bl + c log l + d, with c 1.8k B. (2.72)

Statistical Thermodynamics of DNA Denaturation. processes

arxiv: v1 [cond-mat.soft] 28 Sep 2015

Modelling DNA at the mesoscale: a challenge for nonlinear science?

DNA Bubble Dynamics. Hans Fogedby Aarhus University and Niels Bohr Institute Denmark

Melting Transition of Directly-Linked Gold Nanoparticle DNA Assembly arxiv:physics/ v1 [physics.bio-ph] 10 Mar 2005

Chapter 1. Topic: Overview of basic principles

Dynamics of Proton and Electron conductivity in DNA Chains

X-ray reflectivity of Fibonacci multilayers. Abstract

arxiv: v1 [cond-mat.soft] 19 Mar 2012

Finite Ring Geometries and Role of Coupling in Molecular Dynamics and Chemistry

arxiv:cond-mat/ v2 [cond-mat.soft] 8 Nov 2004

Lesson Overview The Structure of DNA

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 8 Oct 1996

Lecture 34 Protein Unfolding Thermodynamics

Equation of state of additive hard-disk fluid mixtures: A critical analysis of two recent proposals

Localization and electron-phonon interactions in disordered systems

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

Cluster Distribution in Mean-Field Percolation: Scaling and. Universality arxiv:cond-mat/ v1 [cond-mat.stat-mech] 6 Jun 1997.

DNA Structure. Voet & Voet: Chapter 29 Pages Slide 1

Multimedia : Fibronectin and Titin unfolding simulation movies.

NUMERICAL SIMULATION OF THE IR SPECTRA OF DNA BASES

Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following

3 Biopolymers Uncorrelated chains - Freely jointed chain model

Intermolecular Forces & Condensed Phases

Influence of moving breathers on vacancies migration

Part III Biological Physics course

in Halogen-Bonded Complexes

Monte Carlo simulation of confined water

DNA-Scaffolded Self-Assembling Nano-Circuitry

Thermodynamics of nuclei in thermal contact

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Name: Date: Period: Biology Notes: Biochemistry Directions: Fill this out as we cover the following topics in class

Base pairing in DNA.

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model

arxiv: v1 [cond-mat.stat-mech] 10 Jul 2018

Invaded cluster dynamics for frustrated models

Supporting Information for:

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

A MOLECULAR DYNAMICS SIMULATION OF A BUBBLE NUCLEATION ON SOLID SURFACE

5. THE STATISTICAL MECHANICS OF DNA

Sample Question Solutions for the Chemistry of Life Topic Test

2) Matter composed of a single type of atom is known as a(n) 2) A) element. B) mineral. C) electron. D) compound. E) molecule.

Structures of the Molecular Components in DNA and RNA with Bond Lengths Interpreted as Sums of Atomic Covalent Radii

Pressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster

UC Berkeley. Chem 130A. Spring nd Exam. March 10, 2004 Instructor: John Kuriyan

5.111 Lecture Summary #17 Friday, October 17, 2014

Triangular Lattice Foldings-a Transfer Matrix Study.

SUPPLEMENTARY FIGURE 1. Force dependence of the unbinding rate: (a) Force-dependence

NIH Public Access Author Manuscript Phys Rev Lett. Author manuscript; available in PMC 2013 April 16.

Atomic Structures of the Molecular Components in DNA and. RNA based on Bond Lengths as Sums of Atomic Radii

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Microcanonical scaling in small systems arxiv:cond-mat/ v1 [cond-mat.stat-mech] 3 Jun 2004

DNA denaturation in the rodlike polyelectrolyte model

Master equation approach to finding the rate-limiting steps in biopolymer folding

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Computational Biology: Basics & Interesting Problems

NOTES - Ch. 16 (part 1): DNA Discovery and Structure

The 6-vertex model of hydrogen-bonded crystals with bond defects

arxiv:cond-mat/ v1 [cond-mat.mes-hall] 8 Jan 1998

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v2 [cond-mat.soft] 29 Jul 2003

DNA/RNA structure and packing

Heat capacity of water: a signature of nuclear quantum effects. Abstract

arxiv:cond-mat/ v2 [cond-mat.soft] 29 Nov 2004

Breather trapping and breather transmission in a DNA model with an interface.

Biology I Fall Semester Exam Review 2014

arxiv: v1 [cond-mat.stat-mech] 7 Mar 2019

Theoretical and Computational Treatments of DNA and RNA Molecules

CHEM J-3 June Calculate the osmotic pressure of a 0.25 M aqueous solution of sucrose, C 12 H 22 O 11, at 37 C

Number of questions TEK (Learning Target) Biomolecules & Enzymes

Intrinsic Localized Lattice Modes and Thermal Transport: Potential Application in a Thermal Rectifier

arxiv:cond-mat/ v1 [cond-mat.soft] 25 Jul 2002

arxiv:cond-mat/ v1 [cond-mat.other] 5 Jun 2004

Introduction to Molecular and Cell Biology

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

Duduială, Ciprian Ionut (2010) Stochastic nonlinear models of DNA breathing at a defect. PhD thesis, University of Nottingham.

arxiv:cond-mat/ v1 [cond-mat.other] 4 Aug 2004

Effect of the Inner-Zone Vibrations on the Dynamics of Collision-Induced Intramolecular Energy Flow in Highly Excited Toluene

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 13 Apr 1999

2012 Univ Aguilera Lecture. Introduction to Molecular and Cell Biology

Physical Models of Allostery: Allosteric Regulation in Capsid Assembly

Protein folding. Today s Outline

BIOCHEMISTRY GUIDED NOTES - AP BIOLOGY-

Langevin Dynamics of a Single Particle

Cluster Monte Carlo study of multicomponent fluids of the Stillinger-Helfand and Widom- Rowlinson type

Free energy recovery in single molecule experiments

PHYSICAL REVIEW LETTERS

Ensemble equivalence for non-extensive thermostatistics

SOLIDS AND LIQUIDS - Here's a brief review of the atomic picture or gases, liquids, and solids GASES

THE DETAILED BALANCE ENERGY-SCALED DISPLACEMENT MONTE CARLO ALGORITHM

Virgili, Tarragona (Spain) Roma (Italy) Zaragoza, Zaragoza (Spain)

Microscopic Deterministic Dynamics and Persistence Exponent arxiv:cond-mat/ v1 [cond-mat.stat-mech] 22 Sep 1999

Transcription:

Theory of Bubble Nucleation and Cooperativity in DNA Melting arxiv:cond-mat/4259v [cond-mat.stat-mech] 2 Dec 24 Saúl Ares,, 2 N. K. Voulgarakis, 3 K. Ø. Rasmussen, and A. R. Bishop Theoretical Division and Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA 2 Grupo Interdisciplinar de Sistemas Complejos (GISC) and Departamento de Matemáticas, Universidad Carlos III de Madrid, Avenida de la Universidad 3, 289 Leganés, Madrid, Spain 3 Department of Physics, University of Crete and Foundation for Research and Technology-Hellas (FORTH), P.O. Box 228, 73 Heraklion, Crete, Greece (Dated: February 2, 28) The onset of intermediate states (denaturation bubbles) and their role during the melting transition of DNA are studied using the Peyrard-Bishop-Daxuois model by Monte Carlo simulations with no adjustable parameters. Comparison is made with previously published experimental results finding excellent agreement. Melting curves, critical DNA segment length for stability of bubbles and the possibility of a two states transition are studied. PACS numbers: 63.2.Pw,87.5.-v,87.5.He Accessing the genetic code stored in DNA is central to fundamental biological processes such as replication and transcription and this requires that the extraordinary stable double helical structure of the molecule must locally open to physically expose the bases. Although, in the cell, proteins may actively help separating the strands of double stranded DNA, recent evidence [, 2] corroborates that sequence-specific propensity to form strand separations (bubbles) at transcription initiation sites exists and promotes thermal bubble formation. Important thermal effects such as stability of different DNA sequences, and the properties of denaturation bubbles can be studied in vitro and provide important insight to the biological processes. Recent, experimental studies [3, 4, 5] have attempted to interrogate the nature and statistical significance of such bubble states. Intriguingly, these experiments combine traditional UV absorption experiments with a novel bubble quenching technique that traps ensembles of bubbles to capture statistical properties of the bubbles. The actual melting of double-stranded DNA occurs through an entropy driven phase transition. The entropy gained in transitioning from the very rigid doublestranded DNA to the much more flexible single-stranded DNA can, already at moderate temperatures, balance the energy cost of breaking a base-pair. Since, the double-stranded helix is held together by hydrogen bonds between complementary base-pairs: two bonds for the AT pair and three bonds for the stronger GC pair, the sequence heterogeneity interplays with the entropy effects to create an extended premelting temperature window, (including the biologically relevant regime) where large thermal bubbles are readily formed. Theoretical studies of the melting transition have included ones based on Ising-type models [6] describing paired and unpaired bases, thermodynamics models like nearestneighbor models [7], Poland-Scheraga models [8], simple zipper models [9, ], or models that introduces a phenomelogical pairing potential between the bases [, 2, 3]. In particular the Peyrard-Bishop-Dauxois model [2, 3] is emerging as a model that is able to appropriately describe the melting transition but also the sequence dependence of the bubble nucleation dynamics in the pre-melting regime. Here, we compare the powerful recent experimental results in Refs. [4, 5] with Monte Carlo simulations of the model proposed by Peyrard, Bishop, and Dauxois [, 2, 3]. This model has already been successfully compared with denaturation experiments on short homogeneous sequences [4]. The recent demonstration [] of the model s ability to accurately predict the locations at which large bubbles form in several viral sequences, is even more exceptional. The difference between our comparison and previous ones is that we use the same (deceptively) simple model, with no further refinements that introduce new parameters that need to be fitted. Indeed parameters of the model are not changed to fit the experiments: we use the same values for those parameters that were fixed in reference [4] for quite different DNA sequences. The potential energy of the model reads: V = [ D n (e anyn ) 2 + n k 2 ( + ρe β(yn+yn ) )(y n y n ) 2] () The sum is over all the base-pairs of the molecule and y n denotes the relative displacement from equilibrium at the n th base pair. The first term of the potential energy is a Morse potential that represents the hydrogen bonds between the bases. The second term is a nextneighbor coupling that represents the stacking interaction between adjacent base pairs: it comprises a harmonic coupling multiplied by a term that strengthens the coupling when the molecule is closed and makes it

2 weaker when it is melted, in this way taking into account the different stiffness (i.e. entropy effects) of DNA double strands and single strands (this effect can be directly observed, in model calculations, in terms of a softning of the characteristic frequencies of the system with rising temperature [5]). This nonlinear coupling results in long-range cooperative effects in the denaturation, leading to an abrupt entropy-driven transition [2, 3]. A crucial point for obtaining correct results is the accurate description of the heterogeneity of the sequence [9]. In this model it is incorporated by giving different values to the parameters of the Morse potential, depending on the base-pair type of the site considered: adenine-thymine (AT) or guanine-cytosine (GC). The parameter values we have used are those used in Ref. [4]: k =.25eV/A 2, ρ = 2, β =.35A for the inter-site coupling, while for the Morse potential D GC =.75eV, a GC = 6.9A for a GC base pair, and D AT =.5eV, a AT = 4.2A for an AT pair. These parameters were chosen to fit thermodynamic properties of DNA [4]. One should be cautious in relating these parameters directly to microscopic properties, and recall that they arise as a result of several physical phenomena at the microscopic level. Using the standard Metropolis algorithm [6, 7], we have performed Monte Carlo simulations on this model [8]. For each temperature, we performed a number of simulations. In each of these simulations, we compute the mean profile y n, from which we obtain the fraction of open base-pairs. We consider the n th base-pair to be open if y n exceeds a certain threshold. Applying the same threshold, we record at the end of each simulation whether the entire molecule was open (denaturated). Performing a large number of such simulations starting from different initial conditions we obtain the averaged fraction f of open base-pairs and the averaged fraction of denaturated molecules p at a given temperature. In this way, we simulate the experiments, where the measures are made over a large ensemble of molecules. The threshold we have used is.5a, but we have used other values and observed that the faction p of denaturated molecules depends only very slightly on the threshold value. The fraction of open base pairs, f, displays a somewhat stronger dependence on the threshold value. In the same manner as Ref. [4] we obtain the averaged fractional length of the bubbles as l = (f p)/( p). The experimental work [4, 5] concentrated on two sets of sequences one set (bubble-in-the-middle sequences) designed to form bubbles in the middle of the short sequence, and another set (bubble-at-the-end sequences) designed to form bubbles (openings) at one end of the sequences. Specifically, these sequences are: (a) Bubble-in-the-middle sequences : L6B36: CCGCCAGCGGCGTTATTACATTTAATTC TTAAGTATTATAAGTAATATGGCCGCTGCGCC L42B8: CCGCCAGCGGCGTTAATACTTAAGTATT ATGGCCGCTGCGCC L33B9: CCGCCAGCGGCCTTTACTAAAGGCCGCT GCGCC (b) Bubble-at-the-end sequences: L48AS: CATAATACTTTATATTTAATTGGCGGCGC ACGGGACCCGTGCGCCGCC L36AS: CATAATACTTTATATTGCCGCGCACGCGT GCGCGGC L3AS: ATAAAATACTTATTGCCGCACGCGTGC GGC L24AS: ATAATAAAATTGCCCGGTCCGGGC L9AS 2: ATAATAAAGGCGGTCCGCC The bubble-in-the-middle sequences are rich in AT.8.6 4 5 6 7 8 9.8.6 L6B36 L42B8 4 5 6 7 8 9.8.6 L33b9 4 5 6 7 8 9 FIG. : Melting profiles for the bubble-in-the-middle sequences [3, 4, 5]. Filled circles are p, open circles are f and squares are l.

3 σ σ av.5.3. - 4 5 6 7 8 9 2 3 4 5 6 7 L FIG. 2: Upper figure: σ = f p versus T for L6B36 (circles), L42B8 (squares) and L33B9 (diamonds) [3, 4, 5]. Lower figure: σ av versus the length, L, of the molecule..8.6 L48AS 2 3 4 5 6 7 8 9.8.6 L9AS_2 2 3 4 5 6 7 8 FIG. 3: Melting profile for the L48AS and L9AS 2 sequences. Symbols are as in Fig.. base-pairs in the middle, while the bubble-at-the-end are rich in AT base-pairs at one end of the molecule. The AT base pairs are bonded by two hydrogen bonds, as opposed to the stronger triple hydrogen bonding of the GC base-pairs. This fact is obviously reflected in the model parameters (D AT =.5 while D GC =.75) and it also indicates that AT rich regions denaturate at lower temperatures that GC rich regions In Fig. we present our results for the bubble-in-themiddle sequences and we see a very good agreement with the experimental results given in Refs. [4, 5]. As in the experimental results we find for the L6B36 sequence that f l for l <.6. After this point, l displays a plateau, resulting from the occurrence of completely denaturated molecules at T 65C. As noted in the experimental work, the plateau occurs at l.6 because this is the ratio between the AT-rich central region, of 36 base pairs, and the molecule s total of 6 base pairs. The fact that f l before the plateau indicates that the bubble opens continuously as a function of temperature until it reaches its full size, while there are very few completely melted molecules at these temperatures. For the L42B8 sequence we again find a plateau in l at the value 42/8 3, but here f l even at the lower temper- atures (this is even more pronounced for L33B9). This shows that bubble generation and complete denaturation are both possible at lower temperatures. Since the three sequences are similar in structure and merely differ in the length of AT-rich region, this demonstrates that for these structures bubble are only sustainable if the soft region is of size 2 base-pairs or more. To further illustrate this point we show in Fig. 2 σ = f p, which represents the fraction of bases participating in a bubble state at a given temperature. The upper figure of Fig. 2 shows, as discussed, that as the soft bubble region becomes shorter, bubble states become less important, as also seen in Ref. [4]. In the lower figure, we summarize the length dependence of the incidence of bubble states. We plot σ av, the area under the curves in the upper figure divided by their width, versus the molecule length, L. The line is a linear fit showing that these intermediate states disappear (σ av = ) for L 22, in excellent agreement with the experimental conclusion in Refs. [4, 5]. In Fig. 3 we show the melting curves for two of the bubble-in-the-end molecules. Comparison with experimental results in ref. [5] is again good although not as good as in the the bubble-in-the-middle cases. This is due to the limitations of our model at the ends of the DNA molecule. Most experimental features are, however, still

4 σ.3..5-8 -6-4 -2 T-T m ( o C) the premelting regime. Experimental observations regarding nucleation size of the bubbles in the middle of a molecule and the possibility of a two states transition are exactly recovered by the model. This demonstrates that this model not only works for very large DNA strands [], but also for short strands such as the ones studied here. Remarkably, these include both natural and synthetic structures. We are grateful to Prof. G. Zocchi for insightful discussions of the data in Refs. [3, 4, 5]. This work has been supported in part by the Ministerio de Ciencia y Tecnología of Spain through grant BFM23-7749-C5- (SA). SA acknowledges financial support from the Center for Nonlinear Studies, where this work was performed. Work at Los Alamos is performed under the auspices of the US Department of Energy. σ av.3. 2 3 4 5 L FIG. 4: Upper figure: σ = f p versus T T m (T m is the melting temperature) for L48AS (circles), L36AS (squares), L3AS (diamonds), L24AS (triangles up) and L9AS 2 (triangles down) [3, 4, 5]. Lower figure: σ av versus L, the molecule s length. reproduced. For instance, in the L48AS sequence we find the same plateau on l at L.8 that is seen in the experiments. To overcome the problem caused by the boundaries, in Fig. 4 we consider how σ and σ av change with the system size, as the deformation imposed by the excessive end opening will appear in all the molecules and in that way will not contaminate the global picture. In the upper figure we plot σ versus T T m (T m is the melting temperature), finding that the bubble states are smaller for the shorter sequences, as in Ref. [5]. In the lower figure we plot σ av versus L. The extrapolation to σ av = occurs at a value compatible with L, as in Ref. [5], and shows that in our model a two-state transition for this kind of sequences would only be possible in the limit L, just as in the experiments. We have shown that the theoretical model proposed by Peyrard, Bishop, and Dauxois with no further parameters or fitting, accurately reproduces experiments on DNA denaturation, not only for the melting curve, but also for the formation and role of bubble states in [] C.H. Choi et al., Nucleic Acids Res. 32, 584 (24). [2] G. Kalosakas, K. Ø. Rasmussen, A.R. Bishop, C.H. Choi, and A. Usheva, Europhys. Lett. 68, 27 (24). [3] A. Montrichok, G. Gruner and G. Zocchi, Europhys. Lett. 62, 452 (23). [4] Y. Zeng, A. Montrichok and G. Zocchi, Phys. Rev. Lett. 9, 48 (23). [5] Y. Zeng, A. Montrichok and G. Zocchi, J. Mol. Biol. 339, 67 (24). [6] M. Ya Azbel, Phys. Rev. A, 2, 67 (979) and references therein. [7] J. Santa Lucia, Jr., Proc. Natl. Acad. Sci U.S.A. 95, 46 (998). [8] D. Poland and H.A. Scheraga, J. Chem. Phys. 45, 456 (966); 45 464 (966); D. Poland, Biopolymers 73, 26 (24); C. Richard and A. J. Guttmann, J. Stat. Phys. 5 925 (24). [9] C. Kittel, Am. J. Phys. 37, 97 (969). [] V. Ivanov, Y. Zeng, and G. Zocchi, Phys. Rev. E 7, 597 (24). [] M. Peyrard and A. R. Bishop, Phys. Rev. Lett. 62, 2755 (989). [2] T. Dauxois, M. Peyrard and A. R. Bishop, Phys. Rev. E 47 R44 (993). [3] T. Dauxois and M. Peyrard, Phys. Rev. E 5 427 (995). [4] A. Campa and A. Giansanti, Phys. Rev. E 58 3538 (998). [5] N. K. Voulgarakis, G. Kalosakas, K. Ø. Rasmussen, A.R. Bishop, Nano Letters 4, 629, (24). [6] N. Metropolis et al., J. Chem. Phys. bf 2, 87 (953). [7] We use the standard Metropolis algorithm to produce an equilibrium state of the system: A single base pair, n, is picked at random and a new value of the variable y n is proposed according to a thermal (at the appropriate temperature) Gaussian distribution at this base pair. The proposed value is accepted according to the Metropolis probability: if the energy E new of the new configuration is lower than that E old of the old configuration, and exp( [E new E old ]/kt) if E new > E old. The process is continued after thermal equilibrium is reached in the measurement phase.

5 [8] We have used periodic boundary conditions for the bubbles-in-the-middle sequences and open boundary conditions for the bubbles-at-the-end sequences. While it may appear most reasonable to apply open boundary conditions, in both cases, we consistently find that such boundary conditions lead to a much too large propensity for openings at the ends of the DNA molecule. This represents a clear problem of the model that we are working towards improving. [9] S. Ares and A. Sánchez, to be published.