David B. Lukatsky and Ariel Afek Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva Israel
|
|
- Angel James
- 5 years ago
- Views:
Transcription
1 Sequence correlations shape protein promiscuity David B. Lukatsky and Ariel Afek Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva Israel Abstract We predict that diagonal correlations of amino acid positions within protein sequences statistically enhance protein propensity for promiscuous binding. Diagonal correlations represent statistically significant repeats of sequence patterns where amino acids of the same type are clustered together. The predicted effect is qualitatively robust with respect to the form of the microscopic interaction potential and the average amino acid composition. We suggest experimental and bioinformatics approaches to test the predicted effect. Recent experimental evidences that proteins within a cell maintain a high degree of nonspecificity have challenged the understanding of molecular mechanisms providing the specificity of protein-protein binding [1, 2]. Such non-specific binding is termed protein promiscuity. Numerous organismal-scale measurements of protein-protein interactions (PPI) suggest that organismal proteomes possess a higher degree of non-specific binding [3-7]. It appears that protein promiscuity is an evolutionary selectable trait enabling proteins to evolve more efficiently [1]. The key question is what makes a protein promiscuous? Are there generic sequence signatures of promiscuity? In this report we predict one such generic signature. We predict that protein sequences with the enhanced correlations of sequence positions of amino acids of the same type generally represent more promiscuous sequences. We term such correlations diagonal. The latter finding suggests that the symmetry properties and strength of sequence correlations is a key factor that controls the global connectivity properties of PPI networks. We begin by introducing a statistical measure of promiscuity for a given protein, A. Such measure can be defined as the probability distribution of the interaction energies, P(E A ), of this protein with a set of target proteins, where E A is the interaction energy between protein A and a protein from the target set. Now we can compare the promiscuity of two proteins A and B interacting with the same target set, assuming that their average interaction energies are the same, E A = E B. This latter target set is not supposed to be optimized in any way for stronger binding with either protein A or protein B. If the dispersion, σ A, of P(E A ) is greater 1
2 than the dispersion, σ B, of P(E B ), then protein A is statistically more promiscuous than protein B. This is because if σ A > σ B, the distribution of the minimal interaction energies (the extreme value distribution), P min (E A ), will be always shifted towards lower energies as compared with P min (E B ) [8]. The assumption that E A = E B corresponds to the constraint that the sequences of two proteins A and B have the same average amino acid composition (see below). The latter constraint is necessary for a fare comparison of promiscuities, since the differences in the average amino acid composition would produce a trivial shift of the average interaction energies. The predicted effect is induced exclusively by sequence correlations and goes beyond the mean-field. We introduce now a simple model for random and designed (or correlated ) protein-like, linear sequences. Despite its one-dimensional origin, the model is not exactly solvable because of the generally long-range nature of the potentials that we use below. For simplicity we use a minimalistic sequence alphabet with two types of residues only. A random sequence is obtained by distributing N p polar and N h hydrophobic residues at random within the linear sequence of the total length L = N p + N h. Our simplistic approach therefore does not take into account the folding of the sequence. The average liner fraction of polar and hydrophobic residues is thus fixed and given by φ p,0 = N p /L and φ h,0 = N h /L, respectively. After each random sequence is generated, the residues are fixed and not allowed to change their positions. A correlated sequence is obtained using the following stochastic, Monte-Carlo (MC) procedure. First, we generate a random sequence as described above. Second, we allow residues to anneal at a given design temperature,. We note that our notion of the designed sequences stands to describe the existence of positional correlations of amino acids within the linear sequences and not the folding. We thus impose that the residues within the sequence under the design procedure interact through the pair-wise additive design potential, U αβ (x). The intra-sequence interaction energy for any given residue distribution is thus: E intra = 1 2 φ p (x)u pp (x x )φ p dx d x φ h(x)u hh (x x )φ h dx d x Eq. (1) + φ h (x)u hp (x x )φ p dx d x where φ p (x) and φ h (x) are the local, linear fraction densities of polar and hydrophobic residues, respectively. The average composition of polar and hydrophobic residues is fixed by 2
3 the values, φ p,0 and φ h,0, respectively, and we impose that the total fraction of residues at each sequence position, x, is unity, φ p (x) + φ h (x) = 1. Here U pp (x), U hh (x), and U hp (x) is the interaction potential between polar-polar, hydrophobic-hydrophobic, and hydrophobic-polar residues, respectively. We also note that φ p (x) can be represented in the form: φ p (x) = φ p,0 + δφ p (x), where δφ p (x) is the deviation of the local density of polar residues from its average value, and analogously, φ h = φ h,0 + δφ h (x). The only two assumptions about the interaction potentials, U αβ (x), used in the sequence design procedure are that they are pairwise additive and have a finite range of action. Our next step is to analyze the probability distribution P(E) of the interaction energy, E, between the random and correlated sequences. Every pair of interacting sequences thus consists of one random and one correlated sequence superimposed in a parallel configuration, thus the problem is a quasi one-dimensional one. We show below that the enhanced correlations between amino acids of the same type lead to the broadening of the distribution P(E). Such broadening implies that the corresponding extreme value distribution (EVD) will always be shifted to lower energies for stronger correlated sequences [8]. The latter property implies that such correlated sequences will be more promiscuous, i.e. statistically prone to a stronger binding with an arbitrary sequence. We use an ensemble of entirely random sequences as a proxy for an ensemble of arbitrary protein sequences. The interaction energy between the random and correlated sequences is the following: E = ν p (x)v pp (x x )φ p dx d x + ν h (x)v hh (x x )φ h dx d x Eq. (2) + ν h (x)v hp (x x )φ p dx d x + ν p (x)v hp (x x )φ h dx d x where ν p (x) and ν h (x) are the local, linear fraction densities of polar and hydrophobic residues, respectively, within the random sequence, and here again ν p (x) + ν h (x) = 1, and ν p (x) = φ p,0 + δν p (x) with δν p (x) being the deviation of the polar residue density from its average value, and analogously for ν h (x) = φ h,0 + δν h (x). We thus assume that the average amino acid composition is the same for random and correlated sequences. We emphasize that the inter-sequence interaction potentials, V pp (x), V hh (x), and V hp (x) need not be identical to the potentials U pp (x), U hh (x), and U hp (x) used in the sequence design procedure. We will describe below in details the influence of the potentials V αβ (x) and U αβ (x) on the properties of P(E). 3
4 The probability distribution for the interaction energies between the random and correlated sequences, P(E), is characterized by its mean, E, and by the variance. The mean, E, is independent on the design potential, U αβ (x), and therefore all the different distributions P(E) obtained at different values of the design temperature,, will have exactly the same mean. The variance of P(E) is σ 2 = ( δe 2 ) 2, where the only relevant term for the averaging is quadratic in the sequence density fluctuations: where V (x) = V pp (x) + V hh (x) 2V ph (x), and V ˆ (k) = δe 2 = δν p (x)v (x x ) δφ p ( x ) dx d x, Eq. (3) V (x)e ikx dx. The averaging in σ is performed using the Boltzmann probability distribution function for the sequence density fluctuations of the correlated (i.e. designed) sequences that has the following form: P d [δφ p (x)] = C 1 exp δφ 2 p(x) dx exp( E 2φ p,0 φ intra /k B ), Eq. (4) h,0 where is the design temperature, and k B is the Boltzmann constant. The first exponential term in Eq. (4) is the entropic contribution [8] due to the sequence density fluctuations of the designed sequences, and the second exponential term represents the strength of the correlations within the designed sequences. The corresponding probability distribution for the density fluctuations of the random sequences contains only the entropic contribution: P r [δν p (x)] = C 2 exp δν 2 p (x) dx. Eq. (5) 2φ p,0 φ h,0 The constants C 1 and C 2 in Eqs. (4) and (5), are found from the normalization constrains applied on the probability distributions. The averaging leads to the following result: where Û(k) = dk σ 2 = 4Lφ p,0 φ h,0 ˆV(k) 2 1 2π 1 / φ p,0 φ h,0 + Û(k) / k T, Eq. (6) B d U(x)eikx dx, and U(x) = U pp (x) + U hh (x) 2U ph (x). The larger σ (and thus the broader the distribution P(E) of the interaction energies between the correlated and random sequences), the more promiscuous are the correlated sequences. We note that our model is only solvable analytically in the Gaussian approximation, and not exactly solvable, unlike the one-dimensional Ising model, due to the generally long-range nature of the intra-sequence ( design ) potential, U(x), and the inter-sequence potential, V(x). We also note the existence 4
5 of the singularity in Eq. (6) at sufficiently large and negative values of the design potential, U(x), when the Gaussian fluctuation model breaks down. The analysis of Eq. (6) leads to the two key conclusions. First, the more negative is the design potential, U(x), the larger is σ. Taking into account the definition of U = U pp + U hh 2U ph, one concludes that in order to increase σ one needs to design the sequences with the enhanced correlations in the positions between the residues of similar types. This means that correlated sequences where amino acids of the same type are clustered together will be the more promiscuous ones. Second, such correlated sequences will interact statistically stronger (than non-correlated sequences would do) with any arbitrary sequences independently on the sign of the inter-residue interaction potential, V = V pp + V hh 2V ph. Third, if the design potential is overall positive, U > 0, designed sequences will be even less promiscuous than random sequences. We emphasize that the predicted effects are generic and qualitatively independent on the specific form and even sign of the microscopic interaction potentials, V αβ, and on the average amino acid composition of the sequences. We note that the predicted effect gets even stronger when both interacting sequences are designed (i.e. correlated). In the latter case the variance, σ d,d, of the corresponding P(E) is a straightforward generalization of Eq. (6): 2 σ d,d = 4L dk 2π ˆV(k) 2 1 (1 / φ p,0 φ h,0 + Û1 (k) / k B 1 )(1 / φ p,0φ h,0 + Û2 (k) / k B 2 ), Eq. (7) where U 1 (x) and U 2 (x) are defined analogously to U(x) for each of the interacting sequences; and 1 and 2 are the design temperatures for the first and second sequence, respectively. If both design potentials, U 1 (x) and U 2 (x), are overall negative, then σ d,d > σ, and thus in the latter case the sequences will be statistically more promiscuous than in the case when only one of the interacting sequences is designed (Eq. (6)). We stress that the interacting sequences are designed independently and not optimized in any way towards a stronger binding. Therefore the observed effect of statistically enhanced binding corresponds to the non-specific (promiscuous) binding. In order to verify our theoretical predictions, we first perform the standard MC annealing procedure [9] to design the correlated sequences. We begin with generating a random sequence with a given amino acid composition. In the computations described below we used a uniform sequence composition with 50% polar and 50% hydrophobic residues. Qualitatively, the results are valid for an arbitrary amino acid composition. We next perform 5
6 the MC stochastic design procedure, where the residues within the sequence are allowed to exchange their positions, and each sequence configuration has the Boltzmann weight, ~ exp( E intra /k B ), where E intra is the internal energy of the sequence in a given configuration given by Eq. (1). The MC design procedure is stopped after a certain number of MC moves, and the resulting annealed configuration is accepted as the final, designed configuration for a given sequence. The lower is, the stronger are the correlations within the sequences. Intuitively, stronger correlations correspond to repetitive sequence patterns with a longer correlation length. The properties of the correlated patterns depend critically on the sign of the interaction potentials U αβ (x) used in the design procedure. If the effective design potential U = U pp + U hh 2U hp is overall negative (this corresponds to the attraction between the amino acids of similar types), the correlated patterns will have the form of repetitive residues of the same type, for example: HHHHPPPPHHHPPP If however, the potential U = U pp + U hh 2U hp is overall positive, the correlated patterns will have the form of the alternating hydrophobic and polar residues, for example: HPHPHPHPHPHPHP To characterize the correlation properties of the sequences quantitatively, we introduce the normalized correlation function: r η αβ (x) = g αβ (x) / g αβ (x), Eq. (8) r where g αβ (x) is proportional to the probability to find a residue of the type α separated by the r distance x from a residue of the type β, and g αβ (x) is the corresponding probability for the r randomized sequence, and g αβ (x) r corresponds to the averaging with respect to different realizations of randomized sequences. The computed correlation functions are represented in Fig. 1 at the value of k B = 2, and the insert shows the analogous calculations for a lower value of the design temperature, k B = 1 (in the units of k B T ). For the entirely uncorrelated (random) sequences, all the matrix elements of η αβ (x) are equal to unity, Fig. 1. The clustering of the residues of a similar type corresponds to η αα (x) > 1, Fig. 1. The next step is to compute numerically the properties of the probability distribution, P(E), of the interaction energies, E, between random and designed sequences (i.e. each interacting pair consists of a random and designed sequences). The results of these calculations are shown in Fig. 2. We computed P(E) at different values of and we represented the results as a ratio between the dispersion of P(E), σ = σ d, r and the dispersion of the corresponding probability distribution where both sequences are entirely random, σ r, r 6
7 (the latter corresponds to the case of a vanishing design potential, U αβ = 0). We used here the inter-residue interaction potential, V pp = V hh = 1, and V hp = 1, and we assumed that the nearest neighbor and the next-nearest neighbor amino acids can interact between the two sequences. The analytical result computed from Eq. (6) is also plotted in Fig. 2. As expected, the Gaussian fluctuation model becomes accurate at small values of the ratio, U(a) / k B 1, where a is the potential range. The insert of Fig. 2 shows the computed P(E) in the case of designed-random and random-random sequence pairs, respectively. The key conclusion here is that in accordance with the analytical predictions, the dispersion of P(E) is larger for the sequences designed with the overall negative U, as compared to the dispersion of P(E) in the case where both interacting sequences are entirely random, σ d,r > σ r,r. We stress that the latter result is qualitatively insensitive to the sign of the inter-residue interaction potential, V(x). We emphasize also that the interaction energy mean-value, E, is identical in the two cases. To summarize our results qualitatively, correlated sequences of the type HHHHHPPPPPPHHHHHPPPP, where amino acids of the same type are clustered together will bind statistically stronger to an arbitrary target sequence set, compared to either random sequences, or correlated sequences of the type HPHPHPHPHPHPHPHPHP. The sequences possessing the latter symmetry of correlations will be the least promiscuous ones. This effect is qualitatively robust with respect to the specific form and even sign of the microscopic intersequence interaction potential. Despite the one-dimensional nature of our model, its results are directly applicable to protein-protein interaction networks since the most recent, wholeorganism experimental and bioinformatics data suggest that 15-40% of all protein-protein interactions are mediated by linear sequence motifs, and not by large protein surfaces [10]. Still our key objective for the future theoretical analysis is to take into account the effect of protein folding. There are several possible strategies to test our predictions. The direct experimental test would utilize the protein chip [11] or microfluidic protein chip [12] technology. The target protein data set would be attached to the chip surface. The test proteins or peptides would be synthesized with a varying strength and symmetry of sequence correlations but keeping the average amino acid composition fixed. Titration experiments would allow measuring directly the binding affinity [11, 12] as a function of sequence correlation properties. Another possibility is to use the existing high-throughput protein-protein binding data [3-6], and to compare sequence correlation properties of multi-specific and mono-specific proteins [13]. 7
8 Yet another possibility is to use the recent whole-genome protein over-expression analysis [14]. Since the over-expression of highly promiscuous proteins should presumably be toxic to a cell, the correlation analysis of such toxic proteins (hundreds of them are known [14]) will show whether the predicted effect plays a significant role in a living cell [15]. Acknowledgements We thank Amir Aharoni, Gilad Haran, Nikolaus Rajewsky, Irit Sagi, Eugene Shakhnovich, and Dan Tawfik for helpful discussions. This work is supported by the Israel Science Foundation (ISF). References [1] O. Khersonsky, and D. S. Tawfik, Annu Rev Biochem 79, 471 (2010). [2] I. Nobeli, A. D. Favia, and J. M. Thornton, Nat Biotechnol 27, 157 (2009). [3] N. N. Batada et al., PLoS Biol 4, e317 (2006). [4] G. Butland et al., Nature 433, 531 (2005). [5] P. Hu et al., PLoS Biol 7, e96 (2009). [6] J. F. Rual et al., Nature 437, 1173 (2005). [7] U. Stelzl et al., Cell 122, 957 (2005). [8] D. B. Lukatsky, K. B. Zeldovich, and E. I. Shakhnovich, Phys Rev Lett 97, (2006). [9] D. Frenkel, and B. Smit, Understanding molecular simulation : from algorithms to applications (Academic Press, San Diego, 2002), pp. xxii. [10] J. R. Perkins et al., Structure 18, 1233 (2010). [11] A. Wolf Yadlin, M. Sevecka, and G. MacBeath, Curr Opin Chem Biol 13, 398 (2009). [12] D. Gerber, S. J. Maerkl, and S. R. Quake, Nat Methods 6, 71 (2009). [13] A. Afek, and D. B. Lukatsky, (in preparation). [14] T. Vavouri et al., Cell 138, 198 (2009). [15] D. S. Tawfik, (private communication). 8
9 η pp (x)=η hp (x), random sequence η hp (x), =2 η pp (x)=η hh (x), =2 1.2 η (x) = x Figure 1: Computed sequence correlation functions for the designed sequences, η pp (x) = η hh (x) (red squares), η ph (x) = η hp (x) (blue diamonds); and for the random sequences, (black circles). All the matrix elements of η αβ (x) are the same for the random sequences. The design potential was chosen to be U pp = U hh = 1, and U hp = 1, and we assumed that only the nearest-neighbor residues can interact. The design temperature is = 2. The sequence length was chosen to be 200 amino acids, and we generated 5000 different sequences in each calculation. The plotted η αβ (x) represent the average over the entire set of the designed sequences. The uniform amino acid composition was adopted: 50% polar (p) and 50% hydrophobic (h) residues in each sequence. The error bars are smaller than the symbol size. Insert: an analogous computation as in the main figure but at a lower design temperature, = 1 (the design temperature is in the units of k B T ). 9
10 random-random 0.2 σ d, r / σ r, r P(E) B designed-random T d= E, k T T, k B Figure 2: Computed ratio between the dispersions of the P(E) for the interaction energies of the designed-random, σ = σ d, r, and random-random, σ = σ r, r, sequence pairs at different values of the design temperature, (circles). The error bars are smaller than the symbol size. The uniform amino acid composition was adopted: 50% polar (p) and 50% hydrophobic (h) residues in each sequence. Thin curve represents the corresponding analytical result, Eq. (6). Insert: Computed probability distribution function, P(E), for the interaction energies between the pairs of two random sequences (black), and pairs consisting each of a random and a designed sequences, where the designed sequences were generated at = 1 (red). The energy E is normalized per one residue. 10
A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics
JOURNAL OF CHEMICAL PHYSICS VOLUME 115, NUMBER 3 15 JULY 2001 A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics Dominik Gront Department of
More informationMONTE CARLO METHOD. Reference1: Smit Frenkel, Understanding molecular simulation, second edition, Academic press, 2002.
MONTE CARLO METHOD Reference1: Smit Frenkel, Understanding molecular simulation, second edition, Academic press, 2002. Reference 2: David P. Landau., Kurt Binder., A Guide to Monte Carlo Simulations in
More informationStatistical Physics of The Symmetric Group. Mobolaji Williams Harvard Physics Oral Qualifying Exam Dec. 12, 2016
Statistical Physics of The Symmetric Group Mobolaji Williams Harvard Physics Oral Qualifying Exam Dec. 12, 2016 1 Theoretical Physics of Living Systems Physics Particle Physics Condensed Matter Astrophysics
More informationTitle Super- and subcritical hydration of Thermodynamics of hydration Author(s) Matubayasi, N; Nakahara, M Citation JOURNAL OF CHEMICAL PHYSICS (2000), 8109 Issue Date 2000-05-08 URL http://hdl.handle.net/2433/50350
More informationBrownian motion and the Central Limit Theorem
Brownian motion and the Central Limit Theorem Amir Bar January 4, 3 Based on Shang-Keng Ma, Statistical Mechanics, sections.,.7 and the course s notes section 6. Introduction In this tutorial we shall
More informationEntropy production fluctuation theorem and the nonequilibrium work relation for free energy differences
PHYSICAL REVIEW E VOLUME 60, NUMBER 3 SEPTEMBER 1999 Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences Gavin E. Crooks* Department of Chemistry, University
More informationSuperparameterization and Dynamic Stochastic Superresolution (DSS) for Filtering Sparse Geophysical Flows
SP and DSS for Filtering Sparse Geophysical Flows Superparameterization and Dynamic Stochastic Superresolution (DSS) for Filtering Sparse Geophysical Flows Presented by Nan Chen, Michal Branicki and Chenyue
More informationTopology of Protein Interaction Network Shapes Protein Abundances and Strengths of Their Functional and Nonspecific Interactions
Topology of Protein Interaction Network Shapes Protein Abundances and Strengths of Their Functional and Nonspecific Interactions The Harvard community has made this article openly available. Please share
More informationAn Importance Sampling Algorithm for Models with Weak Couplings
An Importance ampling Algorithm for Models with Weak Couplings Mehdi Molkaraie EH Zurich mehdi.molkaraie@alumni.ethz.ch arxiv:1607.00866v1 [cs.i] 4 Jul 2016 Abstract We propose an importance sampling algorithm
More informationSupplementary Information. Overlap between folding and functional energy landscapes for. adenylate kinase conformational change
Supplementary Information Overlap between folding and functional energy landscapes for adenylate kinase conformational change by Ulrika Olsson & Magnus Wolf-Watz Contents: 1. Supplementary Note 2. Supplementary
More informationMany proteins spontaneously refold into native form in vitro with high fidelity and high speed.
Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate
More informationPhysics 116C The Distribution of the Sum of Random Variables
Physics 116C The Distribution of the Sum of Random Variables Peter Young (Dated: December 2, 2013) Consider a random variable with a probability distribution P(x). The distribution is normalized, i.e.
More informationPhase transitions and critical phenomena
Phase transitions and critical phenomena Classification of phase transitions. Discontinous (st order) transitions Summary week -5 st derivatives of thermodynamic potentials jump discontinously, e.g. (
More informationTitle Theory of solutions in the energy r of the molecular flexibility Author(s) Matubayasi, N; Nakahara, M Citation JOURNAL OF CHEMICAL PHYSICS (2003), 9702 Issue Date 2003-11-08 URL http://hdl.handle.net/2433/50354
More informationarxiv:cond-mat/ v1 [cond-mat.other] 4 Aug 2004
Conservation laws for the voter model in complex networks arxiv:cond-mat/0408101v1 [cond-mat.other] 4 Aug 2004 Krzysztof Suchecki, 1,2 Víctor M. Eguíluz, 1 and Maxi San Miguel 1 1 Instituto Mediterráneo
More informationProtein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche
Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its
More informationMultiple time step Monte Carlo
JOURNAL OF CHEMICAL PHYSICS VOLUME 117, NUMBER 18 8 NOVEMBER 2002 Multiple time step Monte Carlo Balázs Hetényi a) Department of Chemistry, Princeton University, Princeton, NJ 08544 and Department of Chemistry
More information1.5 Sequence alignment
1.5 Sequence alignment The dramatic increase in the number of sequenced genomes and proteomes has lead to development of various bioinformatic methods and algorithms for extracting information (data mining)
More informationUnderstanding temperature and chemical potential using computer simulations
University of Massachusetts Amherst ScholarWorks@UMass Amherst Physics Department Faculty Publication Series Physics 2005 Understanding temperature and chemical potential using computer simulations J Tobochnik
More informationFragmentation under the scaling symmetry and turbulent cascade with intermittency
Center for Turbulence Research Annual Research Briefs 23 197 Fragmentation under the scaling symmetry and turbulent cascade with intermittency By M. Gorokhovski 1. Motivation and objectives Fragmentation
More informationMagnets, 1D quantum system, and quantum Phase transitions
134 Phys620.nb 10 Magnets, 1D quantum system, and quantum Phase transitions In 1D, fermions can be mapped into bosons, and vice versa. 10.1. magnetization and frustrated magnets (in any dimensions) Consider
More informationStudy of the Magnetic Properties of a Lieb Core-Shell Nano-Structure: Monte Carlo Simulations
Study of the Magnetic Properties of a Lieb Core-Shell Nano-Structure: Monte Carlo Simulations S. Aouini, S. Ziti, H. Labrim,* and L. Bahmad,* Laboratoire de la Matière Condensée et Sciences Interdisciplinaires
More informationClusters and Percolation
Chapter 6 Clusters and Percolation c 2012 by W. Klein, Harvey Gould, and Jan Tobochnik 5 November 2012 6.1 Introduction In this chapter we continue our investigation of nucleation near the spinodal. We
More informationarxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001
Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department
More informationLECTURE 4 WORM ALGORITHM FOR QUANTUM STATISTICAL MODELS II
LECTURE 4 WORM ALGORITHM FOR QUANTUM STATISTICAL MODELS II LECTURE 4 WORM ALGORITHM FOR QUANTUM STATISTICAL MODELS II Path-integral for continuous systems: oriented closed loops LECTURE 4 WORM ALGORITHM
More informationMonte Caro simulations
Monte Caro simulations Monte Carlo methods - based on random numbers Stanislav Ulam s terminology - his uncle frequented the Casino in Monte Carlo Random (pseudo random) number generator on the computer
More informationStability Of Specialists Feeding On A Generalist
Stability Of Specialists Feeding On A Generalist Tomoyuki Sakata, Kei-ichi Tainaka, Yu Ito and Jin Yoshimura Department of Systems Engineering, Shizuoka University Abstract The investigation of ecosystem
More informationThe Phase Transition of the 2D-Ising Model
The Phase Transition of the 2D-Ising Model Lilian Witthauer and Manuel Dieterle Summer Term 2007 Contents 1 2D-Ising Model 2 1.1 Calculation of the Physical Quantities............... 2 2 Location of the
More informationMonte Carlo Methods in High Energy Physics I
Helmholtz International Workshop -- CALC 2009, July 10--20, Dubna Monte Carlo Methods in High Energy Physics CALC2009 - July 20 10, Dubna 2 Contents 3 Introduction Simple definition: A Monte Carlo technique
More informationDistance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11
Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Taking the advice of Lord Kelvin, the Father of Thermodynamics, I describe the protein molecule and other
More informationPressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster
University of Rhode Island DigitalCommons@URI Chemistry Faculty Publications Chemistry 2005 Pressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster Dubravko Sabo University
More informationarxiv:cond-mat/ v1 2 Feb 94
cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San
More informationarxiv: v1 [cond-mat.stat-mech] 7 Mar 2019
Langevin thermostat for robust configurational and kinetic sampling Oded Farago, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB EW, United Kingdom Department of Biomedical
More informationAndré Schleife Department of Materials Science and Engineering
André Schleife Department of Materials Science and Engineering Length Scales (c) ICAMS: http://www.icams.de/cms/upload/01_home/01_research_at_icams/length_scales_1024x780.png Goals for today: Background
More informationEffect of protein shape on multibody interactions between membrane inclusions
PHYSICAL REVIEW E VOLUME 61, NUMBER 4 APRIL 000 Effect of protein shape on multibody interactions between membrane inclusions K. S. Kim, 1, * John Neu, and George Oster 3, 1 Department of Physics, Graduate
More information3.320 Lecture 18 (4/12/05)
3.320 Lecture 18 (4/12/05) Monte Carlo Simulation II and free energies Figure by MIT OCW. General Statistical Mechanics References D. Chandler, Introduction to Modern Statistical Mechanics D.A. McQuarrie,
More informationMonte Carlo Simulations of Protein Folding using Lattice Models
Monte Carlo Simulations of Protein Folding using Lattice Models Ryan Cheng 1,2 and Kenneth Jordan 1,3 1 Bioengineering and Bioinformatics Summer Institute, Department of Computational Biology, University
More informationImportance Sampling in Monte Carlo Simulation of Rare Transition Events
Importance Sampling in Monte Carlo Simulation of Rare Transition Events Wei Cai Lecture 1. August 1, 25 1 Motivation: time scale limit and rare events Atomistic simulations such as Molecular Dynamics (MD)
More informationMonte Carlo. Lecture 15 4/9/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky
Monte Carlo Lecture 15 4/9/18 1 Sampling with dynamics In Molecular Dynamics we simulate evolution of a system over time according to Newton s equations, conserving energy Averages (thermodynamic properties)
More informationAny live cell with less than 2 live neighbours dies. Any live cell with 2 or 3 live neighbours lives on to the next step.
2. Cellular automata, and the SIRS model In this Section we consider an important set of models used in computer simulations, which are called cellular automata (these are very similar to the so-called
More informationStatistical Mechanics of Active Matter
Statistical Mechanics of Active Matter Umberto Marini Bettolo Marconi University of Camerino, Italy Naples, 24 May,2017 Umberto Marini Bettolo Marconi (2017) Statistical Mechanics of Active Matter 2017
More informationDistributed Estimation, Information Loss and Exponential Families. Qiang Liu Department of Computer Science Dartmouth College
Distributed Estimation, Information Loss and Exponential Families Qiang Liu Department of Computer Science Dartmouth College Statistical Learning / Estimation Learning generative models from data Topic
More informationElectrons in a periodic potential
Chapter 3 Electrons in a periodic potential 3.1 Bloch s theorem. We consider in this chapter electrons under the influence of a static, periodic potential V (x), i.e. such that it fulfills V (x) = V (x
More informationLos Alamos IMPROVED INTRA-SPECIES COLLISION MODELS FOR PIC SIMULATIONS. Michael E. Jones, XPA Don S. Lemons, XPA & Bethel College Dan Winske, XPA
, LA- U R-= Approved for public release; distribution is unlimited. m Title: Author(s) Submitted tc IMPROED INTRA-SPECIES COLLISION MODELS FOR PIC SIMULATIONS Michael E. Jones, XPA Don S. Lemons, XPA &
More information6 Hydrophobic interactions
The Physics and Chemistry of Water 6 Hydrophobic interactions A non-polar molecule in water disrupts the H- bond structure by forcing some water molecules to give up their hydrogen bonds. As a result,
More informationPlug-in Measure-Transformed Quasi Likelihood Ratio Test for Random Signal Detection
Plug-in Measure-Transformed Quasi Likelihood Ratio Test for Random Signal Detection Nir Halay and Koby Todros Dept. of ECE, Ben-Gurion University of the Negev, Beer-Sheva, Israel February 13, 2017 1 /
More informationElastic constants and the effect of strain on monovacancy concentration in fcc hard-sphere crystals
PHYSICAL REVIEW B 70, 214113 (2004) Elastic constants and the effect of strain on monovacancy concentration in fcc hard-sphere crystals Sang Kyu Kwak and David A. Kofke Department of Chemical and Biological
More informationMonte Carlo simulation of proteins through a random walk in energy space
JOURNAL OF CHEMICAL PHYSICS VOLUME 116, NUMBER 16 22 APRIL 2002 Monte Carlo simulation of proteins through a random walk in energy space Nitin Rathore and Juan J. de Pablo a) Department of Chemical Engineering,
More informationRate Constants from Uncorrelated Single-Molecule Data
646 J. Phys. Chem. B 001, 105, 646-650 Rate Constants from Uncorrelated Single-Molecule Data Marián Boguñá, Lisen Kullman, Sergey M. Bezrukov,, Alexander M. Berezhkovskii,, and George H. Weiss*, Center
More information8.334: Statistical Mechanics II Problem Set # 4 Due: 4/9/14 Transfer Matrices & Position space renormalization
8.334: Statistical Mechanics II Problem Set # 4 Due: 4/9/14 Transfer Matrices & Position space renormalization This problem set is partly intended to introduce the transfer matrix method, which is used
More informationProtein Mistranslation is Unlikely to Ease a Population s Transit across a Fitness Valley. Matt Weisberg May, 2012
Protein Mistranslation is Unlikely to Ease a Population s Transit across a Fitness Valley Matt Weisberg May, 2012 Abstract Recent research has shown that protein synthesis errors are much higher than previously
More informationFree energy recovery in single molecule experiments
Supplementary Material Free energy recovery in single molecule experiments Single molecule force measurements (experimental setup shown in Fig. S1) can be used to determine free-energy differences between
More informationMonte Carlo simulation of confined water
Monte Carlo simulation of confined water Author: Guillermo Cámbara Ruiz Advisor: Giancarlo Franzese Facultat de Física, Universitat de Barcelona, Diagonal 645, 08028 Barcelona, Spain. Abstract: In living
More informationComputer simulation methods (1) Dr. Vania Calandrini
Computer simulation methods (1) Dr. Vania Calandrini Why computational methods To understand and predict the properties of complex systems (many degrees of freedom): liquids, solids, adsorption of molecules
More informationTriangular Lattice Foldings-a Transfer Matrix Study.
EUROPHYSICS LETTERS Europhys. Lett., 11 (2)) pp. 157-161 (1990) 15 January 1990 Triangular Lattice Foldings-a Transfer Matrix Study. Y. KANT~R(*) and M. V. JARIC(**) (*) School of Physics and Astronomy,
More informationMolecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror
Molecular dynamics simulation CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror 1 Outline Molecular dynamics (MD): The basic idea Equations of motion Key properties of MD simulations Sample applications
More informationPhysics Letters A 375 (2011) Contents lists available at ScienceDirect. Physics Letters A.
Physics Letters A 375 (2011) 318 323 Contents lists available at ScienceDirect Physics Letters A www.elsevier.com/locate/pla Spontaneous symmetry breaking on a mutiple-channel hollow cylinder Ruili Wang
More informationEffect of surfactant structure on interfacial properties
EUROPHYSICS LETTERS 15 September 2003 Europhys. Lett., 63 (6), pp. 902 907 (2003) Effect of surfactant structure on interfacial properties L. Rekvig 1 ( ), M. Kranenburg 2, B. Hafskjold 1 and B. Smit 2
More informationarxiv:cond-mat/ v4 [cond-mat.dis-nn] 23 May 2001
Phase Diagram of the three-dimensional Gaussian andom Field Ising Model: A Monte Carlo enormalization Group Study arxiv:cond-mat/488v4 [cond-mat.dis-nn] 3 May M. Itakura JS Domestic esearch Fellow, Center
More informationSolving the Schrödinger equation for the Sherrington Kirkpatrick model in a transverse field
J. Phys. A: Math. Gen. 30 (1997) L41 L47. Printed in the UK PII: S0305-4470(97)79383-1 LETTER TO THE EDITOR Solving the Schrödinger equation for the Sherrington Kirkpatrick model in a transverse field
More informationImproved model of nonaffine strain measure
Improved model of nonaffine strain measure S. T. Milner a) ExxonMobil Research & Engineering, Route 22 East, Annandale, New Jersey 08801 (Received 27 December 2000; final revision received 3 April 2001)
More informationCHEM-UA 652: Thermodynamics and Kinetics
1 CHEM-UA 652: Thermodynamics and Kinetics Notes for Lecture 4 I. THE ISOTHERMAL-ISOBARIC ENSEMBLE The isothermal-isobaric ensemble is the closest mimic to the conditions under which most experiments are
More informationPhysics 115/242 Monte Carlo simulations in Statistical Physics
Physics 115/242 Monte Carlo simulations in Statistical Physics Peter Young (Dated: May 12, 2007) For additional information on the statistical Physics part of this handout, the first two sections, I strongly
More informationAnalysis of the ultrafast dynamics of the silver trimer upon photodetachment
J. Phys. B: At. Mol. Opt. Phys. 29 (1996) L545 L549. Printed in the UK LETTER TO THE EDITOR Analysis of the ultrafast dynamics of the silver trimer upon photodetachment H O Jeschke, M E Garcia and K H
More informationA Monte Carlo Implementation of the Ising Model in Python
A Monte Carlo Implementation of the Ising Model in Python Alexey Khorev alexey.s.khorev@gmail.com 2017.08.29 Contents 1 Theory 1 1.1 Introduction...................................... 1 1.2 Model.........................................
More informationNumerical Analysis of 2-D Ising Model. Ishita Agarwal Masters in Physics (University of Bonn) 17 th March 2011
Numerical Analysis of 2-D Ising Model By Ishita Agarwal Masters in Physics (University of Bonn) 17 th March 2011 Contents Abstract Acknowledgment Introduction Computational techniques Numerical Analysis
More informationThe Tangled Nature Model of Evolutionary Ecology: (Is the approach of Statistical Mechanics relevant to the New Ecology Systems Perspective project.
The Tangled Nature Model of Evolutionary Ecology: Is the approach of tatistical Mechanics relevant to the New Ecology ystems Perspective project. Institute for Mathematical ciences and Dept of Mathematics
More information1 Coherent-Mode Representation of Optical Fields and Sources
1 Coherent-Mode Representation of Optical Fields and Sources 1.1 Introduction In the 1980s, E. Wolf proposed a new theory of partial coherence formulated in the space-frequency domain. 1,2 The fundamental
More informationDepartment of Electrical and Electronic Engineering, Ege University, Bornova 3500, Izmir, Turkey
The effect of anisotropy on the absorption spectrum and the density of states of two-dimensional Frenkel exciton systems with Gaussian diagonal disorder I. Avgin a and D. L. Huber b,* a Department of Electrical
More informationToday: Fundamentals of Monte Carlo
Today: Fundamentals of Monte Carlo What is Monte Carlo? Named at Los Alamos in 1940 s after the casino. Any method which uses (pseudo)random numbers as an essential part of the algorithm. Stochastic -
More informationarxiv:nucl-th/ v1 12 Jun 2000
Microcanonical Lattice Gas Model for Nuclear Disassembly C. B. Das 1, S. Das Gupta 1 and S. K. Samaddar 2 1 Physics Department, McGill University, 3600 University St., Montréal, Québec Canada H3A 2T8 2
More informationProtein Structure Prediction, Engineering & Design CHEM 430
Protein Structure Prediction, Engineering & Design CHEM 430 Eero Saarinen The free energy surface of a protein Protein Structure Prediction & Design Full Protein Structure from Sequence - High Alignment
More informationLattice protein models
Lattice protein models Marc R. Roussel epartment of Chemistry and Biochemistry University of Lethbridge March 5, 2009 1 Model and assumptions The ideas developed in the last few lectures can be applied
More informationPhase behavior of a lattice protein model
JOURAL OF CHEMICAL PHYSICS VOLUME 118, UMBER 19 15 MAY 2003 Phase behavior of a lattice protein model icolas Combe a) and Daan Frenkel b) FOM Institute for Atomic and Molecular Physics, Kruislaan 407,
More informationExtending the Tools of Chemical Reaction Engineering to the Molecular Scale
Extending the Tools of Chemical Reaction Engineering to the Molecular Scale Multiple-time-scale order reduction for stochastic kinetics James B. Rawlings Department of Chemical and Biological Engineering
More informationStatistical Mechanics for the Truncated Quasi-Geostrophic Equations
Statistical Mechanics for the Truncated Quasi-Geostrophic Equations Di Qi, and Andrew J. Majda Courant Institute of Mathematical Sciences Fall 6 Advanced Topics in Applied Math Di Qi, and Andrew J. Majda
More informationSloppy Nuclear Energy Density Functionals: effective model optimisation. T. Nikšić and D. Vretenar
Sloppy Nuclear Energy Density Functionals: effective model optimisation T. Nikšić and D. Vretenar Energy Density Functionals the nuclear many-body problem is effectively mapped onto a one-body problem
More informationKinetic Monte Carlo. Heiko Rieger. Theoretical Physics Saarland University Saarbrücken, Germany
Kinetic Monte Carlo Heiko Rieger Theoretical Physics Saarland University Saarbrücken, Germany DPG school on Efficient Algorithms in Computational Physics, 10.-14.9.2012, Bad Honnef Intro Kinetic Monte
More informationTHE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION
THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure
More information3.320: Lecture 19 (4/14/05) Free Energies and physical Coarse-graining. ,T) + < σ > dµ
3.320: Lecture 19 (4/14/05) F(µ,T) = F(µ ref,t) + < σ > dµ µ µ ref Free Energies and physical Coarse-graining T S(T) = S(T ref ) + T T ref C V T dt Non-Boltzmann sampling and Umbrella sampling Simple
More informationSuperparameterization and Dynamic Stochastic Superresolution (DSS) for Filtering Sparse Geophysical Flows
Superparameterization and Dynamic Stochastic Superresolution (DSS) for Filtering Sparse Geophysical Flows June 2013 Outline 1 Filtering Filtering: obtaining the best statistical estimation of a nature
More informationClassical Monte Carlo Simulations
Classical Monte Carlo Simulations Hyejin Ju April 17, 2012 1 Introduction Why do we need numerics? One of the main goals of condensed matter is to compute expectation values O = 1 Z Tr{O e βĥ} (1) and
More information3D HP Protein Folding Problem using Ant Algorithm
3D HP Protein Folding Problem using Ant Algorithm Fidanova S. Institute of Parallel Processing BAS 25A Acad. G. Bonchev Str., 1113 Sofia, Bulgaria Phone: +359 2 979 66 42 E-mail: stefka@parallel.bas.bg
More informationProtein design: a perspective from simple tractable models Eugene I Shakhnovich
Review R45 Protein design: a perspective from simple tractable models Eugene I Shakhnovich Recent progress in computational approaches to protein design builds on advances in statistical mechanical protein
More informationBeyond Wiener Askey Expansions: Handling Arbitrary PDFs
Journal of Scientific Computing, Vol. 27, Nos. 1 3, June 2006 ( 2005) DOI: 10.1007/s10915-005-9038-8 Beyond Wiener Askey Expansions: Handling Arbitrary PDFs Xiaoliang Wan 1 and George Em Karniadakis 1
More informationGAMM-workshop in UQ, TU Dortmund. Characterization of fluctuations in stochastic homogenization. Mitia Duerinckx, Antoine Gloria, Felix Otto
GAMM-workshop in UQ, TU Dortmund Characterization of fluctuations in stochastic homogenization Mitia Duerinckx, Antoine Gloria, Felix Otto Max Planck Institut für Mathematik in den Naturwissenschaften,
More informationQuantum and classical annealing in spin glasses and quantum computing. Anders W Sandvik, Boston University
NATIONAL TAIWAN UNIVERSITY, COLLOQUIUM, MARCH 10, 2015 Quantum and classical annealing in spin glasses and quantum computing Anders W Sandvik, Boston University Cheng-Wei Liu (BU) Anatoli Polkovnikov (BU)
More informationChapter XXII The Covariance
Chapter XXII The Covariance Geometrical Covariogram Random Version : -Sets -Functions - Properties of the covariance Intrinsic Theory : - Variogram J. Serra Ecole des Mines de Paris ( 2000 ) Course on
More informationSequential Monte Carlo Methods for Bayesian Computation
Sequential Monte Carlo Methods for Bayesian Computation A. Doucet Kyoto Sept. 2012 A. Doucet (MLSS Sept. 2012) Sept. 2012 1 / 136 Motivating Example 1: Generic Bayesian Model Let X be a vector parameter
More informationPhysics 221A Fall 1996 Notes 16 Bloch s Theorem and Band Structure in One Dimension
Physics 221A Fall 1996 Notes 16 Bloch s Theorem and Band Structure in One Dimension In these notes we examine Bloch s theorem and band structure in problems with periodic potentials, as a part of our survey
More informationFolding of small proteins using a single continuous potential
JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 17 1 MAY 2004 Folding of small proteins using a single continuous potential Seung-Yeon Kim School of Computational Sciences, Korea Institute for Advanced
More informationVIII.B Equilibrium Dynamics of a Field
VIII.B Equilibrium Dynamics of a Field The next step is to generalize the Langevin formalism to a collection of degrees of freedom, most conveniently described by a continuous field. Let us consider the
More informationThe dynamics of small particles whose size is roughly 1 µmt or. smaller, in a fluid at room temperature, is extremely erratic, and is
1 I. BROWNIAN MOTION The dynamics of small particles whose size is roughly 1 µmt or smaller, in a fluid at room temperature, is extremely erratic, and is called Brownian motion. The velocity of such particles
More informationRenormalization Group for the Two-Dimensional Ising Model
Chapter 8 Renormalization Group for the Two-Dimensional Ising Model The two-dimensional (2D) Ising model is arguably the most important in statistical physics. This special status is due to Lars Onsager
More informationPART IV Spectral Methods
PART IV Spectral Methods Additional References: R. Peyret, Spectral methods for incompressible viscous flow, Springer (2002), B. Mercier, An introduction to the numerical analysis of spectral methods,
More informationProtein Folding Prof. Eugene Shakhnovich
Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)
More information1.1 A Scattering Experiment
1 Transfer Matrix In this chapter we introduce and discuss a mathematical method for the analysis of the wave propagation in one-dimensional systems. The method uses the transfer matrix and is commonly
More informationDilatancy Transition in a Granular Model. David Aristoff and Charles Radin * Mathematics Department, University of Texas, Austin, TX 78712
Dilatancy Transition in a Granular Model by David Aristoff and Charles Radin * Mathematics Department, University of Texas, Austin, TX 78712 Abstract We introduce a model of granular matter and use a stress
More informationarxiv:chem-ph/ v2 11 May 1995
A Monte Carlo study of temperature-programmed desorption spectra with attractive lateral interactions. A.P.J. Jansen Laboratory of Inorganic Chemistry and Catalysis arxiv:chem-ph/9502009v2 11 May 1995
More informationC E N T R. Introduction to bioinformatics 2007 E B I O I N F O R M A T I C S V U F O R I N T. Lecture 5 G R A T I V. Pair-wise Sequence Alignment
C E N T R E F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U Introduction to bioinformatics 2007 Lecture 5 Pair-wise Sequence Alignment Bioinformatics Nothing in Biology makes sense except in
More information