Foundations of biomaterials: Models of protein solvation

Similar documents
Models for dielectrics at the continuum and molecular scales

Using an Artificial Regulatory Network to Investigate Neural Computation

Objective: You will be able to justify the claim that organisms share many conserved core processes and features.

Aoife McLysaght Dept. of Genetics Trinity College Dublin

Genetic code on the dyadic plane

Edinburgh Research Explorer

UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS General Certifi cate of Education Advanced Subsidiary Level and Advanced Level

CHEMISTRY 9701/42 Paper 4 Structured Questions May/June hours Candidates answer on the Question Paper. Additional Materials: Data Booklet

In previous lecture. Shannon s information measure x. Intuitive notion: H = number of required yes/no questions.

Biology 155 Practice FINAL EXAM

Lecture IV A. Shannon s theory of noisy channels and molecular codes

Genetic Code, Attributive Mappings and Stochastic Matrices

A p-adic Model of DNA Sequence and Genetic Code 1

A modular Fibonacci sequence in proteins

The degeneracy of the genetic code and Hadamard matrices. Sergey V. Petoukhov

Mathematics of Bioinformatics ---Theory, Practice, and Applications (Part II)

Reducing Redundancy of Codons through Total Graph

A Minimum Principle in Codon-Anticodon Interaction

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Lect. 19. Natural Selection I. 4 April 2017 EEB 2245, C. Simon

A Mathematical Model of the Genetic Code, the Origin of Protein Coding, and the Ribosome as a Dynamical Molecular Machine

ATTRIBUTIVE CONCEPTION OF GENETIC CODE, ITS BI-PERIODIC TABLES AND PROBLEM OF UNIFICATION BASES OF BIOLOGICAL LANGUAGES *

Crystal Basis Model of the Genetic Code: Structure and Consequences

The genetic code, 8-dimensional hypercomplex numbers and dyadic shifts. Sergey V. Petoukhov

The Genetic Code Degeneracy and the Amino Acids Chemical Composition are Connected

Natural Selection. Nothing in Biology makes sense, except in the light of evolution. T. Dobzhansky

Analysis of Codon Usage Bias of Delta 6 Fatty Acid Elongase Gene in Pyramimonas cordata isolate CS-140

Three-Dimensional Algebraic Models of the trna Code and 12 Graphs for Representing the Amino Acids

THE GENETIC CODE INVARIANCE: WHEN EULER AND FIBONACCI MEET

Ribosome kinetics and aa-trna competition determine rate and fidelity of peptide synthesis

Proteins: Characteristics and Properties of Amino Acids

Models for dielectrics at the molecular and continuum scale

Abstract Following Petoukhov and his collaborators we use two length n zero-one sequences, α and β,

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Natural Selection. Nothing in Biology makes sense, except in the light of evolution. T. Dobzhansky

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Properties of amino acids in proteins

Get started on your Cornell notes right away

Digital biology: Relations between data-mining in biological sequences and physical chemistry

2013 Japan Student Services Origanization

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39

CODING A LIFE FULL OF ERRORS

In Silico Modelling and Analysis of Ribosome Kinetics and aa-trna Competition

Protein Structure Bioinformatics Introduction

Chemistry Chapter 22

Answers to Chapter 6 (in-text & asterisked problems)

Lecture 15: Realities of Genome Assembly Protein Sequencing

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

In Silico Modelling and Analysis of Ribosome Kinetics and aa-trna Competition

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Supplementary Information for

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Practical Bioinformatics

Digital Biology, Fall 2014 Project Suggestions and Reading List

C CH 3 N C COOH. Write the structural formulas of all of the dipeptides that they could form with each other.

Slide 1 / 54. Gene Expression in Eukaryotic cells

Chemistry Chapter 26

Peptides And Proteins

Exam I Answer Key: Summer 2006, Semester C

Recent Evidence for Evolution of the Genetic Code

arxiv: v2 [physics.bio-ph] 8 Mar 2018

The Trigram and other Fundamental Philosophies

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Protein Threading. Combinatorial optimization approach. Stefan Balev.

Structure Investigation of Fam20C, a Golgi Casein Kinase

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Amino Acids and Peptides

Translation. A ribosome, mrna, and trna.

NSCI Basic Properties of Life and The Biochemistry of Life on Earth

Introduction to Comparative Protein Modeling. Chapter 4 Part I

7.012 Problem Set 1 Solutions

DO NOT OPEN THE EXAMINATION PAPER UNTIL YOU ARE TOLD BY THE SUPERVISOR TO BEGIN

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

PROTEIN SYNTHESIS INTRO

3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies. 3. Evolution makes sense of homologies

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Physiochemical Properties of Residues

Fundamental mathematical structures applied to physics and biology. Peter Rowlands and Vanessa Hill

Basic Principles of Protein Structures

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Solutions and Non-Covalent Binding Forces

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

CHEMISTRY ATAR COURSE DATA BOOKLET

Protein Struktur (optional, flexible)

Transcription Attenuation

SYMMETRIES, GENERALIZED NUMBERS AND HARMONIC LAWS IN MATRIX GENETICS

Week 6: Protein sequence models, likelihood, hidden Markov models

A Plausible Model Correlates Prebiotic Peptide Synthesis with. Primordial Genetic Code

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Packing of Secondary Structures

NIH Public Access Author Manuscript J Theor Biol. Author manuscript; available in PMC 2009 April 21.

An algebraic hypothesis about the primeval genetic code

Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis

Details of Protein Structure

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

Advanced topics in bioinformatics

Model Mélange. Physical Models of Peptides and Proteins

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Transcription:

Foundations of biomaterials: Models of protein solvation L. Ridgway Scott The Institute for Biophysical Dynamics, The Computation Institute, and the Departments of Computer Science and Mathematics, The University of Chicago LRS Digital Biology 2014 1/47

Solvation in picomaterials Polar liquids play singular role in Protein interactions (water most important molecule in biology) Ionic solvation (ionic liquids) Environmental pollution (HCl hydrate) Energy production (methane hydrate) Batteries Photovoltaic cells Dielectric effect critical for protein interactions Picoscale models needed LRS Digital Biology 2014 2/47

Solvation models Modeling issues for solvation Solvents are mobile, not fixed in orientation Nonlocal effects (frequency dependence): high dimension Nonlinear models: do they achieve same results as nonlocal ones? Ionic effects: can size effects be modeled via continuum equations? E.g., Na Cl Algorithms to resolve nonlinear (e.g., ionic) interactions Need molecular scale models of solvation LRS Digital Biology 2014 3/47

Electrostatic modeling: ionic effects Protein sidechains have large electrostatic gradients Figure 1: Different models suggest different modes of interactions. Shown are two nonlocal dielectric models with (A) and without (B) ionic effects. LRS Digital Biology 2014 4/47

Water is complicated Water network from a molecular dynamics simulation LRS Digital Biology 2014 5/47

Dielectric effect of water (a) H O H (b) e O H H (a) Polarity of water molecule (b) Ability to rotate allows water to align with electric field e Result: water screens electric field LRS Digital Biology 2014 6/47

Competing effects Competing effects: why this is so hard Protein sidechains have large electrostatic gradients Water is a strong dielectric Hydrophobic groups modify the water structure Large electrostatic gradients Screening by dielectric effect Modulation of dielectric strength by hydrophobic effect Figure 2: Three competing effects that determine protein behavior. These conspire to weaken interactive forces, making biological relationships more tenuous and amenable to mutation. LRS Digital Biology 2014 7/47

Charges in a dielectric Charges in a dielectric are like lights in a fog. LRS Digital Biology 2014 8/47

Dielectric model Consider two charge distributions ρ (fixed charges) and γ (polar groups free to rotate). Resulting electric potential φ satisfies φ = ρ+γ, (1) where the dielectric constant of free space is set to one. Write φ = φ ρ +φ γ, where φ γ = γ and φ ρ = ρ. Ansatz of Debye [3]: the electric field e γ = φ γ is parallel to (opposing) the resulting electric field e = φ: Thus φ ρ = φ φ γ = ε φ and φ γ = (1 ε) φ. (2) (ε φ) = ρ. (3) LRS Digital Biology 2014 9/47

Polarization field and Debeye s Ansatz as projection Define p = φ γ : called the polarization field. Recall e = φ. Write p = (ǫ ǫ 0 )e+ζe, so that ǫ = ǫ 0 + p e e e, with the appropriate optimism that p = 0 when e = 0. That is, ǫ ǫ 0 reflects the correlation between p and e. As defined, ǫ is a function of r and t, and potentially singular. However, Debye postulated that a suitable average ǫ should be well behaved: ǫ = ǫ 0 + p e e e LRS Digital Biology 2014 10/47.

Interpretation of ε In bulk water ε is a (temperature-dependent) constant: ε 87.74 40.00τ +9.398τ 2 1.410τ 3, τ [0,1], (4) where τ = T/100 and T is temperature in Centigrade (for T > 0) [4]. ε >> 1: opposing field strength E γ = φ γ much greater than inducing field. ε increases with decreasing temperature; when water freezes, it increases further: for ice at zero degrees Centigrade, ε 92. Increased coherence yields increased dielectric LRS Digital Biology 2014 11/47

Model failure H H e H O O But model fails when the spatial frequencies of the electric field e = φ are commensurate with the size of a water molecule, since the water molecules cannot orient appropriately to align with the field. LRS Digital Biology 2014 12/47 H

Model fixes Manipulations leading to (3) valid when ε is an operator, even nonlinear. Frequency-dependent versions of ε have been proposed, and these are often called nonlocal models. The operator ε must be represented either as a Fourier integral (in frequency space), or as an integral in physical space with a nonlocal kernel [1, 6]. LRS Digital Biology 2014 13/47

Frequency dependence of dielectric constant Debye observed that the effective permittivity is frequency dependent: ǫ(ν) = ǫ 0 + ǫ 1 ǫ 0 1+τ 2 D ν2 (5) where τ D is characteristic time associated with dielectric material and ν is temporal wave number. Many experiments have verified this [5]: LRS Digital Biology 2014 14/47

Polar residues cause spatial high frequencies Glutamine Asparagine CH 2 CH 2 CH 2 Threonine Serine C C H C OH H C OH NH 2 O H 2 N O CH 3 H Some polar sidechains: ± charges at distance 2.2Å LRS Digital Biology 2014 15/47

Charged sidechains form salt bridge networks Arginine CH 2 Lysine Glutamic acid CH 2 CH 2 Aspartic acid CH 2 CH 2 CH 2 CH 2 NH CH 2 CH 2 COO NH 2 C NH + 2 O C O CH 2 NH + 3 Charged sidechains LRS Digital Biology 2014 16/47

Nonlocal models Survey of results from two papers: Xie Dexuan, Yi Jiang, Peter Brune, and L. Ridgway Scott. A fast solver for a nonlocal dielectric continuum model. SISC, 34(2):B107 B126, 2012. Xie Dexuan, Yi Jiang, and L. Ridgway Scott. Efficient algorithms for solving a nonlocal dielectric continuum model for protein in ionic solvent SISC, to appear LRS Digital Biology 2014 17/47

Nonlocal dielectric Linearized Poisson-Boltzmann equation: (ǫ Φ(r)) = ρ (6) In bulk dielectric, ǫ is a constant. But in general it depends on the frequency of Φ, e.g. where ǫf = f +K f (7) ˆK(ξ) = 1 1+ ξ 2. (8) So convolution with K is the inverse of a PDE. Can solve using a system of two PDEs. LRS Digital Biology 2014 18/47

Free energy differences Figure 3: Comparisons of analytical free energy differences calculated from the nonlocal dielectric model with two values of λ (λ = 15Å and λ = 30Å) and the values from chemical experiments. LRS Digital Biology 2014 19/47

Singularity resolution Step 0: Let G be the solution to G = ρ in all space: Find u 0 H 1 0(Ω) such that G(x) = q i x x i. a 0 (u 0,w) = l 0 (w) w H 1 0(Ω), (9) where l 0 and a 0 are linear and bilinear forms as defined by (λ > 0 is a model parameter) a 0 (u 0,w) = λ 2 u 0 wdr+ u 0 wdr, Ω Ω (10) l 0 (w) = G(r)w(r) dr. Ω LRS Digital Biology 2014 20/47

Nonlocal model with protein For = (Φ,u) and ṽ = (v 1,v 2 ), define φ a(φ,ṽ) = Ω ǫ(r) Φ(r) v 1 (r)dr +(ǫ s ǫ ) u(r) v 1 (r)dr D s +λ 2 u(r) v 2 (r)dr+ (u(r) Φ(r))v 2 (r)dr, Ω where ǫ p, ǫ s, ǫ and λ are constants, and { ǫ p, r D p (protein), ǫ(r) = ǫ, r D s (solvent), Ω (11) (12) LRS Digital Biology 2014 21/47

Nonlocal variational equations Find φ 1 = (Ψ,u 1 ),φ 2 = ( Φ,u 2 ) V such that a(φ1,v) = l 1 (v) v V, (13) a(φ2,ṽ) = l 2 (ṽ) ṽ V, where l 1 (ṽ) and l 2 (ṽ) are two linear forms as defined by l 1 (ṽ) = (ǫ ǫ s ) u 0 (r) v 1 (r)dr D p +(ǫ p ǫ ) G(r) v 1 (r)dr, (14) D s l 2 (ṽ) = 1 n q i c i (r)v 1 (r)dr. ǫ 0 D s i=1 LRS Digital Biology 2014 22/47

Surface mesh of BPTI The nonlocal model solution Φ is given by Φ = Ψ+ Ψ+G Key point: singularity G is added at the end. G appears in l 1 only on the solvent domain D s where there are no fixed charges. The auxiliary variables satisfy u = Φ Q λ,u 1 = Ψ Q λ,u 2 = Ψ Q λ u = u 0 +u 1 +u 2 LRS Digital Biology 2014 23/47

Surface mesh of BPTI LRS Digital Biology 2014 24/47

Mesh cross-section around BPTI LRS Digital Biology 2014 25/47

Surface electrostatics of BPTI Figure 4: Nonlocal model yields different potential at protein surface. LRS Digital Biology 2014 26/47

Nonlinear models Polarization field φ γ saturates for large fixed fields: lim (1 ε) φ = lim φ γ = C, (15) φ φ One simple model that satisfies (15) is ε(x) = ε 0 + for some constants ε 0, ε 1, and λ. ε 1 1+λ φ(x) (16) Both the nonlocal and nonlinear models of the dielectric response have the effect of representing frequency dependence of the dielectric effect. φ(x) provides a proxy for frequency content, although it will not reflect accurately high-frequency, low-power electric fields. Combination of nonlocal and nonlinear dielectric models may be needed. LRS Digital Biology 2014 27/47

Local model for dielectric effect? Hydrophobic (CH n ) groups remove water locally. This causes a reduction in ε locally. (Resulting increase in φ makes dehydrons sticky.) This can be quantified and used to predict binding sites. The placement of hydrophobic groups near an electrostatic bond is called wrapping. Like putting insulation on an electrical wire. (Wrapping modifies dielectric effect) We can see this effect on a single hydrogen bond. LRS Digital Biology 2014 28/47

Wrapping protects hydrogen bond from water CHn CHn CHn C O CHn CHn H N C CHn O CHn CHn H N H O O H H H Well wrapped hydrogen bond H O O H H H Underwrapped hydrogen bond LRS Digital Biology 2014 29/47

Extent of wrapping changes nature of hydrogen bond Hydrogen bonds (B) that are not protected from water do not persist. From De Simone, et al., PNAS 102 no 21 7535-7540 (2005) LRS Digital Biology 2014 30/47

Dynamics of dehydrons Dynamics of hydrogen bonds and wrapping 900 800 700 600 500 400 300 200 100 0 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 Figure 5: Distribution of bond lengths for two hydrogen bonds formed in a structure of the sheep prion [2]. Horizontal axis measured in nanometers, vertical axis represents numbers of occurrences taken from a simulation with 20,000 data points with bin widths of 0.1 Ångstrom. Distribution for the well-wrapped hydrogen bond (H3) has smaller mean value but a longer (exponential) tail, whereas distribution for the underwrapped hydrogen bond (H1) has larger mean but Gaussian tail. LRS Digital Biology 2014 31/47

Ligand binding removes water CHn CHn CHn CHn CHn CHn C O H N C O H N 00 11 000 111 0000 1111 0000 1111 0000 1111 000 111 H H O H O H L IG A N D Binding of ligand changes underprotected hydrogen bond (high dielectric) to strong bond (low dielectric) No intermolecular bonds needed! LRS Digital Biology 2014 32/47

Dehydrons Dehydrons in human hemoglobin, From PNAS 100: 6446-6451 (2003) Ariel Fernandez, Jozsef Kardos, L. Ridgway Scott, Yuji Goto, and R. Stephen Berry. Structural defects and the diagnosis of amyloidogenic propensity. Well-wrapped hydrogen bonds are grey, and dehydrons are green. The standard ribbon model of structure lacks indicators of electronic environment. LRS Digital Biology 2014 33/47

Mathematical explanation Charges ρ induce an electric field e = φ given by Energy = ρφdx (ǫ φ) = (ǫe) = ρ Hydrophobicity affects the operator ǫ: removing water reduces ǫ. When ǫ goes down, φ goes up. Hydrophilic groups contribute to the right-hand side ρ. Hydrophobicity and hydrophilic are orthogonal, not opposites. LRS Digital Biology 2014 34/47

Antibody binding to HIV protease The HIV protease has a dehydron at an antibody binding site. When the antibody binds at the dehydron, it wraps it with hydrophobic groups. LRS Digital Biology 2014 35/47

A model for protein-protein interaction Foot-and-mouth disease virus assembly from small proteins. LRS Digital Biology 2014 36/47

Dehydrons guide binding Dehydrons guide binding of component proteins VP1, VP2 and VP3 of foot-and-mouth disease virus. LRS Digital Biology 2014 37/47

Extreme interaction: amyloid formation Standard application of bioinformatics: look at distribution tails. If some is good, more may be better, but too many may be bad. Too many dehydrons signals trouble: the human prion. From PNAS 100: 6446-6451 (2003) Ariel Fernandez, Jozsef Kardos, L. Ridgway Scott, Yuji Goto, and R. Stephen Berry. Structural defects and the diagnosis of amyloidogenic propensity. LRS Digital Biology 2014 38/47

Genetic code Genetic code minimizes changes of polarity due to single-letter codon mutations, but it facilitates changes in wrapping due to single-letter codon mutations. First Position u c a g uuu uuc uua uug cuu cuc cua cug auu auc aua aug guu guc gua gug Phe 7 Leu 4 Leu 4 Ile Met 4 1 Val 3 + ucu ucc uca ucg ccu ccc cca ccg acu acc aca acg gcu gcc gca gcg Ser 0 + Pro 2 Thr 1 Ala 1 Second Position u c a + uau uac uaa uag cau cac caa cag aau aac aaa aag gau gac gaa gag Tyr 6 His 1 Gln 2 Asn 1 Lys 3 LRS Digital Biology 2014 39/47 stop stop Asp 1 Glu 2 g + ugu ugc Cys 0 + uga ugg stop Trp 7 + + cgu cgc + cga cgg Arg 2 + + + agu agc Ser 0 + + + aga agg Arg 2 + + ggu ggc gga ggg Gly 0 + First digit after residue name is amount of wrapping. Second indicator is polarity; : nonpolar, + : polar, : negatively charged, ++: positively charged. u c a g u c a g u c a g u c a g Third Position

Drug ligand wrapping Drug ligand provides additional non-polar carbonaceous group(s) in the desolvation domain, enhancing the wrapping of a hydrogen bond. LRS Digital Biology 2014 40/47

Desolvation spheres Desolvation spheres for flap Gly-49 Gly-52 dehydron containing nonpolar groups of the wrapping inhibitor. LRS Digital Biology 2014 41/47

Aligned paralogs Aligned backbones for two paralog kinases; dehydrons for Chk1 are marked in green and those for Pdk1 are in red. LRS Digital Biology 2014 42/47

Selective wrapper Dehydron Cys673-Gly676 in C-Kit is not conserved in its paralogs Bcr-Abl, Lck, Chk1 and Pdk1. By methylating Gleevec at the para position (1), the inhibitor becomes a selective wrapper of the packing defect in C-Kit. LRS Digital Biology 2014 43/47

Experimental confirmation Phosphorylation rates from spectrophotometric assay on the five kinases Bcr-Abl (blue), C-Kit (green), Lck (red), Chk1 (purple), and Pdk1 (brown) with Gleevec (triangles) and modified Gleevec methylated at positions (1) and (2) (squares). Notice the selective and enhanced inhibition of C-Kit. LRS Digital Biology 2014 44/47

Conclusions Some advances in solvation modeling Now tractable to compute nonlocal dielectric models Simple models (wrapping/dehydrons) give consistent predictions at picoscale Have been used to aid drug design LRS Digital Biology 2014 45/47

Thanks This talk was based on joint work with Peter Brune (Argonne Nat. Lab.), Yi Jiang and Dexuan Xie (UWisconsin-Milwaukee) Steve Berry (U. Chicago), Ariel Ferndandez (Calderon Inst., Argentina), Chris Fraser (Bioanalytical Computing), Kristina Rogale Plazonic (Princeton), Harold Scheraga (Cornell) We thank the Institute for Biophysical Dynamics at the University of Chicago and NSF for support. We are also grateful to the developers of the PDB, Viper, DIP, and other biological data bases. LRS Digital Biology 2014 46/47

References [1] P. A. Bopp, A. A. Kornyshev, and G. Sutmann. Static nonlocal dielectric function of liquid water. Physical Review Letters, 76:1280 1283, 1996. [2] Alfonso De Simone, Guy G. Dodson, Chandra S. Verma, Adriana Zagari, and Franca Fraternali. Prion and water: Tight and dynamical hydration sites have a key role in structural stability. Proceedings of the National Academy of Sciences, USA, 102:7535 7540, 2005. [3] P. Debye. Polar Molecules. Dover, New York, 1945. [4] John Barrett Hasted. Aqueous Dielectrics. Chapman and Hall, 1974. [5] U. Kaatze, R. Behrends, and R. Pottel. Hydrogen network fluctuations and dielectric spectrometry of liquids. Journal of Non-Crystalline Solids, 305(1):19 28, 2002. [6] L. Ridgway Scott, Mercedes Boland, Kristina Rogale, and Ariel Fernández. Continuum equations for dielectric response to macro-molecular assemblies at the nano scale. Journal of Physics A: Math. Gen., 37:9791 9803, 2004. LRS Digital Biology 2014 47/47