Structure Investigation of Fam20C, a Golgi Casein Kinase

Similar documents
Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Comparing crystal structure of M.HhaI with and without DNA1, 2 (PDBID:1hmy and PDBID:2hmy),

Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Supplementary Information

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Computational chemical biology to address non-traditional drug targets. John Karanicolas

SUPPLEMENTARY INFORMATION

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Journal of Pharmacology and Experimental Therapy-JPET#172536

Supplementary Figure S1. Urea-mediated buffering mechanism of H. pylori. Gastric urea is funneled to a cytoplasmic urease that is presumably attached

Oxygen Binding in Hemocyanin

Packing of Secondary Structures

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Signal Transduction Phosphorylation Protein kinases. Misfolding diseases. Protein Engineering Lysozyme variants

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Type II Kinase Inhibitors Show an Unexpected Inhibition Mode against Parkinson s Disease-Linked LRRK2 Mutant G2019S.

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

Portal. User Guide Version 1.0. Contributors

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

DOCKING TUTORIAL. A. The docking Workflow

Structural insights into energy regulation of light-harvesting complex from spinach CP29

Molecular Mechanics, Dynamics & Docking

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Docking. GBCB 5874: Problem Solving in GBCB

Exam I Answer Key: Summer 2006, Semester C

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

Geometrical Concept-reduction in conformational space.and his Φ-ψ Map. G. N. Ramachandran

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

7.012 Problem Set 1 Solutions

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp

L718Q mutant EGFR escapes covalent inhibition by stabilizing. a non-reactive conformation of the lung cancer drug. osimertinib

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Supplementary Materials for

Equilibrated atomic models of outward-facing P-glycoprotein and effect of ATP binding on structural dynamics (Supplementary Information)

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Binding Modes of Peptidomimetics Designed to Inhibit STAT3

Experimental and Computational Mutagenesis to Investigate the. Positioning of a General Base within an Enzyme Active Site

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

SUPPLEMENTARY FIGURES. Figure S1

*Corresponding Author *K. F.: *T. H.:

Supporting Information

Downloaded from:

Effect of intracellular loop 3 on intrinsic dynamics of human β 2 -adrenergic receptor. Ozcan et al.

Protein Structure Bioinformatics Introduction

Flexibility and Constraints in GOLD

Volume 12(2)

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Potential Energy (hyper)surface

From Amino Acids to Proteins - in 4 Easy Steps

Full wwpdb X-ray Structure Validation Report i

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Applications of Molecular Dynamics

Chemistry Chapter 22

Supporting Material for. Microscopic origin of gating current fluctuations in a potassium channel voltage sensor

Supplement information

Multi-scale approaches in description and design of enzymes

Lecture 15: Enzymes & Kinetics. Mechanisms ROLE OF THE TRANSITION STATE. H-O-H + Cl - H-O δ- H Cl δ- HO - + H-Cl. Margaret A. Daugherty.

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

Structural Insights from Molecular Dynamics. Simulations of Tryptophan 7-Halogenase and

Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery

Supplementary information for cloud computing approaches for prediction of ligand binding poses and pathways

What binds to Hb in addition to O 2?

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Saba Al Fayoumi. Tamer Barakat. Dr. Mamoun Ahram + Dr. Diala Abu-Hassan

Amino Acids and Peptides

Potassium channel gating and structure!

MOLECULAR DRUG TARGETS

Supporting Information. A Comprehensive Computational Study of the Interaction between Human Serum Albumin and Fullerenes

Any protein that can be labelled by both procedures must be a transmembrane protein.

Time-dependence of key H-bond/electrostatic interaction distances in the sirna5-hago2 complexes... Page S14

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Cooperativity and Specificity of Cys 2 His 2 Zinc Finger Protein-DNA Interactions: A Molecular Dynamics Simulation Study

Identifying Interaction Hot Spots with SuperStar

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis

Substrate Binding and Formation of an Occluded State in the Leucine Transporter

BCH 4053 Exam I Review Spring 2017

Structural Bioinformatics (C3210) Molecular Docking

Modeling the Relationship between the p53 C-Terminal Domain and Its Binding Partners Using Molecular Dynamics

Supplementary Information Intrinsic Localized Modes in Proteins

Water. Dr. Diala Abu-Hassan, DDS, PhD Lecture 2 MD summer Dr. Diala Abu-Hassan

Proteins: Characteristics and Properties of Amino Acids

Section Week 3. Junaid Malek, M.D.

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Mechanistic insight into inhibition of two-component system signaling

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

A Brief Guide to All Atom/Coarse Grain Simulations 1.1

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Goals. Structural Analysis of the EGR Family of Transcription Factors: Templates for Predicting Protein DNA Interactions

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supporting Information

Transcription:

Structure Investigation of Fam20C, a Golgi Casein Kinase Sharon Grubner National Taiwan University, Dr. Jung-Hsin Lin University of California San Diego, Dr. Rommie Amaro Abstract This research project focused on molecular dynamics simulation of Fam20C, a secreted Golgi casein kinase, which was found to target multiple proteins with the same motif. Mutations in Fam20C lead to osteoclerotic bone dysplasia in humans, also known as Raine syndrome. Raine syndrome is characterized by ectopic calcification which increases bone density and often causes death in neonatal stages. The broad aim of this research project was to investigate the structure and function of this unique secreted kinase, Fam20C. Several tools were used to analyze the structure of Fam20C once the 200ns simulation of the unbound form was complete: root mean square deviation and fluctuation (RMSD and RMSF) to study certain regions of the protein, bond lengths throughout the simulation to measure salt bridges in the binding pocket, VolMap VMD plug-in to calculate isovalue and water occupancy, GROMOS clustering, as well as FTMap and FTProd to analyze clusters of binding sites inside and outside of the pocket.

Introduction Fam20C is part of a secretory pathway protein kinase family that is in charge of phosphorylating casein, and was found to target multiple proteins with the motif Ser-x-Glu/pSer [ref 1]. Several unique features, such as the existence of disulfide bonds [ref 2] shows that Fam20C protein kinase may have diverged early on in evolution and is therefore distinct from the common known protein kinases. Mutations in FAM20C lead to osteoclerotic bone dysplasia in humans, also known as Raine syndrome. Raine syndrome is characterized by ectopic calcification which increases bone density and often causes death in neonatal stages. The unique structure and function of Fam20C, which have been unclear until that point, requires design of a distinct inhibitor that would be used as a tool to further study its mechanism and target sites. Objective The broad aim of this research project is to investigate the structure and function of a unique secreted Golgi casein kinase, FAM20C. Studying the structure and specifically the ATP binding pocket of Fam20C would allow to better understand its function, better control it for experimental purposes, and use that data for a later drug discovery project. Image 1. ADP-bound Golgi casein kinase, Fam20C. The kinase is shown in surface representation with ADP bound at its ATP binding pocket-- using the visualization program Visual Molecular Dynamics.

Methods 1. MD (Molecuar Dynamics) simulation The first step in preparing the Fam20C structure for molecular dynamics simulation was using PDB2PQR in order to approximate protonation at neutral ph of 7.0. The system was set with FF99SB Amber force field, xleap was used to solvate the system in a TIP3P explicit water in a cubic unit cell to a distance of 10Å around the kinase, and 19 Na + Cl - ions were added into the system to reach a final solution of 20mM. The next step before starting MD simulation was energy minimization in four steps which removed any interactions in the system: first only let hydrogens move for 5,000 steps, next also allow water and ions to move for 5,000 steps, third also allow side-chains to move for 10,000 steps, and lastly relax the system and allow everything to move for 25,000 steps. Following the energy minimization steps, the system was equilibrated in four steps of harmonic constraints, starting from 1.0, 0.75, 0.5, 0.25 and lastly allowing the whole system to move. After the minimization and equilibration steps, a 200ns free dynamics simulation was completed with 10 8 steps of 2fs. The crystal structure used in the MD simulation was based on the published structure (pdb ID: 4KQA) with a few mutations that would make it resemble to the human form of the kinase; the active site mutations include: Met171 mutated to Lys, Gln295 mutated to Ala, His388 modified so that the nitrogen close to the bound water will be protonated, a water molecule added in the back of the ATP binding pocket, and glycosylation sugars were removed. The trajectory DCD files have been concatenated using CATDCD in order to skip every 50 frames of the simulation. The above allowed working with Fam20C simulation more easily while maintaining credibility of the data.

2. Clustering RMSD-based clustering was calculated on Fam20C trajectory, using g_cluster algorithm in the Gromacs package, to extract 6 distinct conformation snapshots from the 200ns simulation. The clustering was defined by the active site residues in the ATP binding pocket: Lys171, Gly173- Gln176, Lys178, Val190, Gly192, Glu213, Ala295- Leu298, Arg306, His308, Glu371, Leu386 and Asp387. In order to produce 6 clusters from the simulation, the RMSD cutoff was set to 0.125 nm. The RMSD ranges found were from 0.0294271nm to 0.338968nm and the average RMSD was 0.110407nm. 3. Visual Molecular Dynamics (VMD) VMD was used to visualize Fam20C trajectory as well as calculate various aspect of it using plug-ins, such as: VolMap for water occupancy in the pocket, RMSD Trajectory Tool for bond length and RMSD calculation and alignment, and FTProd to analyze potential binding sites using FTMap product files. Results Several salt bridges interaction were measured between key residues in Fam20C ATP binding pocket: Lys192-Glu218 [figure 1A], Lys192-Glu213 [figure 1B], and Lys178-Glu213 [figure 1C]. These interactions were selected for inspection due to their significant roles in most types of kinase activity. Each of these bond lengths were measured throughout the 200ns simulation between the nitrogen of Lysine to all 3 oxygens of glutamate. The salt bridge interactions between the conserved residues analyzed ensures the coordination with the ADP phosphate in order to facilitate binding and activation of the kinase. The above analysis shows that Lys192-

Bond length (Å) Bond length (Å Glu218 salt bridge interaction, of less than 4Å, exists 71.8% of the simulation time, while Sharon Grubner Lys178-Glu213 salt bridge interactions exists 66.1% of the time [figure 2]. These results are interesting since they show that alternate conformation is possible between these pairs and could be exploited for drug design. The above observation also means that the binding pocket will have fluctuations in its electrostatics depending on which conformation the salt bridge is. Figure 1. A Interactions Throughout Trajectory of K192-E218 12 10 8 6 4 2 0 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 4000 Frame Bond length (Å) O Bond length (Å) OE1 Bond length (Å) OE2 B Interactions Throughout Trajectory of K192-E213 10 8 6 4 2 0 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 4000 Frame Bond length (Å) O Bond length (Å) OE1 Bond length (Å) OE2

Percentage of Simulation Time Distance (Å) Sharon Grubner C Interactions Throughout Trajectory of K178-E213 20 15 10 5 0 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 4000 Frame O OE1 OE2 Figure 2. Salt Bridge Interactions in Fam20C Binding Pocket 80.00% 70.00% 60.00% 50.00% 40.00% 30.00% 20.00% 10.00% 0.00% K192-E213 K192-E218 K178-E213 Interactions between p-loop and the C-terminal lobe The opening and closing of the ATP binding pocket was measured by the distances between the alpha carbons of the pairs Gly173-His308 and His174-His369 [figure 3]. The distance between the two loops fluctuates between 11.6Å and 21.1Å throughout the 200ns simulation. When looking at the ADP bound structure from the Fam20C crystal structure it is evident that the average distance between the alpha carbons of the P-loop and the C-terminal lobe is at

Distance (Å) average 14Å. Comparing this value with the inactive apo-form of Fam20C shows that the Sharon Grubner unbound form is often in and out of position, which can be exploited into design of inhibitors. To observe the fluctuation in the ATP binding pocket s two major loops- the P loop and the C terminal lobe- the RMSF values were calculated separately for both of the alpha carbons of these loops. It is shown in figure 4A and 4B that the fluctuation of the C-terminal lobe residues is overall higher than the P-loop residues fluctuation. Flexibility around the ATP binding pocket indicates it is accessible and allows maximization of the number of contacts between the ligand and the binding pocket residues during binding. Figure 3. 23 Distance Between P-loop and C-terminal Lobe 18 13 8 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 4000 Frame G173-H308 H174-H369

RMSF (Å) RMSF (Å) Sharon Grubner Figure 4. A RMSF of P-loop residues K171-K178 25 20 15 10 5 0 171 172 173 174 175 176 177 178 P loop residue B RMSF of C-terminal lobe A300-R306 25 20 15 10 5 0 300 301 302 303 304 305 306 C terminal lobe residues Hinging Phe297 ring typically sits over adenine of ATP when it binds to a region called hinge, and acts as a gate mechanism for the ATP binding pocket. The hinge residues that form hydrogen bonds to adenine are the backbone NH of Leu298 and the carbonyl oxygen of Phe297. The Phe297 ring seems to be moving quite a lot and shows flexibility as the binding pocket hinge [figure 6]. The movement of the hinge s ring allows to observe how flexible it is for the drug design purposes, allows to estimate the size of the pocket, how it changes and what can potentially bind to it, as well as the free energy contributions to binding that would need to be compensated. Since RMSD was calculated over only 2 residues, the plot fluctuates greatly but the overall RMSD value does not exceed 2.5Å [figure 5]. Comparing this plot with other 3 additional sets of RMSD

RMSD (Å) Sharon Grubner over two residues (not shown in figure) shows another evidence that these two residues are indeed the hinge. Looking at the crystal structure of the ADP bound Fam20C form, it was suspected that these two residues act as the hinge of the binding pocket. After looking at the abovementioned data it seems as the assumption was correct. Figure 5. 2.5 RMSD of Hinge Residues Phe297 and Leu 298 2 1.5 1 0.5 0 0 250 500 750 1000 1250 1500 1750 2000 2250 2500 2750 3000 3250 3500 3750 4000 Frame Figure 6. Frames 500, 1000, 2500 and 4000 aligned by CA Phe297 and Leu 298 highligthed (500 green, 1000 yellow, 2500 red, 4000 blue).

Clustering Clustering using g_cluster allows to find frames throughout the trajectory with unique protein conformations in order to analyze ligand binding at different states. The 200ns trajectory was aligned by the protein s alpha carbons using RMSD Trajectory Tool VMD plugin and clustered by the active site residues: Lys171, Gly173- Gln176, Lys178, Val190, Gly192, Glu213, Ala295- Leu298, Arg306, His308, Glu371, Leu386 and Asp387. Using the RMSD cutoff of 0.125nm, the following 6 clusters were found: the most populated cluster of middle frame 3003 with 3811 members, frame 86 with 134 members, frame 323 with 29 members, frame 2 with 13 members, frame 2547 with 12 members, and frame 517 with 11 members. All six of these clusters were sent to FTMap server for further analysis [ref 3]. FTMap is a server that analyzes the clusters found by g_cluster and determines hot spots ; certain regions on the protein binding surface, of drug targets that bind small ligands with higher affinity than other regions of the binding site. Regions that are identified as hot spots and overlap with other clusters are defined as consensus sites. FTMAP analyzes one snapshot at once and ranks the clusters based on average free energy that incorporate accurate calculations such as of electrostatics, and repulsive and attractive van der Waals interactions. By using FTProd it is possible to confirm the exact sites of binding in the binding pocket, as well as to find new potential binding sites on the protein surface outside of the pocket [ref 4]. In addition, it is also possible to characterize the different binding sites at different point of the simulation based on the ligands bound. The 6 pdb structures output from FTMap were uploaded to VMD and aligned by the protein. Next these structures were clustered by Average Link

method with a cutoff of 9Å. Finding consensus sites that appear in all 6 structures, shows they are likely to correspond to binding sites that are able to bind ligands throughout the entire simulation. If these sites exist only in one or two structures, it means they are transient and not frequent throughout the simulation. Looking at the binding sites around the hinge, a polar permanent interaction was found at the hinge as the highest rank in 4/6 of the frames, and ranked high in the other two frames. Polar interaction with the active site residues Gly173- Gly174 and Glu325- Gln327 from both sides simultaneously was also ranked second at 4/6 of the frames, while ranked first in frames 323 and 3003. At the salt bridge Lys192-Glu218 a relatively transient polar interaction was found in 3/6 frames. An additional analyzed binding site was around the P-loop, however a bit more transient, was observed in 4/6 frames; polar interaction around residues Val177- Lys178 was in 3/4 frames while the other interactions were relatively transient and not significant for analysis. The third and last dominant binding sites was opposite to the hinge, at the ADP pocket: polar interaction was found with Arg306 in 4/6 of the frames, and a transient polar interaction was seen around residue Phe209 in 2/6 of the frames. Water occupancy The ADP binding pocket was also analyzed by looking at water occupancy. Water occupancy at the ATP binding pocket was analyzed using VolMap plug-in in VMD by changing the isovalue range. Water occupancy analysis was done since areas with high water occupancy usually mean excellent candidates for drug binding sites, since high water occupancy implies water ordering which is entropically unfavorable. By looking at the different binding pocket residues it was found that there was water around the hinge residues Phe297 and Leu298 55% of the simulation

time, around the P-loop and more specifically residue Lys171 60-65% of the simulation time, around Arg367 70% of the time, and around residues 294 and 295 80% of the simulation time. Moreover, water was found around the salt bridge Lys178-Glu213 65% of the simulation time, and around the salt bridge Lys192-Glu218 80% of the time, which corresponds to the finding of it being a relatively permanent salt bridge throughout the simulation with 71.8%. Conclusions Fam20C, Golgi casein kinase, beyond having atypical kinase structure, has several distinct characteristics that require a design of a distinct inhibitor. Previously designed inhibitors for other kinases cannot inhibit and bind to the unique structure of the studied kinase. In most cases inhibitors compete with ATP binding in order to inhibit the activation of the kinase. The required inhibitor would have to mimic these interactions of ATP in the pocket- in terms of the interactions with the key residues Image 2. ADP-bound Golgi casein kinase, Fam20C. The kinase is shown in ribbon representation with ADP bound at its ATP binding pocket-- using the visualization program Visual Molecular Dynamics. that have salt bridges as well as the interaction with the hinge. Future directions of this research project include virtual screening of small molecule libraries in order to predict sets of compounds that may be new inhibitors. Once these compounds are defined, they can be docked onto Fam20C kinase using AutoDock to confirm their binding affinities. A list of compounds can be confirmed and ranked for experimental testing to produce an inhibitor tool for the studied Golgi casein kinase.

Acknowledgments I am grateful for the support of the PRIME team Dr. Gabriele Wienhausen, Dr. Peter Arzberger and Teri Simas. I thank Dr. Rommie Amaro, Dr. Victoria Feher and Lane Votapka for their help throughout the research project in Taiwan as well as in San Diego. Special thanks to Dr. Jung- Hsin Lin for being a great host at his lab at National Taiwan University and assisting as much as possible. Finally, I am grateful for Julia Brown Research Scholarship which allowed me to travel to Taiwan and have a life changing experience. References 1. Tagliabracci Vincent S., et al. (2012) Secreted Kinase Phosphorylates Extracellular Proteins That Regulate Biomineralization. Science 1 June 2012: Vol. 336 no. 6085 pp. 1150-1153. 2. Xiao Junyu, et al. (2013) Crystal Structure of the Golgi Casein Kinase. Proc Natl Acad Sci U S A. 2013 Jun 25;110(26):10574-9. http://www.ncbi.nlm.nih.gov/pubmed/23754375 3. Brenke Ryan, et al. (2009) Fragment-based identification of druggable hot spots of proteins using Fourier domain correlation techniques. Bioinformatics (2009) 25 (5): 621-627. http://bioinformatics.oxfordjournals.org/content/25/5/621.full 4. Votapka L, Amaro RE (2012) Multistructural hot spot characterization with FTProd. Bioinformatics. 2013 Feb 1;29(3):393-4. http://www.ncbi.nlm.nih.gov/pubmed/23202744