Automatic Epitope Recognition in Proteins Oriented to the System for Macromolecular Interaction Assessment MIAX
|
|
- Toby Wheeler
- 5 years ago
- Views:
Transcription
1 Genome Informatics 12: (2001) 113 Automatic Epitope Recognition in Proteins Oriented to the System for Macromolecular Interaction Assessment MIAX Atsushi Yoshimori Carlos A. Del Carpio Toyohashi University of Technology, Ecological Engineering, 1-1 Tempaku, Toyohashi, Aichi , Japan Abstract In the present work we evaluate the performance of an algorithm for the automatic recognition of binding sites in proteins as well as in other macromolecules whose interactions are involved in many cellular and physiological processes. The algorithm is a combination of an unsupervised learning algorithm based on Kohonen self organizing maps to characterize the properties of patches of protein solvent accessible surfaces and a filtering algorithm to establish both the physical boundaries of the patches as well as the level of contribution of different and distant atoms involved in the interaction.we have found that the algorithm performs extremely well in a set of randomly selected protein complexes for which the interaction interfaces are extracted and compared with the results of the algorithm. A statistical evaluation of the algorithm is additionally performed by analysis of the degree of hydrophobicity and hydrophilicity of the output patches and comparison with that of the observed interface constituent amino acids. Keywords: epitopes, self organizing maps, hydrophobic cluster, protein interaction 1 Introduction In recent years the system for assessment of macromolecular interaction MIAX has been under development in our laboratories [2, 3, 4], the essential characteristics of the system being its potential function based on the scaled particle theory and a powerful configuration space search engine that allows flexible docking [4] of monomers known to interact but for which only structures at the isolated state are known. However, in spite of the quality of results output by the system, and several modules developed to get better processing times of the system such as its parallelization and execution on a parallel computer system, some obstacles still remain to be overcome in order to improve the performance of the system. One of this is the recognition of the putative interaction sites on the surface of the monomers in a way to reduce the hyper-dimensional configuration space searched by the hybrid algorithm. The present work aims at the evaluation of an algorithm developed in our laboratories directed to recognize epitopes in proteins for which the three dimensional structure is the only information possessed a priori. The algorithm is based on an unsupervised learning algorithm by means of which the characteristics of the surface of the proteins are identified (or any other macromolecule) and a filtering algorithm based on a two dimensional Fourier transform which combined with the learning algorithm assists in delimiting the clusters of surface atoms constituting the binding sites. Evaluation of the performance of the algorithm is carried out by firstly selecting a set of thirty protein complex structures recorded in the PDB. Values expressing the activity in function of electrophysical properties of the atoms are then computed on each point of the solvent accessible surface
2 114 Yoshimori and del Carpio for the molecule. These values are then processed with the learning algorithm in order to identify patterns of activity distributed on the surface. The filtering process is finally applied to delimit and differentiate those distribution patterns as described in the method section. The results output by the recognition algorithm are then compared with the actual interfaces of the monomers in the complexes. This evaluation is performed in two stages in the first of which two metrics introduced by Young et al. [10] are used to measure the overlap of the predicted and observed interfaces of the complex molecules. An additional evaluation is, however, introduced in the present work oriented at an automatic evaluation and corroboration of the results. Through this automatic analysis the correctness and/or incorrectness of the results is evaluated taking into account the characteristics of the amino acids to which the atoms composing the patches or surface clusters of atoms belong. This analysis is also oriented to the elucidation and establishment of underlying rules governing macromolecular interaction and protein interaction in special. In the present paper we focus the identification of protein epitopes considering fundamentally the hydrophobic characteristics of the protein surface regions. Among previously reported studies oriented to the identification of protein epitopes and active sites we can mention the work of Korn and Burnett [7] who calculated hydropathies for subsets of atoms on the surface of the protein that were in contact surfaces and non-contact surfaces. Using a normalized consensus hydropathy scale to express the hydropathy values of atoms they found that the hydropathy of the contact surfaces was higher than the exterior non-contact surfaces in multimeric proteins and 2 protein-protein complexes. Further development of these ideas was done by Young and coworkers [10] who suggested that clusters of hydrophobic residues could be obtained by adding hydrophobic parameters of residues within a certain distance from a solvent-accessible grid point. Ranking of the clusters of surface residues on the basis of the hydropohobicity of their constituent amino acids leads to a list of candidate interaction sites with the most putative being those ranked highly hydrophobic. By representing the solvent-accessible surface area by dots, Lijnzaad and co-workers [8] developed a method that identifies regions of a surface of arbitrary size and shape consisting solely of carbon and sulfur atoms. However, only those patches large enough to exceed random expectation are taken into account as meaningful; this being its advantage as compared to other techniques since the size of the region is not constrained by the methodology utilized to locate them. In fact, application of their technique to eight multi-meric proteins shows that, in general, only the largest patches coincided with regions of known biological relevance. Several algorithms have also been reported to predict protein-protein interaction sites using multiple characteristics of surface patches, a comprehensible review is given in [5]. The algorithm presented here, as already mentioned, deals with the identification of hydrophobic clusters on the surface of the molecule that are involved in expression of the function of the proteins. 2 Methodology Most of the cellular and physiological processes, including signaling to transcription, are performed and regulated by the interaction of macromolecules. Among them, protein interaction plays the utmost relevant role in molecular and cellular recognition involved in the majority of biochemical and biophysical processes in living organisms. Revealing the principles underlying this type of interactions is not only important to understand the mechanisms of the biochemical and physiological processes of living organisms but has also impact on the development of proteins with novel functions directed to several medical and biotechnology fields. Docking protein molecules is a way of assessing protein-protein interactions and computing the interaction energies offer a quantitative means of measuring the strength of the interaction and the stability of the resulting complex. The multidimensional configuration space, however, poses some
3 Automatic Epitope Recognition 115 difficulties to a rapid search of the optimal complex conformation, a preprocessing of the structures being necessary in order to identify sites enhancing interaction of the macromolecules i.e. those parts of the molecules possessing high activity and/or affinity as compared with others. An algorithm to perform the identification of these sites or epitopes has been developed in our laboratories and described elsewhere [4], and here we evaluate its performance in a series of protein complexes for establishing the conditions in which determined properties must be used instead of others to achieve that goal. 2.1 The Algorithm The algorithm proposed here is a combination of an unsupervised learning algorithm based on a self organizing map (SOM) [6] with a filtering methodology based on a two dimensional fast Fourier Transform. The module starts with the input of the molecular 3D structure for which prediction of the binding sites is required. Then the computation of the molecular solvent accessible surface (SAS) points is followed by the calculation of the electrophysical potential (related to activity) on every one of the computed points of the SAS. Here we used the hydrophobic potential on the surface of the molecule, the computation of which is performed by the expression [1]: MHP = E tri e (r i d i ) (1) where E tri is the transfer energy for several atom types, r i is the radius of the atom i and d i is the distance between atom i and an arbitrary point M. The next step consists in the application of the SOM algorithm to the set of position vectors of the points of the molecular solvent accessible surface. Then the two dimensional Fourier transform is applied to the set of neurons to which values of the physicochemical characteristic were previously calculated according to the molecular surface points each one includes (the average over all the point belonging to that particular neuron) [4]. Hydrophobic clusters are obtained taking the inverse Fourier transform of the spectrum at certain level of frequency, points at higher frequency being eliminated as noise in a common signal processing spectrum. Finally, the binding regions are displayed for visual inspection extracting their constituents at the atomic and amino acid level from the computed results. The flow of these calculations is shown taking as example the receptor and ligand polypeptides constituting the complex with PDB code 1CHO in Fig. 1. Here the epitope computation process is shown for each monomer composing the complex. In the first step the surfaces each molecular system is shown plotting all the points constituting the SAS of the molecule. The second step is the projection of the three dimensional point vectors of the SAS points by means of the SOM method. The third step shows the filtering process using the spectral method described, while the fourth illustrates the characterization of cluster of atoms as pattern on the SAS which is the epitope or active site of the molecule. Composition of each cluster is extracted easily from the composition of each patch automatically computed by the algorithm as shown in the same figure as a final step The Evaluation Procedure The main objective of the present paper is to evaluate the validity of the former algorithm and for this purpose we use an exhaustive statistical analysis of the clusters predicted by our methodology. This analysis reveals many characteristics of the interaction sites in proteins as well as the potential underlying it. The analysis is performed using two metrics similar to the ones suggested by Young et al. [10]. The fundamental difference being that the areas used in their analysis are points on the solvent accessible surface in this report. The relation of both physical values being proportional.
4 116 Yoshimori and del Carpio (a) Receptor (1CHO) Step1 Step2 Step3 Step5 Comparison Step4 (b) Ligand (1CHO) Step1 Actual Binding Site Step2 Step3 Step5 Comparison Step4 Actual Binding Site Figure 1: Hydrophobic Cluster Analysis Protocol: (a) Receptor (1CHO). (b) Ligand (1CHO).
5 Automatic Epitope Recognition 117 Accordingly, to perform this statistical analysis on the performance of the method, the following metrics having been adopted and modified to compare calculated and experimental binding site regions. M1 = (A actual A classx )/A actual (2) M2 = (A actual A classx )/A classx (3) where, A actual stands for the number of points in the actual intermolecular contact surface (region colored in red in Fig. 1 step5), A classx is the number of points belonging to one of the predicted clusters (regions shown with different colors in Fig. 1 step5), and X being the number identifying that particular cluster. These values are calculated for all the complexes processed here. Results for the isolated receptor are illustrated in Table 1(a), while those for the isolated ligands are shown in Table 1(b), where results at different levels of frequency for the inverse transform, and for every class obtained are presented. Since large values of M1 together with low values for M2 represent predicted clusters larger than the actual binding site, while large values of M2 with low values of M1 implicate larger experimental interfaces than the predicted clusters, an ideal result is that with comparable and high values for both M1 and M2. A further evaluation procedure is applied here to examine the properties of the hydrophobic clusters identified by the algorithm with those of the amino acids that compose them. The proposed semiquantitative evaluation consists in examination of the degree of agreement of the hydrophobicity of the clusters predicted with those of the amino acids involved in the clusters, a common hydrophobic scale being used for the amino acids. When a hydrophobic cluster is identified as the epitope for a determined molecule the hydrophobic properties of the amino acids constituting the cluster are analyzed. Whether the clusters are constituted mainly by hydrophobic amino acids or by hydrophilic amino acids when they are identified as hydrophobic leads not only to ascertain the correctness of the answer in the case of coincidence but also to endorsement of the answer when a weak hydrophobic cluster has been found or none at all. 3 Results The self organizing algorithm was applied to project the multidimensional distribution of the hydrophobic potential in the two dimensional space using 2000 iteration for each set of points constituting the surfaces of both monomers (which we call receptor and substrate or ligand in what follows for clarity reasons). Data is presented for the six biggest clusters that are obtained with an inverse Fourier transform at a frequency level of 5. That is, when only frequencies lower than 5 are considered in the analysis of the cluster boundaries. As stated previously the algorithm was tested with a set of 30 protein complexes shown in Table 1 where each entry shows the number of clusters identified (Num class), values of the M1 and M2 metrics for the six most significant clusters, the area of each of the six classes (A class), the area of the experimental interface (A act), and the SAS of the protein (A all). Table 2 summarized the analysis of each cluster recording also the name of the complex, but also the total number of points composing SAS, the number of point in the actual interface as well as the amino acids constituting each observed interface. The M1 and M2 metrics are classified in three levels of magnitude: high, middle, and low, according to the degree of overlapping with the actual interface. The high level stands for M1 or M2 values higher than 50%, the middle for values lower than 50% and higher than 20% while values of lower than 20% belong to the lowest level. Table 3 summarizes the results for the thirty structures processed by the algorithm. In this table it can be noticed that the first hydrophobic clusters for 8 of the receptors and 12 of the substrates coincide with the actual interface with a high overlapping degree.
6 118 Yoshimori and del Carpio Table 1: SOM-FT Results for Complexes in Table 2: (a) Receptor. (b) Ligand. When predictions at the middle level are incorporated as correct predictions to the former, the clusters correctly predicted increase to 14 for the receptors and 21 for the substrates, representing a 46.7% of the total processed receptor structures and 100% for the substrates. These figures can be observed in the histograms of Fig. 2. Table 4 summarizes the percentage of amino acids detected in each interface for both the receptor and the substrate structures, careful observation of this table leads to detection of some tendencies in the use of determined amino acids in protein complex interfaces. Thus Arg is a preferred amino acid in the interfaces of both receptor and ligands, in spite of its high hydrophilicity while Leu a mild hydrophobic amino acid is also present with high frequency in both the receptor and ligand interfaces. Besides, Gly, Leu, Phe, Ser, and Tyr are the most frequently used amino acids in receptor interfaces, while Arg, Leu, Lys and Pro are present in the ligand interface with more than 8 % of frequency. A more exhaustive statistical analysis was carried here, therefore, considering the hydrophobic characteristics of the amino acids of the epitopes identified by the algorithm described so far. This analysis consists in ascertaining the nature of the identified clusters as well as the actual interface taking into account the hydrophobicity of the amino acids involved in both clusters of atoms. We exemplify the analysis for 4 randamly selected receptors and 4 ligands in Table 2. The histograms in Fig. 3 show the percentage of hydrophobic and hydrophilic amino acids in actual interfaces of 4 randomly selected receptors and ligands. It is easy to perceive that the automatic recognition algorithm presented here perceives correctly and with high degree of reliability the hydrophobic region as the interaction site of the protein. When the interface is composed of highly hydrophobic
7 Automatic Epitope Recognition 119 Table 2: Protein Complexes Used in the Evaluation of the SOM-FT Algorithm. Figure 2: Over all prediction rates expressed in function of M1 and M2 for each of the six clusters obtained using analysis.
8 120 Yoshimori and del Carpio Table 3: Summary of the SOM-FT Prediction results: (a) Receptor. (b) Ligand. amino acids, this is evident for the receptors 1PPF and 1HNE for which the cluster recognized contains a high percentage of hydrophobic amino acids. The hydrophobic amino acid content for the 1TEC is only six points higher than the hydrophilic content while the content of hydrophobic amino acids for 1FC2 is lower than the hydrophilic content. Accordingly in these two cases the reliability of the prediction by the algorithm is proportional to the content of hydrophobic amino acids. Therefore a relationship can be established between the content of hydrophobic amino acids and the clusters output by the algorithm. This relationship correlates the reliability of the prediction with the percentage of hydrophobic amino acids in the cluster. Whenever the property used to recognize the epitopes is the hydrophobicity and the content of hydrophobic amino acids is also high, then the prediction of the epitope coincides with very high probability with the actual interface. In contrast, when the content is relatively low, the coincidence with the actual interface is only partial. This fact leads to the conclusion that the algorithm presented here can be used to recognize the epitopes based on several properties, allowing the determination of the reliability of the calculation based on a simple computation of the property at the amino acid level. Moreover, an automatic algorithm for the evaluation of each cluster recognized by the methodology is straightforward.
9 Automatic Epitope Recognition 121 (b) Ligand (a) Receptor Figure 3: Content of hydrophobic amino acids in actual protein complex interfaces. 4 Conclusion We propose a new methodology to identify binding regions on proteins. Although the description so far was concentrated on hydrophobic cluster identification, the methodology is amenable for use with any other characteristic of electrophysical nature known to drive and affect protein-protein interactions or any other interaction among macromolecules as well as small organic compounds. The results of this analysis indicate that the proposed SOM-FT methodology predicts the binding sites in proteins with a high reliability. This is confirmed also by comparing our results with those of the statistical analysis performed by Lo Conte et al. [9] since the tendencies in characteristics of the components of the clusters are very similar to those reported by them, especially in the Proteaseinhibitor class, where many of the proteins used in the validation of our methodology are classified here. For all the isolated monomers composing the complexes analyzed, the interface for binding corresponds to surface regions of relatively strong hydrophobicity. The analysis does not use any a priori knowledge on the phenomenon besides the three dimensional structure of the proteins involved in interaction. The methodology has a wide applicability, not only oriented to the system for assessment of macromolecular interaction MIAX that we have been developing, but especially in the fields of molecular recognition and drug design where information obtained by the methodology can give relevant clues on molecular specificity, selectivity, and reactivity. Furthermore, biochemical mechanisms of several life sustaining processes in organisms may be explained in function of the binding regions identified in this way, giving clues for the identification of defects in protein structure leading to abnormal function expression of these bio-macromolecules.
10 122 Yoshimori and del Carpio Table 4: Occurrence of amino acids at observed interaction interfaces. Acknowledgements This work was conducted under the support by a Grant-in-Aid for Scientific Research on Priority Areas (C) Genome Information Science from the Ministry of Education, Culture, Sports, Sicence and Technology (MEXT) (No.: ) and the cooperation of Klimers Corp, Nagoya, Japan, in the person of his President Mr. Masaki Kobayashi. References [1] Brasseur, R., Differentiation of lipid-associating helices by use of three-dimensional molecular hydrophobicity potential calculations, J. Biol. Chem., 266: , [2] Del Carpio, C.A. and Yoshimori, A., MIAX: a novel system for assessment of macromolecular interaction in condensed phases. 1) description of the interaction model and simulation algorithm, Genome Informatics, 10:3 12, [3] Del Carpio, C.A. and Yoshimori, A., MIAX: a system for assessment of macromolecular interaction. 3) a parallel hybrid GA for flexible protein docking, Genome Informatics, 11: , [4] Del Carpio, C.A. and Yoshimori, A., MIAX: a novel paradigm for modeling bio-macromolecular interactions and complex formation in condensed phases, in Press. [5] Kleanthous, C., Protein-Protein Recognition, Oxford University Press, [6] Kohonen, T., The self-organizing map, Proc. IEEE, 78: , [7] Korn, A.P. and Burnett, R.M., Distribution and complementarity of hydropathy in multisubunit proteins, Proteins, 9:37 55, [8] Lijnzaad, P, and Argos, P., Hydrophobic patches on protein subunit interfaces: characteristics and prediction, Proteins, 28: , [9] Lo Conte, L. Chothia, C., and Janin, J., The atomic structure of protein-protein recognition sites, J. Mol. Biol., 285: , [10] Young, L., Jernigan, R.L., and Covell, D.G., A role for surface hydrophobicity in protein-protein recognition, Protein Science, 3: , 1994.
Detection of Protein Binding Sites II
Detection of Protein Binding Sites II Goal: Given a protein structure, predict where a ligand might bind Thomas Funkhouser Princeton University CS597A, Fall 2007 1hld Geometric, chemical, evolutionary
More informationStructural Bioinformatics (C3210) Molecular Docking
Structural Bioinformatics (C3210) Molecular Docking Molecular Recognition, Molecular Docking Molecular recognition is the ability of biomolecules to recognize other biomolecules and selectively interact
More informationIn silico pharmacology for drug discovery
In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of
More informationUser Guide for LeDock
User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,
More informationChemical properties that affect binding of enzyme-inhibiting drugs to enzymes
Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes Introduction The production of new drugs requires time for development and testing, and can result in large prohibitive costs
More informationRanjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013
Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA
More informationBiological Macromolecules
Introduction for Chem 493 Chemistry of Biological Macromolecules Dr. L. Luyt January 2008 Dr. L. Luyt Chem 493-2008 1 Biological macromolecules are the molecules of life allow for organization serve a
More informationHydrophobic Patches on Protein Subunit Interfaces: Characteristics and Prediction
5 INTERFACE PATCHES P. Lijnzaad and P. Argos. Hydrophobic patches on protein subunit interfaces: characteristics and prediction. Proteins 28:333-343 (1997) CHAPTER 5 PROTEINS: Structure, Function, and
More informationProtein Structure. W. M. Grogan, Ph.D. OBJECTIVES
Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate
More informationPacking of Secondary Structures
7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary
More informationViewing and Analyzing Proteins, Ligands and their Complexes 2
2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing
More informationCHEMISTRY (CHEM) CHEM 5800 Principles Of Materials Chemistry. Tutorial in selected topics in materials chemistry. S/U grading only.
Chemistry (CHEM) 1 CHEMISTRY (CHEM) CHEM 5100 Principles of Organic and Inorganic Chemistry Study of coordination compounds with a focus on ligand bonding, electron counting, molecular orbital theory,
More informationSAM Teacher s Guide Protein Partnering and Function
SAM Teacher s Guide Protein Partnering and Function Overview Students explore protein molecules physical and chemical characteristics and learn that these unique characteristics enable other molecules
More informationSecondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure
Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted
More informationChapter 4: Amino Acids
Chapter 4: Amino Acids All peptides and polypeptides are polymers of alpha-amino acids. lipid polysaccharide enzyme 1940s 1980s. Lipids membrane 1960s. Polysaccharide Are energy metabolites and many of
More informationDocking. GBCB 5874: Problem Solving in GBCB
Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular
More informationSoftwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007
Softwares for Molecular Docking Lokesh P. Tripathi NCBS 17 December 2007 Molecular Docking Attempt to predict structures of an intermolecular complex between two or more molecules Receptor-ligand (or drug)
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationMolecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov
Biophysical Journal, Volume 98 Supporting Material Molecular dynamics simulations of anti-aggregation effect of ibuprofen Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Supplemental
More informationChemical properties that affect binding of enzyme-inhibiting drugs to enzymes
Introduction Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes The production of new drugs requires time for development and testing, and can result in large prohibitive costs
More informationKd = koff/kon = [R][L]/[RL]
Taller de docking y cribado virtual: Uso de herramientas computacionales en el diseño de fármacos Docking program GLIDE El programa de docking GLIDE Sonsoles Martín-Santamaría Shrödinger is a scientific
More informationSTRUCTURAL BIOLOGY AND PATTERN RECOGNITION
STRUCTURAL BIOLOGY AND PATTERN RECOGNITION V. Cantoni, 1 A. Ferone, 2 O. Ozbudak, 3 and A. Petrosino 2 1 University of Pavia, Department of Electrical and Computer Engineering, Via A. Ferrata, 1, 27, Pavia,
More information7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?
ame 7.01 Problem Set 1 Section Question 1 a) What are the four major types of biological molecules discussed in lecture? Give one important function of each type of biological molecule in the cell? b)
More informationDOCKING TUTORIAL. A. The docking Workflow
2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization
More information2 Dean C. Adams and Gavin J. P. Naylor the best three-dimensional ordination of the structure space is found through an eigen-decomposition (correspon
A Comparison of Methods for Assessing the Structural Similarity of Proteins Dean C. Adams and Gavin J. P. Naylor? Dept. Zoology and Genetics, Iowa State University, Ames, IA 50011, U.S.A. 1 Introduction
More informationSupplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution
Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST),
More informationReceptor Based Drug Design (1)
Induced Fit Model For more than 100 years, the behaviour of enzymes had been explained by the "lock-and-key" mechanism developed by pioneering German chemist Emil Fischer. Fischer thought that the chemicals
More informationToward an Understanding of GPCR-ligand Interactions. Alexander Heifetz
Toward an Understanding of GPCR-ligand Interactions Alexander Heifetz UK QSAR and ChemoInformatics Group Conference, Cambridge, UK October 6 th, 2015 Agenda Fragment Molecular Orbitals (FMO) for GPCR exploration
More informationPose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft
Pose and affinity prediction by ICM in D3R GC3 Max Totrov Molsoft Pose prediction method: ICM-dock ICM-dock: - pre-sampling of ligand conformers - multiple trajectory Monte-Carlo with gradient minimization
More information2: CHEMICAL COMPOSITION OF THE BODY
1 2: CHEMICAL COMPOSITION OF THE BODY Although most students of human physiology have had at least some chemistry, this chapter serves very well as a review and as a glossary of chemical terms. In particular,
More informationMM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors
MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors Thomas Steinbrecher Senior Application Scientist Typical Docking Workflow Databases
More informationRetrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a
Retrieving hits through in silico screening and expert assessment M.. Drwal a,b and R. Griffith a a: School of Medical Sciences/Pharmacology, USW, Sydney, Australia b: Charité Berlin, Germany Abstract:
More informationBA, BSc, and MSc Degree Examinations
Examination Candidate Number: Desk Number: BA, BSc, and MSc Degree Examinations 2017-8 Department : BIOLOGY Title of Exam: Molecular Biology and Biochemistry Part I Time Allowed: 1 hour and 30 minutes
More informationWhat makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary
Electronic Supplementary Material (ESI) for Journal of Materials Chemistry B. This journal is The Royal Society of Chemistry 21 What makes a good graphene-binding peptide? Adsorption of amino acids and
More informationUsing Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017
Using Phase for Pharmacophore Modelling 5th European Life Science Bootcamp March, 2017 Phase: Our Pharmacohore generation tool Significant improvements to Phase methods in 2016 New highly interactive interface
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature17991 Supplementary Discussion Structural comparison with E. coli EmrE The DMT superfamily includes a wide variety of transporters with 4-10 TM segments 1. Since the subfamilies of the
More informationSupplementary Figure 1 Preparation of PDA nanoparticles derived from self-assembly of PCDA. (a)
Supplementary Figure 1 Preparation of PDA nanoparticles derived from self-assembly of PCDA. (a) Computer simulation on the self-assembly of PCDAs. Two kinds of totally different initial conformations were
More informationAny protein that can be labelled by both procedures must be a transmembrane protein.
1. What kind of experimental evidence would indicate that a protein crosses from one side of the membrane to the other? Regions of polypeptide part exposed on the outside of the membrane can be probed
More informationIntroduction to Comparative Protein Modeling. Chapter 4 Part I
Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature
More informationA Model for Computer Identification of Micro-organisms
J. gen, Microbial. (1965), 39, 401405 Printed.in Great Britain 401 A Model for Computer Identification of Micro-organisms BY H. G. GYLLENBERG Department of Microbiology, Ulziversity of Helsinki, Finland
More informationSupplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1
Supplementary Information Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1 Philip D. Mosier 1, Meng-Jung Chiang 2, Zhengshi Lin 2, Yamei Gao 2, Bashayer Althufairi
More informationFuzzy Clustering of Gene Expression Data
Fuzzy Clustering of Gene Data Matthias E. Futschik and Nikola K. Kasabov Department of Information Science, University of Otago P.O. Box 56, Dunedin, New Zealand email: mfutschik@infoscience.otago.ac.nz,
More informationCentral Dogma. modifications genome transcriptome proteome
entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!
More informationVisualization of Macromolecular Structures
Visualization of Macromolecular Structures Present by: Qihang Li orig. author: O Donoghue, et al. Structural biology is rapidly accumulating a wealth of detailed information. Over 60,000 high-resolution
More informationHit Finding and Optimization Using BLAZE & FORGE
Hit Finding and Optimization Using BLAZE & FORGE Kevin Cusack,* Maria Argiriadi, Eric Breinlinger, Jeremy Edmunds, Michael Hoemann, Michael Friedman, Sami Osman, Raymond Huntley, Thomas Vargo AbbVie, Immunology
More informationschematic diagram; EGF binding, dimerization, phosphorylation, Grb2 binding, etc.
Lecture 1: Noncovalent Biomolecular Interactions Bioengineering and Modeling of biological processes -e.g. tissue engineering, cancer, autoimmune disease Example: RTK signaling, e.g. EGFR Growth responses
More informationBuilding 3D models of proteins
Building 3D models of proteins Why make a structural model for your protein? The structure can provide clues to the function through structural similarity with other proteins With a structure it is easier
More informationProtein Science (1997), 6: Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society
1 of 5 1/30/00 8:08 PM Protein Science (1997), 6: 246-248. Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society FOR THE RECORD LPFC: An Internet library of protein family
More informationAmino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information
Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information Grzegorz Nawrocki and Marek Cieplak Institute of Physics, Polish Academy of Sciences,
More informationUniversal Similarity Measure for Comparing Protein Structures
Marcos R. Betancourt Jeffrey Skolnick Laboratory of Computational Genomics, The Donald Danforth Plant Science Center, 893. Warson Rd., Creve Coeur, MO 63141 Universal Similarity Measure for Comparing Protein
More informationTowards Physics-based Models for ADME/Tox. Tyler Day
Towards Physics-based Models for ADME/Tox Tyler Day Overview Motivation Application: P450 Site of Metabolism Application: Membrane Permeability Future Directions and Applications Motivation Advantages
More informationProtein Structure Prediction Using Multiple Artificial Neural Network Classifier *
Protein Structure Prediction Using Multiple Artificial Neural Network Classifier * Hemashree Bordoloi and Kandarpa Kumar Sarma Abstract. Protein secondary structure prediction is the method of extracting
More informationCHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis
CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis Purpose: The purpose of this laboratory is to introduce some of the basic visualization and modeling tools for viewing
More informationNext Generation Computational Chemistry Tools to Predict Toxicity of CWAs
Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National
More informationA Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery
AtomNet A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery Izhar Wallach, Michael Dzamba, Abraham Heifets Victor Storchan, Institute for Computational and
More informationUsing Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface
Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites Introduction J. Andrew Surface Hampden-Sydney College / Virginia Commonwealth University In the past several decades
More informationSelecting protein fuzzy contact maps through information and structure measures
Selecting protein fuzzy contact maps through information and structure measures Carlos Bousoño-Calzón Signal Processing and Communication Dpt. Univ. Carlos III de Madrid Avda. de la Universidad, 30 28911
More informationLecture 10: Cyclins, cyclin kinases and cell division
Chem*3560 Lecture 10: Cyclins, cyclin kinases and cell division The eukaryotic cell cycle Actively growing mammalian cells divide roughly every 24 hours, and follow a precise sequence of events know as
More information2.1 The Nature of Matter
2.1 The Nature of Matter Lesson Objectives Identify the three subatomic particles found in atoms. Explain how all of the isotopes of an element are similar and how they are different. Explain how compounds
More informationStudy of Non-Covalent Complexes by ESI-MS. By Quinn Tays
Study of Non-Covalent Complexes by ESI-MS By Quinn Tays History Overview Background Electrospray Ionization How it is used in study of noncovalent interactions Uses of the Technique Types of molecules
More informationPhysiochemical Properties of Residues
Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)
More informationStructural biology and drug design: An overview
Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved
More informationGC and CELPP: Workflows and Insights
GC and CELPP: Workflows and Insights Xianjin Xu, Zhiwei Ma, Rui Duan, Xiaoqin Zou Dalton Cardiovascular Research Center, Department of Physics and Astronomy, Department of Biochemistry, & Informatics Institute
More informationUsing AutoDock for Virtual Screening
Using AutoDock for Virtual Screening CUHK Croucher ASI Workshop 2011 Stefano Forli, PhD Prof. Arthur J. Olson, Ph.D Molecular Graphics Lab Screening and Virtual Screening The ultimate tool for identifying
More informationProtein Structure Basics
Protein Structure Basics Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera Importance of Proteins Muscle structure depends on protein-protein interactions Transport across membranes
More informationDISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:
DISCRETE TUTORIAL Agustí Emperador Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING: STRUCTURAL REFINEMENT OF DOCKING CONFORMATIONS Emperador
More informationMajor Types of Association of Proteins with Cell Membranes. From Alberts et al
Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group
More informationTowards Detecting Protein Complexes from Protein Interaction Data
Towards Detecting Protein Complexes from Protein Interaction Data Pengjun Pei 1 and Aidong Zhang 1 Department of Computer Science and Engineering State University of New York at Buffalo Buffalo NY 14260,
More informationNMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease
University of Wollongong Research Online Faculty of Science - Papers (Archive) Faculty of Science, Medicine and Health 2009 NMR study of complexes between low molecular mass inhibitors and the West Nile
More informationSupplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two
Supplementary Figure 1. Biopanningg and clone enrichment of Alphabody binders against human IL 23. Positive clones in i phage ELISA with optical density (OD) 3 times higher than background are shown for
More informationIntroduction to" Protein Structure
Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.
More informationSignal Transduction Phosphorylation Protein kinases. Misfolding diseases. Protein Engineering Lysozyme variants
Signal Transduction Phosphorylation Protein kinases Misfolding diseases Protein Engineering Lysozyme variants Cells and Signals Regulation The cell must be able to respond to stimuli Cellular activities
More informationFrancisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans
From: ISMB-97 Proceedings. Copyright 1997, AAAI (www.aaai.org). All rights reserved. ANOLEA: A www Server to Assess Protein Structures Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans Facultés
More informationExam I Answer Key: Summer 2006, Semester C
1. Which of the following tripeptides would migrate most rapidly towards the negative electrode if electrophoresis is carried out at ph 3.0? a. gly-gly-gly b. glu-glu-asp c. lys-glu-lys d. val-asn-lys
More informationCHEMISTRY (CHEM) CHEM 1200 Problem Solving In General Chemistry
Chemistry (CHEM) 1 CHEMISTRY (CHEM) CHEM 1090 Elementary Chemistry [0-3 credit hours (0-2, 0, 0-1)] For students who major in science, engineering or other fields which require chemistry as a prerequisite
More informationPhysical Science Q2, U4: Chemical Bonding (This unit builds student capacity to engage Keystone Biology Eligible Content.)
Physical Science: Term 2, Unit 4 Topic: Chemical Bonding Duration: Traditional (50 minute periods) : 13-16 days (adjust using professional discretion) Block (90 minute periods) : 6-8 days (adjust using
More informationSection Week 3. Junaid Malek, M.D.
Section Week 3 Junaid Malek, M.D. Biological Polymers DA 4 monomers (building blocks), limited structure (double-helix) RA 4 monomers, greater flexibility, multiple structures Proteins 20 Amino Acids,
More informationMARTINI simulation details
S1 Appendix MARTINI simulation details MARTINI simulation initialization and equilibration In this section, we describe the initialization of simulations from Main Text section Residue-based coarsegrained
More informationIn Silico Investigation of Off-Target Effects
PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics
More informationNotes of Dr. Anil Mishra at 1
Introduction Quantitative Structure-Activity Relationships QSPR Quantitative Structure-Property Relationships What is? is a mathematical relationship between a biological activity of a molecular system
More informationPerforming a Pharmacophore Search using CSD-CrossMiner
Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0
More informationValidation of Experimental Crystal Structures
Validation of Experimental Crystal Structures Aim This use case focuses on the subject of validating crystal structures using tools to analyse both molecular geometry and intermolecular packing. Introduction
More informationChemistry Courses. Courses. Chemistry Courses 1
Chemistry Courses 1 Chemistry Courses Courses CHEM 1105. Laboratory for CHEM 1305.. Prerequisite(s): (CHEM 1305 w/d or better) CHEM 1106. Laboratory for CHEM 1306.. Prerequisite(s): (CHEM 1306 w/d or better)
More informationMolecular Structure Prediction by Global Optimization
Molecular Structure Prediction by Global Optimization K.A. DILL Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, CA 94118 A.T. PHILLIPS Computer Science
More informationPrinciples of Drug Design
Advanced Medicinal Chemistry II Principles of Drug Design Tentative Course Outline Instructors: Longqin Hu and John Kerrigan Direct questions and enquiries to the Course Coordinator: Longqin Hu I. Introduction
More informationLecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability
Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions
More informationMembrane Protein Channels
Membrane Protein Channels Potassium ions queuing up in the potassium channel Pumps: 1000 s -1 Channels: 1000000 s -1 Pumps & Channels The lipid bilayer of biological membranes is intrinsically impermeable
More informationProtein Folding Prof. Eugene Shakhnovich
Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)
More informationFEEDBACK GMDH-TYPE NEURAL NETWORK AND ITS APPLICATION TO MEDICAL IMAGE ANALYSIS OF LIVER CANCER. Tadashi Kondo and Junji Ueno
International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 3(B), March 2012 pp. 2285 2300 FEEDBACK GMDH-TYPE NEURAL NETWORK AND ITS
More informationProtein Fragment Search Program ver Overview: Contents:
Protein Fragment Search Program ver 1.1.1 Developed by: BioPhysics Laboratory, Faculty of Life and Environmental Science, Shimane University 1060 Nishikawatsu-cho, Matsue-shi, Shimane, 690-8504, Japan
More information7.012 Problem Set 1 Solutions
ame TA Section 7.012 Problem Set 1 Solutions Your answers to this problem set must be inserted into the large wooden box on wheels outside 68120 by 4:30 PM, Thursday, September 15. Problem sets will not
More informationBioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA
Pharmacophore Model Development for the Identification of Novel Acetylcholinesterase Inhibitors Edwin Kamau Dept Chem & Biochem Kennesa State Uni ersit Kennesa GA 30144 Dept. Chem. & Biochem. Kennesaw
More informationStructural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme
Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 215 Structural and mechanistic insight into the substrate binding from the conformational
More informationALL LECTURES IN SB Introduction
1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL
More informationAutomated Assignment of Backbone NMR Data using Artificial Intelligence
Automated Assignment of Backbone NMR Data using Artificial Intelligence John Emmons στ, Steven Johnson τ, Timothy Urness*, and Adina Kilpatrick* Department of Computer Science and Mathematics Department
More informationAn introduction to SYSTEMS BIOLOGY
An introduction to SYSTEMS BIOLOGY Paolo Tieri CNR Consiglio Nazionale delle Ricerche, Rome, Italy 10 February 2015 Universidade Federal de Minas Gerais, Belo Horizonte, Brasil Course outline Day 1: intro
More informationStructure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27
Acta Cryst. (2014). D70, doi:10.1107/s1399004714021695 Supporting information Volume 70 (2014) Supporting information for article: Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase
More informationPENNSYLVANIA. Explain the nature of the carbon atom and how organic molecules are important to organisms. Page 1 of B.A7.
Know: Understand: Do: S11.C.1.1.1 -- Essential Explain that matter is made of particles called atoms and that atoms are composed of even smaller particles (e.g., proton, neutrons, electrons). S11.C.1.1.2
More informationComputational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007
Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based
More informationCHAPTER 2 THE CHEMICAL BASIS OF LIFE
CHAPTER 2 THE CHEMICAL BASIS OF LIFE CHAPTER OVERVIEW This chapter introduces very basic concepts of chemistry, emphasizing the structure of atoms and how they combine (form bonds). The types of bonds,
More information