Homology modeling of Ferredoxin-nitrite reductase from Arabidopsis thaliana

Similar documents
HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Modeling for 3D structure prediction

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Protein structure analysis. Risto Laakso 10th January 2005

Bioinformatics. Macromolecular structure

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Introduction to" Protein Structure

ALL LECTURES IN SB Introduction

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Protein Modeling. Generating, Evaluating and Refining Protein Homology Models

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Protein Modeling Methods. Knowledge. Protein Modeling Methods. Fold Recognition. Knowledge-based methods. Introduction to Bioinformatics

Supporting Online Material for

Prediction and refinement of NMR structures from sparse experimental data

Bioinformatics. Dept. of Computational Biology & Bioinformatics

CAP 5510 Lecture 3 Protein Structures

Presenter: She Zhang

The Structure and Functions of Proteins

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Secondary and sidechain structures

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

The typical end scenario for those who try to predict protein

Week 10: Homology Modelling (II) - HHpred

Useful background reading

Biomolecules: lecture 10

Analysis and Prediction of Protein Structure (I)

Protein Structure Prediction and Display

Protein Folding Prof. Eugene Shakhnovich

BIOINFORMATICS LAB AP BIOLOGY

Nature Structural and Molecular Biology: doi: /nsmb.2938

Basics of protein structure

Protein Structure Analysis and Verification. Course S Basics for Biosystems of the Cell exercise work. Maija Nevala, BIO, 67485U 16.1.

Photosystem I in Arabidopsis Thaliana

Protein Structure Prediction

Lecture 26: Polymers: DNA Packing and Protein folding 26.1 Problem Set 4 due today. Reading for Lectures 22 24: PKT Chapter 8 [ ].

Enzyme Catalysis & Biotechnology

Pymol Practial Guide

SUPPLEMENTARY INFORMATION

Biomolecules: lecture 9

Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans

DATE A DAtabase of TIM Barrel Enzymes

tconcoord-gui: Visually Supported Conformational Sampling of Bioactive Molecules

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

BIOCHEMISTRY Course Outline (Fall, 2011)

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

Physiochemical Properties of Residues

Computer simulations of protein folding with a small number of distance restraints

BCMP 201 Protein biochemistry

Getting To Know Your Protein

Table S1. Primers used for the constructions of recombinant GAL1 and λ5 mutants. GAL1-E74A ccgagcagcgggcggctgtctttcc ggaaagacagccgcccgctgctcgg

Motif Prediction in Amino Acid Interaction Networks

CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis

Details of Protein Structure

SUPPLEMENTARY FIGURES

PROTEIN'STRUCTURE'DETERMINATION'

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Molecular Modeling lecture 2

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Building a Homology Model of the Transmembrane Domain of the Human Glycine α-1 Receptor

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Chem. 27 Section 1 Conformational Analysis Week of Feb. 6, TF: Walter E. Kowtoniuk Mallinckrodt 303 Liu Laboratory

Overview. The peptide bond. Page 1

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Identifying Interaction Hot Spots with SuperStar

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

BSc and MSc Degree Examinations

Homology Modeling I. Growth of the Protein Data Bank PDB. Basel, September 30, EMBnet course: Introduction to Protein Structure Bioinformatics

Structure to Function. Molecular Bioinformatics, X3, 2006

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence

Algorithm for Rapid Reconstruction of Protein Backbone from Alpha Carbon Coordinates

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

Orientational degeneracy in the presence of one alignment tensor.

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

Template-Based Modeling of Protein Structure

Visualization of Macromolecular Structures

Protein Structure Basics

Paramount role of sequencing in Ubiquitous forms of Nucleotide

CS612 - Algorithms in Bioinformatics

Analysis of correlated mutations in Ras G-domain

Conformational Geometry of Peptides and Proteins:

Protein Structure Prediction, Engineering & Design CHEM 430

Computational Molecular Biology

SUPPLEMENTARY INFORMATION

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

Protein Structure Prediction

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

THE UNIVERSITY OF MANITOBA. PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS

Full wwpdb NMR Structure Validation Report i

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Molecular Visualization. Introduction

Transcription:

www.bioinformation.net Hypothesis Volume 6(3) Homology modeling of Ferredoxin-nitrite reductase from Arabidopsis thaliana Karim Kherraz*, Khaled Kherraz, Abdelkrim Kameli Biology department, Ecole Normale Superieure, ENS-Kouba, PB 92, Algiers, Algeria; Karim Kherraz - Email: karimkherraz12@yahoo.fr; Phone: +213557710828; *Corresponding author Received December 02, 2010; Accepted December 24, 2010; Published April 22, 2011 Abstract: Nitrogen is one of the major growth-limiting nutrients for plants: The main source of nitrogen in most of the higher plants is nitrate taken up through roots. Nitrate can be reduced both in the chloroplasts (photosynthetic tissues) and in proplastes (nonphotosynthetic tissues) such as roots. Ferredoxin-nitrite reductase (NiR) catalyses the reduction of nitrite to ammonium in the second step of the nitrate- assimilation pathway. Homology model of Ferredoxin-nitrite reductase has been constructed using the X-ray structure (PDB code: 2akj) a s a template and MODELLER 9v5 software. The resulting model assessed by PROCHECK, PROSAII and RMSD that showed the final refined model is reliable: has 81% of amino acid sequence identity with template, 0.2Å as RMSD and has (-10.37) as Z-scores, the Ramachandran plot analysis showed that conformations for 99.5 % of amino acid residues are within the most favored regions. The model could prove useful in further functional characterization of this protein. Keywords: Arabidopsis thaliana; Homology modeling; Validation. Abbreviations: PDB-Protein Data Bank, NMR-Nuclear Magnetic Resonance, NiR-Nitrite Reductase, RMSD-Root Mean Squared Deviation, Fd-ferredoxin. Background: Nitrogen assimilation is a fundamental biological process that has marked effects on plant productivity with nitrogen deficiency leading to a decrease in plant growth. Nitrate (NO3) is the main source of nitrogen for higher plants [1]. It is converted to reduced nitrogen by the nitrate assimilation pathway. The initial pathway of nitrate assimilation involves two enzymes, nitrate reductase [EC 1.6.6.1] which catalyzes the reduction of nitrate to nitrite, and ferrdoxinnitrite reductase [EC 1.7.7.1] which catalyzes the reduction of nitrite to ammonium ions (Figure 1). Ammonium is then incorporated into amino acids by the glutamine synthetase and glutamate synthase (GOGAT) pathway. Ferredoxin-nitrite reductase (EC 1.7.7.1), commonly called nitrite reductase (NiR) catalyzes the six electron reduction of nitrite to ammonia, with reduced ferredoxin (Fd) serving as the electron donor. Nitrite reductase found in plants, algae, and cyanobacteria contains one siroheme and one [Fe4-S4] cluster, acting as separate one-electroncarriers. All ferredoxin-dependent nitrite reductases contain two prosthetic groups, a siroheme and an iron-sulfur cluster. In Nitrite and Sulfite reductases, the sirohemes are axially coordinated through a cysteine residue that also serves as one of the ligands for the iron-sulfur cluster [2]. Nitrite binds to the iron of the siroheme at the axial position opposite the iron- sulfur cluster [3] in cyanobacteria and chloroplasts of photosynthetic eukaryotes.the physiological electron-donor ferredoxin is reduced by light dependent reactions of the electron-transfer chain. In higher plants, ferredoxin- dependent nitrite reductase is presented both in the chloroplasts and in nonphotosynthetic tissues such as roots where ferredoxin is reduced by the NADPH pool produced via the reductive pentose phosphate cycle. The function of proteins is generally determined by its three-dimensional (3D) structure. Thus, it would be useful to know the 3D structure of the thousands of protein sequences that are emerging from many genome projects. Structural studies on bio-molecules have changed our perception of the biological world in the last twenty years. A number of efforts have been made on structure prediction.one such technique that has found a wide appreciation, is Homology modeling which aims at predicting the 3D structure of biomolecules, relying heavily on resources such as pattern/function and sequence. However, three- dimensional structure of Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana remains unknown. In the present study, effort was made to generate the three-dimensional (3D) structure of the Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana based on the available template (2akj) structural homologues from Protein Data Bank and the model validated with standard parameters (PROCHECK, PROSAII, RMSD). This study could prove useful in further functional characterization of this important group of enzymes. Materials and Methodology: Sequence Alignment: The FASTA sequence of Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana was retrieved from the NCBI database (National Center for Biotechnology Information) that has 586 amino acids (AAD17406). Comparative modeling usually starts by searching the PDB of known protein structures using the target sequence as the query [4]. This search is generally done by comparing the target sequence with the sequence of each of the Bioinformation 6(3): 115-119 (2011) 115 2011 Biomedical Informatics

structures in the database. The target sequence was searched for similar sequence using the BLAST (Basic Local Alignment Search Tool) [5] against Protein Database (PDB). The BLAST results yielded X-ray structure of 2akj (Ferredoxin-nitrite reductase (EC 1.7.7.1)) from Spinach with 81% similarity to our target protein. RMSD: Root Mean Squared Deviation (RMSD) is commonly used to represent the distance between two objects. In a structural sense, this value indicates the degree to which two three dimensional structures are similar. The lower the value, the more similar the structures are. The RMSD value [10] between the template 2akj and our model structure was calculated using PYMOL program (Figure 3). Figure 1: Nitrite reductase in chloroplasts transfers electrons from Ferredoxin to Nitrite. Figure 3: Superposition of Cα trace of Ferredoxin-nitrite reductase model from Arabidopsis thaliana (represented in red color) and 2akj (represented in blue color) by Pymol program. Results: Comparative Modeling of Ferredoxin-nitrite reductase Model: Tertiary structure of a protein is build by packing of its secondary structure elements to form discrete domains or autonomous folding units. Comparative modeling predicts the 3-D structure of Ferredoxin-nitrite reductase Model a given protein sequence (target) based primarily on its alignment to 2akj as a template (a determined structure experimentally). The hypothetical protein models created were stored as PDB output file. This latter was visualized by Pymol program. Figure 2: The final 3D structure of Ferredoxin-nitrite reductase model from Arabidopsis thaliana. The α-helices and β sheets are represented by red helices and yellow ribbons respectively. This model was conducted by MODELLER program Comparative Modeling: The theoretical structure (Figure 2) of Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana is generated using MODELLER-9v5 by comparative modeling of protein structure prediction. MODELLER implements comparative protein structure modeling by satisfaction of spatial restraints. The program was designed to use as many different types of information about the target sequence as possible [6]. Validation of Ferredoxin-nitrite reductase Model: PROCHECK: A versatile protein structure analysis program [7] available at the Joint Centre for Structural Genomics, Bioinformatics core, University of California, San Diego; was used in validation of protein structure and models by verifying the parameters like Ramachandran plot quality, peptide bond planarity, Bad nonbonded interactions, main chain hydrogen bond energy, Calpha chirality and over-all G factor and the side chain parameters like standard deviations of chi1 gauche minus, trans and plus, pooled standard deviations of chi1 with respect to refined structures [8]. PROSAII: This program compares Z scores between target and template structure. The Z scores of model is a measure of compatibility between its sequence and structure. The model Z score should be comparable to the Z scores obtained from the template [9]. Validation of Protein Structures of Ferredoxin-nitrite reductase Model: The hypothetical protein models generated were analyzed online by submitting to Joint Center for Structural Genomics (JCSG), Bioinformatics core, University of California, San Diego. Accuracy of the protein model generated was judged by validity report generated by PROCHECK. Parameter comparisons of these proteins were made with well-refined structures that have similar resolution.the main chain parameters plotted are Ramachandran plot quality, peptide bond planarity, Bad non-bonded interactions, main chain hydrogen bond energy, C- alpha chirality and over-all G factor. In the Ramamchandran plot analysis, the residues were classified according o their regions in the quadrangle. The Ramachandran map for Ferredoxin-nitrite reductase model is represented in (Figure 4) and the plot statistics (Table 1 see Supplementary material). Discussion: Homology protein modelling uses experimentally determined protein structures (templates) to predict the 3-D of another protein that has a similar amino acid sequence (the target). This approach to modelling is possible since a small change in the protein sequence usually results in a small change in its 3D structure [11] Homology modelling remains the only modelling method that can provide models with a root mean square error lower than 2Å. The FASTA sequence Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana was obtained from NCBI. The primary requirement for reliable homology modeling is a detectable similarity between the sequence of interest (target sequence) and a known structure (template) By BLASTp query. Based on sequence similarity analysis between target and other proteins known structures showed that our Ferredoxin-nitrite reductase from Arabidopsis thaliana (2akj) has 81% of amino acid sequence identity with Ferredoxin-nitrite reductase (EC 1.7.7.1) from Spinach. Practically, at this level of sequence identity, it is good enough to use crystallographic structures of 2akj as a template in order to obtain high quality alignment for structure prediction by homology modeling. The alignment between target and 2akj_A is shown in (Figure 5). Bioinformation 6(3): 115-119 (2011) 116 2011 Biomedical Informatics

generated by Modeller program [14] is more acceptable. The molecular visualization Pymol program was used to manipulate the models based on residue interactions, energy minimization and steric hinderance. The best model predicted by Modeller (isoa 4) was used for further analysis by PROCHECK [7]. Ramachandran plot analysis showed that main-chain conformations for 99.5 % of amino acid residues are within the most favored or allowed regions, 89.9% of the residues in the most favored region, 9.2 % in the allowed and 0.5% in the generously allowed regions and only 0.5% in the disallowed region. In general, a score close to 100% implies good stereochemical quality of the models [15]. Figure 4: Ramachandran s Map of Ferredoxin-nitrite reductase model from Arabidopsis thaliana. The Plot calculation was done with PROCHECK program. Figure 5: Sequence alignment of Ferredoxin-nitrite reductase model from Arabidopsis thaliana with template (PDB code: 2akj) done using MODELLER. The conserved regions are indicated by *. Homology modeling is currently restricted to protein sequences (targets) that share 30% or more sequence identity to an experimentally solved protein structure template [12]. Under this sequence identity the reliability of the sequence alignment between target and template decreases fastly, resulting into significant modeling errors, low accuracy models should still be treated with attention. Medium accuracy models, obtained with a template-target sequence identity of 30 50%, tend to have nearly 85% of their C-α atoms within 3.5 Å of the correct position. These models often fit a variety of applications, including the testing of ligand binding states by designing site directed mutants with altered binding capacity, and computational screening of databases listing small molecules for potential lead compounds or inhibitors. Top accuracy models, based on sequence identities more than 50%, usually have structures comparable to 3 Å resolution X-ray structures and can be used for more reliable calculations as (ligand docking, drug design), however sequence identities more than 90% can be used to facilitate a meaningful biophysical description of the active site [13]. In this study were also checked for φ and ψ torsion angles using the Ramchandran plots. A comparison of the results shows that one of the models Figure 6: The plan of Z-Score shows spots of Z scores's values of proteins determined by NMR (represented in dark blue coulor) and by X ray (represented in light blue coulor) using PROSAII program. The two black dots represent Z-Scores of the template and our model. The overall general similarities and subtle difference among the 3D structure of template 2akj and predicted model isoa4 can be seen from the backbone superposition. As evident from superposition, general folding topology of the structure is similar; however, some structural differences appear between the predicted model and template. These differences are mainly due to insertion and deletions in different loop regions. The RMSD (Root Mean Square Deviation) between predicted model and template is 0.2 Å. The low RMSD between the target and template reflects the presence of strong homology (The lower the value, the more similar the structures are). The z-score indicates overall model quality and measures the deviation of the total energy of the structure with respect to an energy distribution derived from random conformations [9]. In order to facilitate interpretation of the z-score of the specified protein, its particular value is displayed in a plot that contains the z- scores of all experimentally determined protein chains in current PDB. Groups of structures from different sources (X-ray, NMR) are distinguished by different colors (NMR with dark blue and X ray with light blue). This plot can be used to check whether the z-score of the protein in question is within the range of scores typically found for proteins of similar size belonging to one of these groups. It can be seen in (Figure 6) that Z-scores value of the obtained model (isoa.b99990004.pdb) of Ferredoxin-nitrite reductase (EC 1.7.7.1) from Arabidopsis thaliana (-10.37) is located within the space of proteins determined By X ray. This value is extremely closed to the value of the template (-10.54) which suggests that the obtained model is reliable and very close to experimentally determined structures. Conclusion: Based on the Template structure it is clearly observed that the theoretical structure generated is structurally similar to the template structure which is highly sufficient for the development of specific ligand for Ferredoxin-nitrite reductase from Arabidopsis thaliana. Our model of NIR is only a predictive, and needs to be confirmed experimentally. This modeled structure can be used to predict the molecular function and the active sites which received less attention in previous reports. References: [1] Crawford NM & Glass ADM. Trends Plant Sci. 1998 3: 389 [2] Knaff DB et al. Kluwer Academic Publishers. 1996 4: 333 [3] Kuznetsova S et al. Biochemistry. 2004 43: 10765 [PMID: 15311938] [4] Westbrook J et al. Nucleic Acids Res. 2002 30: 245 [PMID: 11752306] [5] Altschul SF et al. Nucleic Acids Res. 1997 25: 3389 [PMID: 9254694] [6] Fiser A et al. Protein Sci. 2000 9: 1753 [PMID: 11045621] [7] Laskowski RA et al. J Biomol NMR. 1996 8: 477 [PMID: 9008363] [8] Morris AL et al. Proteins. 1992 12: 345 [PMID: 1579569] Bioinformation 6(3): 115-119 (2011) 117 2011 Biomedical Informatics

[9] Sippl MJ et al. Curr Opin Struct Biol.1995 5: 229 [PMID: 7648326] [10] Zhang Y & Skolnick J. Proteins. 2004 57: 702 [PMID: 15476259] [11] Hubbard TJ & Blundell TL. Protein Eng. 1: 159 [PMID: 3507702] [12] Baker D & Sali A. Science. 2001 294: 3 [PMID: 11588250] [13] Marsden RL, Orengo CA. Methods Mol Biol. 2008 426: 3 [PMID: 18542854] [14] Sali A et al. Proteins. 1995 23: 318 [PMID: 8710825] [15] Reddy ChS et al. Comput Biol Chem. 2006 30: 120 [PMID: 16540373] Edited by I Roterman Citation: Kherraz et al. Bioinformation 6(3): 115-119 (2011) License statement: This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. Bioinformation 6(3): 115-119 (2011) 118 2011 Biomedical Informatics

Supplementary material: Table 1: Ramachandran plot calculation for 3D model of Ferredoxin: nitrite reductase model from Arabidopsis thaliana computed with the PROCHECK program. Residues in quadrangles Scattered residues Number Percentage Most favoured regions [A, B, L] 399 89.9% Additional allowed regions [a,b,l,p] 41 9.2% Generously allowed regions [~a,~b,~l,~p] 2 0.5% Disallowed regions [XX] 2 0.5% Non-glycine and non-proline residues 444 100.0% Bioinformation 6(3): 115-119 (2011) 119 2011 Biomedical Informatics