IBiSA_tools Documentation

Similar documents
Tutorial: Structural Analysis of a Protein-Protein Complex

Comparing whole genomes

Online Protein Structure Analysis with the Bio3D WebApp

Ligand Scout Tutorials

Reaxys Pipeline Pilot Components Installation and User Guide

April, The energy functions include:

The Schrödinger KNIME extensions

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents

Tutorial. Getting started. Sample to Insight. March 31, 2016

ATLAS of Biochemistry

ATP GTP Problem 2 mm.py

ncounter PlexSet Data Analysis Guidelines

VMD: Visual Molecular Dynamics. Our Microscope is Made of...

Conformational Analysis of n-butane

Hands-on pdb4dna.

OpenDiscovery: Automated Docking of Ligands to Proteins and Molecular Simulation

Statistics for Python

The TEN Package. User Guide

Dock Ligands from a 2D Molecule Sketch

Side-chain positioning with integer and linear programming

Packing_Similarity_Dendrogram.py

Description of the ED library Basic Atoms

Athena Visual Software, Inc. 1

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Experiment: Oscillations of a Mass on a Spring

HOW TO USE MIKANA. 1. Decompress the zip file MATLAB.zip. This will create the directory MIKANA.

Tutorial 11. Use of User-Defined Scalars and User-Defined Memories for Modeling Ohmic Heating

Elementary charge and Millikan experiment Students worksheet

Supporting Text 1. Comparison of GRoSS sequence alignment to HMM-HMM and GPCRDB

Supporting Information Practical Prediction of Heteropolymer Composition and Drift

Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C. Course Details

44 Force extension characteristics for a spring / elastic material

Department of Chemical Engineering University of California, Santa Barbara Spring Exercise 2. Due: Thursday, 4/19/09

Veller, M.G.P. van. This is a "Post-Print" accepted manuscript, which will be published in "Qualitative and Quantitative Methods in Libraries (QQML)"

How to Create a Substance Answer Set

Pymol Practial Guide

COMPUTER SESSION: ARMA PROCESSES

Protein Structure and Visualisation. Introduction to PDB and PyMOL

Reaxys Medicinal Chemistry Fact Sheet

Viewing and Analyzing Proteins, Ligands and their Complexes 2

ArcGIS 9 ArcGIS StreetMap Tutorial

MODULE 9 NORMAL DISTRIBUTION

Hands- On on geometries

Week 10: Homology Modelling (II) - HHpred

PNmerger: a Cytoscape plugin to merge biological pathways and protein interaction networks

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Table of Contents. Table of Contents LAMMPS trajectories into QuantumATK

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

User Guide for LeDock

QuantumWise. QuantumWise is now part of Synopsys

Working with protein structures. Benjamin Jack

Cosmic Ray Detector Software

Protein Structure Prediction and Display

Implementing MD Simulations and Principal Component Analysis to Calculate Protein Motions

LightWork Memo 18: Galactic Spectra Data Overview 1

XYZ file format Protein Data Bank (pdb) file format Z - matrix

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Introduction to Spark

Winmostar tutorial LAMMPS Polymer modeling V X-Ability Co,. Ltd. 2017/7/6

Guidance for Writing Lab Reports for PHYS 233:

Appendices. Appendix I: PDB file format

X-ray crystallography NMR Cryoelectron microscopy

SeeSAR 7.1 Beginners Guide. June 2017

Molecular modeling with InsightII

Introduction to Structure Preparation and Visualization

Calibration Routine. Store in HDD. Switch "Program Control" Ref 1/ Ref 2 Manual Automatic

Map image from the Atlas of Oregon (2nd. Ed.), Copyright 2001 University of Oregon Press

1 Introduction. command intended for command prompt

Octave. Tutorial. Daniel Lamprecht. March 26, Graz University of Technology. Slides based on previous work by Ingo Holzmann

Nature Structural and Molecular Biology: doi: /nsmb.2938

Cloud-based WRF Downscaling Simulations at Scale using Community Reanalysis and Climate Datasets

SUPPLEMENTARY INFORMATION

protein interaction analysis bulletin 6300

SUPPLEMENTARY INFORMATION. doi: /nature07461

Portal. User Guide Version 1.0. Contributors

Physics 750: Exercise 2 Tuesday, August 29, 2017

Lab 1: Empirical Energy Methods Due: 2/14/18

Package CEC. R topics documented: August 29, Title Cross-Entropy Clustering Version Date

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health

Supplementary Information Intrinsic Localized Modes in Proteins

Winmostar tutorial LAMMPS Polymer Annealing V X-Ability Co., Ltd. 2018/01/15

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

Mnova Software for Analyzing Reaction Monitoring NMR Spectra

Using SPSS for One Way Analysis of Variance

Application Note. U. Heat of Formation of Ethyl Alcohol and Dimethyl Ether. Introduction

----- Ver October 24, 2014 Bug about reading MOPAC2012 Ver.14 calculations of 1 atom and 2 atoms molecule was fixed.

Introduction to ArcMap

Networks & pathways. Hedi Peterson MTAT Bioinformatics

Measures of. U4 C 1.2 Dot plot and Histogram 2 January 15 16, 2015

BASIC TECHNOLOGY Pre K starts and shuts down computer, monitor, and printer E E D D P P P P P P P P P P

The Schrödinger KNIME extensions

Homework 9: Protein Folding & Simulated Annealing : Programming for Scientists Due: Thursday, April 14, 2016 at 11:59 PM

Supporting Online Material for

let s examine pupation rates. With the conclusion of that data collection, we will go on to explore the rate at which new adults appear, a process

Arcmap Manual Esri 10 READ ONLINE

GROMACS Implementation of Metadynamics in Essential Coordinates

Arcmap Manual Esri 10 READ ONLINE

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis

Satellite project, AST 1100

Transcription:

IBiSA_tools Documentation Release 1.00 Kota Kasahara Oct 24, 2016

CONTENTS 1 About 3 1.1 Authors.................................................. 3 1.2 Citation.................................................. 3 1.3 Requirements............................................... 3 1.4 Download................................................. 3 1.5 Installation................................................ 4 2 Overview 5 3 Tutorial 7 3.1 Setting the path to IBiSA_tools..................................... 7 3.2 Preparing the configuration file..................................... 7 3.3 Converting a trajectory into the pore axis coordinates.......................... 8 3.4 Drawing a trajectory of ions along the pore axis............................. 9 3.5 Analyzing a histogram of ions...................................... 9 3.6 Analyzing the density distribution of ions along the pore axis..................... 9 3.7 Discretizing the trajectory based on ion-binding sites.......................... 10 3.8 Analyzing the trajectories of each ion.................................. 10 3.9 Generating the ion-binding state graph................................. 11 3.10 Extracting cyclic paths from the state trajectory............................. 12 3.11 Converting states into characters. A cyclic parts transformed into a sequence::............ 12 3.12 Generating score matrix of states.................................... 13 3.13 Performing the sequence alignment................................... 13 3.14 Make the similarity matrix of cyclic paths................................ 13 3.15 Clustering aligned sequences by using R................................ 14 4 Appendix 15 4.1 Converting a trajectory file........................................ 15 i

ii

Contents: CONTENTS 1

2 CONTENTS

CHAPTER ONE ABOUT 1.1 Authors IBiSA_tools version 1.00 (12 Jul. 2016) Kota Kasahara, Ritsumeikan University, Japan Kengo Kinoshita, Tohoku University, Japan 1.2 Citation Kasahara K and Kinoshita K, (Under review) IBiSA_tools: A Computational Toolkit for the Ion Binding State Analysis on Molecular Dynamics Trajectories of Ion Channels. Kasahara K, Shirota M, and Kinoshita K (2016) Ion Concentration- and Voltage-Dependent Push and Pull Mechanisms of Potassium Channel Ion Conduction. PLoS ONE, 11, e0150716. Kasahara K, Shirota M, and Kinoshita K (2013) Ion Concentration-Dependent Ion Conduction Mechanism of a Voltage-Sensitive Potassium Channel. PLoS ONE, 8, e56342. 1.3 Requirements The current version of IBiSA_tools can be applied only for GROMACS trajectory file (.trr). If your trajectories are written in another format, you have to convert it into.trr, by using some tools, e.g., VMD plugin and MDAnalysis (see Appendix). IBiSA_tools is consisting of a C++ program and Python (2.6 or 2.7) scripts. The attached tutorial files use R software (www.r-project.org) to draw plots. Network drawing software, e.g., Cytoscape, is required to visualize ion-binding state graph. 1.4 Download https://github.com/kotakasahara/ibisa_tools 3

1.5 Installation Only a C++ program, trachan, must be compiled as follows: cd src/trachan./configure make In this document, we assume the binary and python scripts are included in the directory indicated as ${IBISA}. 4 Chapter 1. About

CHAPTER TWO OVERVIEW IBiSA_tools, which stands for Ion-Binding State Analysis tools, provides a computational tools for analyzing ion conduction mechanisms hidden in the molecular dynamics (MD) trajectory data. This analysis provides a varitery of information about each ion conduction event and overall properties. See the citations for details of theory and applications. A list of programs contained in IBiSA_tools (the left column in A) and the default names of input/output files (the right column in A) are summarized. The panels B, C, D, and E are schematic images of output figures (reprinted from our paper). 5

6 Chapter 2. Overview

CHAPTER THREE TUTORIAL 3.1 Setting the path to IBiSA_tools export IBISA="${HOME}/local/IBiSA_tools" Set your own install directory. 3.2 Preparing the configuration file config.txt: --fn-pdb init.pdb --dt 10 --site-boundary 20.0 --site-boundary -25.0 --fn-pore-axis-coordinates pore_axis.txt --fn-pore-axis-coordinates-r pore_axis_r.txt --pore-axis-basis-from A 374 O --pore-axis-basis-from B 374 O --pore-axis-basis-from C 374 O --pore-axis-basis-from D 374 O --pore-axis-basis-to A 377 O --pore-axis-basis-to B 377 O --pore-axis-basis-to C 377 O --pore-axis-basis-to D 377 O --site-max-radius 10.0 --site-height-margin 5.0 --channel-chain-id ABCD --trace-atom-name K --fn-trr traj.trr Each line indicate a set of key and values. fn-pdb is the initial structure file. dt is the time step of the trajectory file site-boundary is specified two values, 20.0 and -25.0. This setting means that the range from 20.0 to -25.0 in the pore axis will be analyzed. The origin of the pore axis is set by pore-axis-basis-from key. fn-pore-axis-coordinates and fn-pore-axis-coordinates-r are the output file names. 7

pore-axis-basis-from specifies the origin of the pore axis. A 374 O means the O atom in the residue 374 of the chain A. In this tutorial, four oxigen atoms are specified. The center of these atoms is set to be the origin of the pore axis. pore-axis-basis-to defines the direction of the pore axis. The line from the center of...-from to the center of...-to defines the pore axis. channel-chain-id specifies chain IDs of a channel protein. trace-atom-name specifies the atom name of target ions, defined in the.pdb file. fn-trr is the file name of the trajectory. Multiple files can be specified. site-max-radius (optional) is the maximum radius of ion channel pore. As the channel pore is narrowed by the protein structure and the range of ion-binding sites on the radial coordinates is sterically defined by the protein structure in usual cases, this setting is not important, but is should be set large value enough to cover the pore. site-height-margin (optional) is the margin length along the pore axis. In this case, the range from 20.0+5.0 to -25.0-5.0 will be analyzed. 3.3 Converting a trajectory into the pore axis coordinates $IBISA/bin/trachan --fn-cfg config.txt Some text should appear in the standard output: ----------------------------------------------------------------- TraChan TRAjectory analyzer for CHANnel pore axis ----------------------------------------------------------------- Copyright (c) 2016 Kota Kasahara, Ritsumeikan University This software is distributed under the terms of the GPL license ----------------------------------------------------------------- This program contains some parts of the source code of GROMACS software. Check out http://www.gromacs.org about GROMACS. Copyright (c) 1991-2000, University of Groningen, The Netherlands Copyright (c) 2001-2010, The GROMACS development team at Uppsala University & The Royal Institute of Technology, Sweden. ----------------------------------------------------------------- TraChan::mainRoutine() site occupancy mode n_atoms : 6997 pore axis basis a 1014 O THR 374 2735 O THR 374 4456 O THR 374 6177 O THR 374 pore axis basis b 1058 O TYR 377 2779 O TYR 377 4500 O TYR 377 6221 O TYR 377 open pore_axis.txt open pore_axis_r.txt average axis length = 0.945331 sd axis length = 0.000977426 8 Chapter 3. Tutorial

And the two output files, pore_axis.txt and pore_axis_r.txt will be obtained. They are tab-separated text files. The first column indicates the time, and the other columns correspond to the coordinate of each potassium ion. Each row indicates the positions in each snapshot. pore_axis.txt and pore_axis_r.txt record the coordinates along the pore axis and those along the radial direction parpendicular to the pore axis. 3.4 Drawing a trajectory of ions along the pore axis R --vanilla --slave < $IBISA/r/pore_axis_traj.R pore_axis_traj.eps shows the time course of ion coordinates. 3.5 Analyzing a histogram of ions Histogram of ion frequency over the 2D pore axis space can be drawn by: python $IBISA/bin/ion_histogram.py \ --i-pore-crd-h pore_axis.txt \ --i-pore-crd-r pore_axis_r.txt \ --o-histogram histogram.txt \ --atomname K R --vanilla --slave < $IBISA/r/histogram.R histogram.eps is the 1D and 2D histogram of ions. 3.6 Analyzing the density distribution of ions along the pore axis R --vanilla --slave < $IBISA/r/pore_axis_density.R density_distribution.eps is the distribution plot. 3.4. Drawing a trajectory of ions along the pore axis 9

This plot clearly shows localization of ions in the ion-binding sites. On the basis of this plot, we can define the boundary of each ion binding site. 3.7 Discretizing the trajectory based on ion-binding sites Here, we use the definition which is determined in our previous reports. The boundaries of ion binding sites are 15.13, 12.93, 9.32, 6.25, 3.00, 0.44, -2.21, -6.08, and -20.: python $IBISA/bin/site_occupancy.py \ --i-pore-crd-h pore_axis.txt \ --i-pore-crd-r pore_axis_r.txt \ --o-site-occ site_occ.txt \ --atomname K \ -b 12.93 -b 9.32 -b 6.25 -b 3.00 -b 0.44 -b -2.21 -b -6.08 -b -20 \ -n '-1' -n 0 -n 1 -n 2 -n 3 -n 4 -n 5 -n 6 The output file site_occ.txt records information about what ions are retained in each ion binding sites in each snapshot. site_ooc.txt: 0 1:6946:K 3:6985:K 4:6993:K 6:6935:K 10 1:6946:K 3:6985:K 4:6993:K 6:6935:K 20 1:6946:K 3:6985:K 4:6993:K 6:6935:K 30 1:6946:K 3:6985:K 4:6993:K 6:6935:K 40 1:6946:K 3:6985:K 4:6993:K 6:6935:K 1:6956:K means the ion K with the ID 6946 is bound at the site 1. 3.8 Analyzing the trajectories of each ion python ${IBISA}/bin/analyze_ion_path.py \ --i-site-occ site_occ.txt \ --o-all-path site_path.txt \ --o-count-full site_path_count_full.txt \ --o-count-head-tail site_path_count_ht.txt 10 Chapter 3. Tutorial

The output site_path.txt: 6985 K *:3:0:* 0:7250 *:3:2:1:0:* 0:5290:5340:7240:7250 6985 K *:0:* 7320:7350 *:0:* 7320:7350 6985 K *:0:* 7540:7570 *:0:* 7540:7570 The first column indicates the ID of the ion. At the third column, :3:0: means this ion got into the pore at site 3, and went out from the site 0. The fourth column denote the times for getting into and going out from the pore. The fifth column, :3:2:1:0: indicates the full trajectory of this ion from association the to pore and dissociation from the pore. Each line corresponds to each event starting with an ion association and ending with a dissociation of that ion. The third and fourth columns are abbreviations of the fifth and sixth columns, respectively. 3.9 Generating the ion-binding state graph python $IBISA/bin/analyze_site_state.py \ --i-site-occ site_occ.txt \ --o-states state_traj.txt \ --o-graph state_graph.gml \ --atomname K The ion binding state graph can be visualized by using the output file state_graph.gml with a network analysis software, e.g., Cytoscape. state_traj.txt records the ion binding state in each snapshot: 0 K:1:3:4:6 K:6946:6985:6993:6935 10 K:1:3:4:6 K:6946:6985:6993:6935 20 K:1:3:4:6 K:6946:6985:6993:6935 30 K:1:3:4:6 K:6946:6985:6993:6935 The third column indicate the IDs of ions in the ion binding sites. 3.9. Generating the ion-binding state graph 11

3.10 Extracting cyclic paths from the state trajectory python $IBISA/bin/extract_cycles.py \ --i-state state_traj.txt \ --o-cycles state_traj_cycles.txt \ --o-state-dict state_dict_pre.txt \ --title "sample" option title is an arbitrary string. state_traj_cycles.txt stores the cyclic paths: > 1 6010 7830 sample 6010 K:0:2:4 K:6985:6993:6935 6630 K:0:2:4:6 K:6985:6993:6935:6961 7190 K:0:2:4:5 K:6985:6993:6935:6961 7230 K:0:1:3:5 K:6985:6993:6935:6961 7820 K:1:3:5 K:6993:6935:6961 7830 K:0:2:4 K:6993:6935:6961 The line begining with > is the header line. The cyclic path 1 starts at 6010 and ends at 7830. 3.11 Converting states into characters. A cyclic parts transformed into a sequence:: python $IBISA/bin/cycle_to_sequence.py \ --i-cycles state_traj_cycles.txt \ --i-state-dict state_dict_pre.txt \ --o-state-dict state_dict.txt \ --o-sequence sequences.fsa state_dict.txt describes a correspondence between states and characters. sequences.fsa is the sequences of cyclic paths. > 0 0 *POMFB* 5 28990 31650 sample *POMFB* > 1 0 *POMFE* 8 44310 45210 sample *POMFE* > 2 0 *PJHF* 7 39280 42200 sample *PJHF* > 3 0 *POKLF* 6 36850 38640 sample *POKLF* Assignments of characters for states can be modified by revising the file specified as i-state-dict option. For clarity, the characters for the ion binding states with 4 ions are replaced with lower cases, and 5-ion state is #. state_dict.txt: K:2:3:6 I K:2:4:6 J K:0:1:3:5 K:0:1:3:6 K:0:2:3:5 k l m 12 Chapter 3. Tutorial

.. K:0:2:3:5:6 # Then, re-do cycle_to_sequence.py: python $IBISA/bin/cycle_to_sequence.py \ --i-cycles state_traj_cycles.txt \ --i-state-dict state_dict.txt \ --o-sequence sequences.fsa 3.12 Generating score matrix of states python $IBISA/bin/make_score_matrix.py \ --i-state-dict state_dict.txt \ --o-score score_matrix.txt For each pair of ion-binding states, when the two states are identical, the similarity score is 1.0. When the two states have the same number of ions, the score is 0.5. Otherwise, teh score is 0.0. 3.13 Performing the sequence alignment python $IBISA/bin/dp_align.py \ --i-score-matrix score_matrix.txt \ --i-sequence sequences.fsa \ --o-align align.txt -a\ --min-len 4 \ -g 1.0 \ -m 1.0 \ --ignore * -g and -m are gap score and match scores, respectively. The output file align.txt shows the pairwise alignments > 2 3 0.0 2 0 *pjhf* 7... *pjh-f* *poklf* > 0 3 1.0 0 0 *pomfb* 5... *pom-fb* *poklf-* 3.14 Make the similarity matrix of cyclic paths python $IBISA/bin/align_similarity.py \ --i-align align.txt \ --i-sequence sequences.fsa \ --o-sim align_sim.txt \ -g 1 -m 1 3.12. Generating score matrix of states 13

3.15 Clustering aligned sequences by using R R --vanilla --slave < $IBISA/r/clustering_seq.R 14 Chapter 3. Tutorial

CHAPTER FOUR APPENDIX 4.1 Converting a trajectory file IBiSA_tools can read only.trr format. When you analyze trajectories written in other format, the files have to be converted into.trr file. Some tools for the conversion exist. Here, a sample code powered by the Python library MDAnalysis is presented.: import MDAnalysis u = MDAnalysis.Universe("initial.pdb","trajectory.ncdf") writer = MDAnalysis.coordinates.TRJ.TRRWriter("trajectory.trr", len(u.atoms)) for ts in u.trajectory: writer.write_next_timestep(ts) The script converts trajectory.ncdf (AMBER, NetCDF file) into trajectory.trr file (GROMACS,.trr file). initial.pdb is the initial coordinates writtein in the flat.pdb format. 15