ADRIANA.Code and SONNIA. Tutorial


ADRIANA.Code and SONNIA Tutorial
Modeling Corticosteroid Binding Globulin Receptor Activity
Molecular Networks GmbH Computerchemie
July


Molecular Networks GmbH Computerchemie
Henkestr Erlangen Germany

This document is copyright by Molecular Networks GmbH Computerchemie. All rights reserved. Except as permitted under the terms of the Software Licensing Agreement of Molecular Networks GmbH Computerchemie, no part of this publication may be reproduced or distributed in any form or by any means or stored in a database retrieval system without the prior written permission of Molecular Networks GmbH Computerchemie.

The software described in this document is furnished under a license and may be used and copied only in accordance with the terms of such license.

ADRIANA and SONNIA are registered trademarks in the Federal Republic of Germany. Other product names and company names may be trademarks or registered trademarks of their respective owners, in the Federal Republic of Germany and other countries.

(Document version: CHS/LT )


Contents

Introduction and Objective
The Dataset
Calculating Molecular Descriptors with ADRIANA.Code
    Step 1: Start ADRIANA.Code, Load Structure File and Set Output File Options
    Step 2: Select and Calculate the Molecular Descriptors
    Step 3: Calculate a Descriptor File with Experimental pk Values
Classification of Compounds According to their Biological Activity with SONNIA
    Step 1: Start SONNIA, Load the Descriptor and the Structure File
    Step 2: Create and Train a Kohonen Neural Network
    Step 3: Create a Kohonen Map
    Step 4: Analyze a Kohonen Map
Quantitative Modeling of Biological Activities with SONNIA
    Step 1: Start SONNIA, Load the Descriptor and the Structure File
    Step 2: Create and Train a Counterpropagation Neural Network
    Step 3: Visualize the Trained Counterpropagation Network
    Step 4: Write and Analyze the Prediction File
Tips and Tricks
    Preprocessing Data Files
    Training Parameters of a Neural Network
    Assessing the Quality of an Unsupervised Classification
Problems and Help!
References


Introduction and Objective

Statistical and machine learning methods are widely used to establish relationships between the chemical structure of a compound and its biological activities or its physical and chemical properties. In combination with structure descriptors, these methods are used to derive models that can be applied to predict properties of new compounds.

The objective of this tutorial is to show, by means of an example, how the descriptor calculation package ADRIANA.Code [1] and the neural network package SONNIA [2] can be applied in qualitative and quantitative structure-activity relationship (QSAR) studies. The tutorial guides the user through the entire workflow, starting from a dataset of chemical structures with experimentally derived biological activities, and describes how to

- calculate molecular descriptors for a dataset of compounds with ADRIANA.Code,
- classify compounds according to their biological activity with a Kohonen neural network implemented in SONNIA, and
- quantitatively model a biological activity using the counterpropagation neural network implemented in SONNIA.

In addition, the tutorial gives some hints, tips and tricks that are helpful when ADRIANA.Code and SONNIA are applied to other datasets and QSAR studies. For further information about the usage of ADRIANA.Code and SONNIA and about the methods implemented in them, please refer to the respective program manuals.

The example "Modeling Corticosteroid Binding Globulin (CBG) Receptor Activity" is taken from the literature [3]. The dataset comprises 31 steroid compounds and their experimental CBG receptor binding affinity values (pk values). Based on the pk values, the compounds were pre-classified into three classes: high, medium and low CBG binding affinity.

In the example study, each molecule of the dataset is represented by a vector of 12 autocorrelation coefficients that encode the spatial distribution of the electrostatic potential on the molecular surface (calculated by ADRIANA.Code). These descriptors are then used to classify the compounds according to the three CBG activity classes using an unsupervised Kohonen neural network (implemented in SONNIA). Finally, a supervised neural network method (the counterpropagation neural network implemented in SONNIA) is used to quantitatively model the pk values.

The dataset of 31 steroid compounds can be downloaded from Molecular Networks' web server.

The Dataset

The dataset of 31 steroid compounds and their CBG receptor binding affinity values is stored in MDL SDFile format [4]. All chemical structures are fully defined, including hydrogen atoms and stereo information (atom parity flags). For each record, the experimentally determined biological activity (pk value) is contained in the SDF data field <CBG_ACTIVITY_pK>.

Furthermore, the compounds are pre-classified into three different affinity classes:

- high affinity (class 1)
- medium affinity (class 2)
- low affinity (class 3)

The binding affinity class is stored in the SDF data field <CBG_ACTIVITY_CLASS>. Figure 1 shows the structures of the dataset sorted by their CBG receptor binding affinity class.

Figure 1 Dataset of 31 steroid compounds, grouped into high affinity (class 1), medium affinity (class 2) and low affinity (class 3).
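To inspect the two data fields outside of ADRIANA.Code, the SDFile can be read with a few lines of Python. The following is only a minimal sketch, not part of either program: the function name read_sdf_data_fields and the two sample records (names, abbreviated connection tables and values) are invented for illustration, and only the field names <CBG_ACTIVITY_pK> and <CBG_ACTIVITY_CLASS> come from the dataset description above.

```python
from typing import Dict, List

# Two invented, abbreviated records; the values are NOT taken from the dataset.
SAMPLE_SDF = """demo_compound_a
  hypothetical

  0  0  0  0  0  0  0  0  0  0999 V2000
M  END
>  <CBG_ACTIVITY_pK>
-6.0

>  <CBG_ACTIVITY_CLASS>
1

$$$$
demo_compound_b
  hypothetical

  0  0  0  0  0  0  0  0  0  0999 V2000
M  END
>  <CBG_ACTIVITY_pK>
-7.5

>  <CBG_ACTIVITY_CLASS>
3

$$$$
"""


def read_sdf_data_fields(sdf_text: str) -> List[Dict[str, str]]:
    """Return one dictionary of SDF data fields per record."""
    records: List[Dict[str, str]] = []
    for block in sdf_text.split("$$$$"):
        lines = block.strip("\n").splitlines()
        if not any(line.strip() for line in lines):
            continue  # nothing but whitespace after the last "$$$$"
        fields: Dict[str, str] = {}
        i = 0
        while i < len(lines):
            line = lines[i]
            if line.startswith(">") and "<" in line:
                # Header line such as ">  <CBG_ACTIVITY_pK>"
                name = line[line.index("<") + 1 : line.rindex(">")]
                values = []
                i += 1
                # The value runs until the next blank line.
                while i < len(lines) and lines[i].strip():
                    values.append(lines[i].strip())
                    i += 1
                fields[name] = "\n".join(values)
            i += 1
        records.append(fields)
    return records


records = read_sdf_data_fields(SAMPLE_SDF)
for fields in records:
    print(fields["CBG_ACTIVITY_CLASS"], float(fields["CBG_ACTIVITY_pK"]))
```

For the real dataset, the text of steroids31_act.sdf would be passed to the function instead of SAMPLE_SDF.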

Calculating Molecular Descriptors with ADRIANA.Code

The following sections describe the calculation of a set of molecular descriptors with ADRIANA.Code. A descriptor file is generated that represents each molecule of the dataset by a vector of 12 autocorrelation coefficients encoding the spatial distribution of the electrostatic potential on the molecular surface.

Step 1: Start ADRIANA.Code, Load Structure File and Set Output File Options

Start the graphical user interface (GUI) of ADRIANA.Code by double-clicking on its desktop icon. Load the structure file steroids31_act.sdf by clicking on the button ... in the section Input of the ADRIANA.Code GUI and selecting the file in the dialog box Choose a structure file to open (see Figure 2).

Figure 2 Loading a chemical structure file.

In the section Output, set the output file format to SONNIA in the drop-down menu Format. Click on the button ... in the section Output and, in the dialog box Choose an output file to write to, set the name of the output file to steroids31_actclass_mep_ac12.dat in the same directory where the input file is located.

Note: The full name of the output file (file name and path) is set automatically but can be changed by the user, either in the field File in the section Output or by using the dialog box as described above.

Click on the button Select properties in the section Output, choose NAME in the drop-down menu Compound ID property, check the box CBG_ACTIVITY_CLASS in the list Select properties to copy and confirm with the button OK (see Figure 3).

Figure 3 Selecting the properties of the output file.

Step 2: Select and Calculate the Molecular Descriptors

Select Autocorrelation of Molecular Surface Properties, molecular electrostatic potential (SurfACorr_ESP) in the list Available in the section Descriptors and press the button > to select the descriptor for calculation. SurfACorr_ESP now appears in the list Selected. Use the default settings and parameters in the section Available Control Parameters (see Figure 4).

Figure 4 Selecting the descriptor.

Press the button Calculate.

Note: ADRIANA.Code now calculates for each compound a vector of 12 autocorrelation coefficients that encode the spatial distribution of the electrostatic potential on the molecular surface.

After the descriptor calculation is finished, a dialog box appears. Press the button View output file to display the output file in a table-formatted view. The first 12 columns contain the 12 autocorrelation coefficients. The last two columns contain the affinity class (CBG_ACTIVITY_CLASS; 1 = high affinity, 2 = medium affinity, 3 = low affinity) and the name of the compound with a leading "!", which SONNIA interprets as the compound name (see Figure 5).

Figure 5 Viewing the output file.
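The idea behind an autocorrelation vector of a surface property can be illustrated with a small sketch. This is a generic spatial autocorrelation, not ADRIANA.Code's implementation: the function name, the way surface points are sampled, and the distance range of 1 to 13 split into 12 equal intervals are all assumptions made for this example. Each coefficient is the average product of the property values of all point pairs whose distance falls into one interval.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def surface_autocorrelation(
    points: Sequence[Point],
    values: Sequence[float],
    d_min: float = 1.0,   # assumed lower distance limit
    d_max: float = 13.0,  # assumed upper distance limit
    n_bins: int = 12,     # one coefficient per distance interval
) -> List[float]:
    """Average product of property values per distance interval."""
    width = (d_max - d_min) / n_bins
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            d = math.dist(points[i], points[j])
            if d < d_min or d >= d_max:
                continue  # pair lies outside the considered distance range
            k = min(int((d - d_min) / width), n_bins - 1)
            sums[k] += values[i] * values[j]
            counts[k] += 1
    # Intervals without any point pair are reported as 0.0.
    return [s / c if c else 0.0 for s, c in zip(sums, counts)]


# Three invented surface points on a line with invented potential values.
demo_points = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
demo_values = [1.0, 2.0, 3.0]
coefficients = surface_autocorrelation(demo_points, demo_values)
print(coefficients)
```

Because the result depends only on pairwise distances, such a vector has a fixed length regardless of molecule size and does not change when the molecule is rotated or translated.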

Step 3: Calculate a Descriptor File with Experimental pk Values

Change the name of the output file to steroids31_actpk_mep_ac12.dat. In the dialog box Select properties, select CBG_ACTIVITY_pK instead of CBG_ACTIVITY_CLASS and confirm with the button OK (see Figure 6).

Figure 6 Selecting the properties of the output file.

Calculate the descriptors by pressing the button Calculate.
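Both descriptor files follow the column layout described in Step 2: 12 coefficients, one property value, and the compound name marked by a leading "!". A data row of that shape could be split as sketched below. The sketch assumes whitespace-separated columns and ignores any header lines the real file may contain; neither detail is specified in this tutorial, and the example row is invented.

```python
from typing import List, Tuple


def parse_descriptor_line(
    line: str, n_descriptors: int = 12
) -> Tuple[List[float], float, str]:
    """Split one data row into (descriptors, property value, compound name)."""
    # Everything after the first "!" is treated as the compound name.
    data_part, _, name = line.partition("!")
    numbers = [float(token) for token in data_part.split()]
    if len(numbers) != n_descriptors + 1:
        raise ValueError(
            f"expected {n_descriptors + 1} numeric columns, got {len(numbers)}"
        )
    return numbers[:n_descriptors], numbers[n_descriptors], name.strip()


# Invented example row: 12 made-up coefficients, class 2, hypothetical name.
demo_line = "0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 2 !demo_steroid"
descriptors, value, name = parse_descriptor_line(demo_line)
print(len(descriptors), value, name)
```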

Classification of Compounds According to their Biological Activity with SONNIA

The following section describes the classification of the steroid compounds according to their CBG binding affinity class (CBG_ACTIVITY_CLASS) using the Kohonen neural network algorithm implemented in SONNIA. The Kohonen algorithm is an unsupervised, non-linear mapping technique that projects the twelve-dimensional descriptor space (12 autocorrelation coefficients) into a two-dimensional plane (Kohonen map). The information about the CBG affinity class is not used for the projection (unsupervised learning). The neurons of the resulting Kohonen map are color-coded according to the CBG receptor binding affinity class (high, medium or low) of the compounds that are assigned to a specific neuron.

Step 1: Start SONNIA, Load the Descriptor and the Structure File

Start the graphical user interface (GUI) of SONNIA by double-clicking its desktop icon. Select Read... in the menu File in the main menu bar. The dialog box SONNIA Read appears (see Figure 7).

Figure 7 Loading the descriptor and structure file into SONNIA.

In the list Directory, select the directory where the structure and descriptor files are located, and select Data File in the drop-down menu Object. Select the file steroids31_actclass_mep_ac12.dat and press the button OK.

In order to load the structure file, repeat this procedure, but select Structure File in the drop-down menu Object and select the file steroids31_act.sdf.

Step 2: Create and Train a Kohonen Neural Network

Select Create... in the menu Network in the main menu bar. The dialog box SONNIA Network appears (see Figure 8).

Figure 8 Creating a Kohonen neural network.

Ensure that Kohonen is set in the drop-down menu in the section Algorithm and that Topology is set to toroidal. The size of the network is set automatically by SONNIA. In this example, the network (plane) has a size of 5 (width) x 3 (height) = 15 neurons. Enter the number 12 in the field Input in the section Network Dimensions (this is the number of descriptors of each molecule). Use the default settings for all other parameters. Press the button Create.

Select Train... in the menu Network in the main menu bar. The dialog box SONNIA Training appears (see Figure 9).

Figure 9 Setting the training parameters for a Kohonen neural network.

Use the default settings for all parameters (see Figure 9) and press the button Train. The window SONNIA Monitor appears, which shows how the dynamic error (distance between input vectors and neuron weights) changes with the number of training cycles (see Figure 10).

Figure 10 Training a Kohonen neural network.

The training is finished when the button Stop in the window SONNIA Monitor changes to OK (see also Figure 10).
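What happens during such a training run can be sketched in a few lines of Python. This is a generic, simplified Kohonen (self-organizing map) loop on a toroidal 5 x 3 grid, not SONNIA's implementation. The rate factor of 0.95, the 50 epochs, the linear shrinking of the neighborhood and the form of the neighborhood influence are assumptions chosen for the example.

```python
import random
from typing import Dict, List, Sequence, Tuple

Position = Tuple[int, int]  # (column, row) of a neuron


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def wrapped_offset(a: int, b: int, size: int) -> int:
    """Grid distance along one axis when the map wraps around (toroidal)."""
    d = abs(a - b)
    return min(d, size - d)


def winning_neuron(
    weights: Dict[Position, List[float]], pattern: Sequence[float]
) -> Position:
    """Neuron whose weight vector is closest to the input pattern."""
    return min(weights, key=lambda pos: squared_distance(weights[pos], pattern))


def train_kohonen(
    patterns: Sequence[Sequence[float]],
    width: int = 5,
    height: int = 3,
    epochs: int = 50,           # assumed value
    rate: float = 0.5,
    rate_factor: float = 0.95,  # assumed value
    seed: int = 0,
) -> Dict[Position, List[float]]:
    rng = random.Random(seed)
    dim = len(patterns[0])
    weights = {
        (c, r): [rng.random() for _ in range(dim)]
        for c in range(width)
        for r in range(height)
    }
    for epoch in range(epochs):
        # The neighborhood starts at half the map size and shrinks linearly.
        shrink = 1.0 - epoch / epochs
        span_x, span_y = (width / 2) * shrink, (height / 2) * shrink
        for pattern in patterns:
            win_c, win_r = winning_neuron(weights, pattern)
            for (c, r), w in weights.items():
                dx = wrapped_offset(c, win_c, width)
                dy = wrapped_offset(r, win_r, height)
                if dx > span_x or dy > span_y:
                    continue  # neuron lies outside the current neighborhood
                # The winner adapts with the full rate, neighbors with less.
                influence = rate * (1.0 - max(dx, dy) / (max(span_x, span_y) + 1.0))
                for k in range(dim):
                    w[k] += influence * (pattern[k] - w[k])
        rate *= rate_factor  # the learning rate is reduced after each epoch
    return weights


demo_weights = train_kohonen([[5.0, 5.0]])
demo_winner = winning_neuron(demo_weights, [5.0, 5.0])
print(demo_winner, demo_weights[demo_winner])
```

In the tutorial dataset, each pattern would be one 12-dimensional descriptor vector. The class labels are never passed to the training function, which is what makes the method unsupervised.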

Step 3: Create a Kohonen Map

Select Palette Editor... in the menu Maps in the main menu bar. The dialog box SONNIA Palette Editor appears. Choose 3 in the drop-down menu Colors (this is the number of classes) and 1 (default, this is the position of the affinity class in the input vector) in the field Output (see Figure 11 left). Confirm the settings by pressing the button Apply.

Figure 11 Setting the number and type of used colors for the Kohonen map.

Note: The default colors can be changed by clicking on a color in the section Palette of the dialog box SONNIA Palette Editor. The dialog box SONNIA Color Editor appears (see Figure 11 right). The color can now be changed by using the sliders or by entering color values for Red, Green and Blue. Confirm by pressing the button Apply.

Select Selected Maps in the menu Maps in the main menu bar. The Kohonen maps are generated and displayed (see Figure 12). Each colored square in the map corresponds to one neuron.

Note: By default, two Kohonen maps are generated. The first map is color-coded by the most frequent pattern that has been mapped into a neuron. In this example, this is the most frequent CBG binding affinity class. For instance, if two compounds with high affinity and one compound with medium affinity were mapped into a single neuron, the neuron is color-coded with the color for high affinity (class 1, red). The second map additionally shows all neurons that contain compounds of at least two different classes (collision or conflict neurons). These neurons are marked in black (see Figure 12, right map).

Note: The number and type of default maps can be changed by selecting Selected Maps... in the menu Maps in the main menu bar. By default, the map types most frequent output and average output (conflicts) are checked (selected). Check further map types to add them to the default maps that are generated when selecting Selected Maps in the menu Maps in the main menu bar.
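The coloring rule of the two default maps can be expressed compactly. The sketch below is an illustration only: the function name and the input format, a list of (neuron, class) pairs, are assumptions and not a SONNIA export format. It returns the most frequent class per neuron and the set of conflict neurons.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Set, Tuple

Position = Tuple[int, int]


def summarize_map(
    assignments: Iterable[Tuple[Position, int]]
) -> Tuple[Dict[Position, int], Set[Position]]:
    """Return (most frequent class per neuron, neurons holding mixed classes)."""
    per_neuron: Dict[Position, Counter] = defaultdict(Counter)
    for neuron, label in assignments:
        per_neuron[neuron][label] += 1
    # On a tie, Counter.most_common keeps the class that was seen first.
    most_frequent = {n: c.most_common(1)[0][0] for n, c in per_neuron.items()}
    conflicts = {n for n, c in per_neuron.items() if len(c) > 1}
    return most_frequent, conflicts


# The situation from the note: two high-affinity compounds (class 1) and one
# medium-affinity compound (class 2) share neuron (0, 0).
demo_assignments = [((0, 0), 1), ((0, 0), 2), ((0, 0), 1), ((3, 1), 3)]
most_frequent, conflicts = summarize_map(demo_assignments)
print(most_frequent, conflicts)
```

Neuron (0, 0) is colored as class 1 in the first map and is additionally flagged as a conflict neuron in the second map, whereas neuron (3, 1) holds a single class and is not flagged.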

Figure 12 Visualizing the Kohonen map colored by the most frequent activity class in each neuron (left) and with marked collision neurons, i.e., neurons containing at least two molecules of different classes (right).

Note: The generated Kohonen maps have a toroidal geometry (see also Figure 26). Therefore, each neuron in the map has the same number of neighbors (8), including the neurons at the edges.

By clicking on the map and holding the left mouse button, the maps can be shifted in x and y direction. Note that only the selected map is shifted. All other maps remain unchanged.

Right-click on a map and select Tile... in the context menu. The window SONNIA Tiling appears (see Figure 13). Due to their toroidal geometry, the maps can be tiled. Tile more maps by changing the size of the window SONNIA Tiling with the mouse.

Note: Tiled maps often visualize the result of the Kohonen mapping better and help to assess the quality of the classification.
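The consequences of the toroidal geometry, namely eight neighbors for every neuron and the possibility to shift and tile a map, can be reproduced with modular index arithmetic. The helper names below are hypothetical and the sketch is independent of SONNIA. The non-wrapping case is included only for comparison with the rectangular topology discussed under Tips and Tricks.

```python
from typing import List, Set, Tuple

Position = Tuple[int, int]


def neighbors(
    col: int, row: int, width: int, height: int, toroidal: bool = True
) -> Set[Position]:
    """Direct neighbors of a neuron on a width x height map."""
    result: Set[Position] = set()
    for dc in (-1, 0, 1):
        for dr in (-1, 0, 1):
            if dc == 0 and dr == 0:
                continue
            c, r = col + dc, row + dr
            if toroidal:
                result.add((c % width, r % height))  # wrap around the edges
            elif 0 <= c < width and 0 <= r < height:
                result.add((c, r))
    result.discard((col, row))
    return result


def shift_map(grid: List[List[int]], dx: int, dy: int) -> List[List[int]]:
    """Shift a toroidal map by dx columns and dy rows; nothing is lost."""
    height, width = len(grid), len(grid[0])
    return [
        [grid[(r - dy) % height][(c - dx) % width] for c in range(width)]
        for r in range(height)
    ]


def tile_map(grid: List[List[int]], nx: int, ny: int) -> List[List[int]]:
    """Repeat the map nx times horizontally and ny times vertically."""
    return [row * nx for _ in range(ny) for row in grid]


corner_toroidal = neighbors(0, 0, 5, 3)
corner_rectangular = neighbors(0, 0, 5, 3, toroidal=False)
demo_grid = [[1, 2, 3], [4, 5, 6]]
print(len(corner_toroidal), len(corner_rectangular))
print(shift_map(demo_grid, 1, 0))
print(tile_map(demo_grid, 2, 2))
```

On the 5 x 3 map of this tutorial, the corner neuron (0, 0) has eight neighbors when the map wraps around, for example (4, 2) on the opposite corner.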

Figure 13 Tiling of a Kohonen map.

Step 4: Analyze a Kohonen Map

In order to visualize which compounds were mapped into which neurons, left-click on a neuron while keeping the Ctrl key pressed. The neuron is now selected and is marked in light gray. Right-click on the selected neuron and select Export Structures... in the context menu. The Structure Browser appears and displays the compounds that have been mapped into the selected neurons (see Figure 14).

Figure 14 Displaying the chemical structures that are mapped to a specific neuron.

Note: The structure file must have been loaded into SONNIA (see also Figure 7) to use this functionality.

Note: More than one neuron can be selected by left-clicking on the map while keeping the Ctrl key pressed and dragging the mouse over the map. The focus of the selection is shown by a temporary rectangle while dragging the mouse. All selected neurons are finally marked in light gray.

Note: Neurons can be de-selected by left-clicking on the neuron while keeping the Ctrl and the Shift key pressed.

Note: Additional properties that are stored in the structure file (e.g., compound names, CBG affinity classes) can be displayed in the Structure Browser by selecting Chemical Properties... in the menu Display of the main menu bar of the Structure Browser (Prop tabs in the Browser Annotation Display Style).

Right-click on a map and select Export Centroids... in the context menu. The Structure Browser appears and now displays the centroid compounds of all neurons (see Figure 15). The arrangement of the Structure Browser always reproduces the size of the network (here: 5 x 3).

Note: The centroid compound of a neuron is the compound whose descriptor vector (twelve dimensions) is most similar to the weight vector of the neuron (also twelve dimensions). Of all compounds that have been mapped to this neuron, the centroid compound has the descriptor vector with the minimum Euclidean distance to the vector of the neuron weights.

Figure 15 Displaying the centroid structures of all neurons.

In order to export the contents of all neurons (i.e., the information which compounds are mapped into which neurons), select Export Contents... in the menu Analyze in the main menu bar. The dialog box SONNIA Write appears (see Figure 16).

Figure 16 Exporting the contents of all neurons.

Select a directory in the list Directory and select CSV File (Contents Maps) in the drop-down menu Object. Enter a file name, e.g., steroids31_contentmap.csv, in the field Files and confirm with the button OK.

Note: The ASCII csv file (csv: comma separated values) can be displayed with a standard ASCII file browser or loaded into spreadsheet programs (e.g., Microsoft Excel). Figure 17 shows the content of the csv file (displayed in Microsoft WordPad).

Figure 17 Displaying a contents maps file (csv).
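The definition of the centroid compound given for Export Centroids (the mapped compound with the minimum Euclidean distance to the neuron weights) translates directly into code. The following sketch uses a hypothetical function name and invented two-dimensional vectors instead of the twelve-dimensional descriptors. It is meant to illustrate the definition, not how SONNIA computes it internally.

```python
import math
from typing import Dict, Optional, Sequence


def centroid_compound(
    neuron_weights: Sequence[float],
    members: Dict[str, Sequence[float]],
) -> Optional[str]:
    """Name of the mapped compound closest to the neuron's weight vector.

    `members` maps compound names to the descriptor vectors of the compounds
    assigned to this neuron; an empty neuron has no centroid.
    """
    if not members:
        return None
    return min(members, key=lambda name: math.dist(members[name], neuron_weights))


# Invented example: two compounds mapped to a neuron with weights (1.0, 1.0).
demo_members = {"demo_a": [1.2, 0.9], "demo_b": [3.0, 3.0]}
demo_centroid = centroid_compound([1.0, 1.0], demo_members)
print(demo_centroid)
```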

Quantitative Modeling of Biological Activities with SONNIA

The following section describes the quantitative modeling of the CBG receptor binding affinity (CBG_ACTIVITY_pK) of the 31 steroid compounds using the counterpropagation neural network algorithm implemented in SONNIA. Again, each compound of the dataset is represented by a twelve-dimensional autocorrelation vector that encodes the spatial distribution of the electrostatic potential on the molecular surface. The counterpropagation algorithm is a supervised learning technique. In contrast to the Kohonen algorithm, the pk values of the CBG receptor binding affinity are now used to derive a model expressing the relationship between the descriptors (independent variables) and the biological activity (dependent variable).

Step 1: Start SONNIA, Load the Descriptor and the Structure File

Start the graphical user interface (GUI) of SONNIA by double-clicking its desktop icon. Select Read... in the menu File in the main menu bar. The dialog box SONNIA Read appears (see also Figure 7). In the list Directory, select the directory where the structure and descriptor files are located, and select Data File in the drop-down menu Object. Select the file steroids31_actpk_mep_ac12.dat and press the button OK.

In order to load the structure file, repeat this procedure, but select Structure File in the drop-down menu Object and select the file steroids31_act.sdf.

Step 2: Create and Train a Counterpropagation Neural Network

Select Create... in the menu Network in the main menu bar. The dialog box SONNIA Network appears (see Figure 18).

Figure 18 Creating a counterpropagation network.

Select Counterprop. in the drop-down menu of the section Algorithm. Ensure that Topology is set to toroidal. In the section Network Dimensions, enter the number 12 (dimension of the descriptor vector) in the field Input and 1 in the field Output (dimension of the property to model, here the single value of the CBG binding affinity). Use the default settings for all other parameters and press the button Create.

Select Train... in the menu Network in the main menu bar. The dialog box SONNIA Training appears (see also Figure 9). Use the default settings for all parameters and press the button Train. The window SONNIA Monitor appears, which shows how the dynamic error (distance between input vectors and neuron weights) changes with the number of training cycles (see also Figure 10). The training is finished when the button Stop in the window SONNIA Monitor changes to OK (see also Figure 10).

Step 3: Visualize the Trained Counterpropagation Network

Note: The trained counterpropagation network can be visualized in a style similar to a Kohonen map. In this example, a continuous value (pk value) is modeled, which ranges from about -7.8 to -5.0. The number of colors that are available in SONNIA is limited to 10. Therefore, only ranges of the predicted values can be color-coded by a single color.

Select Palette Editor... in the menu Maps in the main menu bar. The dialog box SONNIA Palette Editor appears. Choose 3 in the drop-down menu Colors and 13 (= 12 autocorrelation coefficients + 1 activity value) in the field Output (see Figure 19; the 13th column in the input data file steroids31_actpk_mep_ac12.dat is the pk value).

Note: The entire range of the pk values from about -7.8 to -5.0 is now represented by three colors in equidistant ranges, i.e., red: pk values from about -7.8 to -6.9; yellow: pk values from about -6.8 to -5.9; green: pk values from about -5.8 to -5.0.

Confirm the settings by pressing the button Apply.

Figure 19 Setting the number and type of colors for displaying the map of the counterpropagation network.

Select Selected Maps in the menu Maps in the main menu bar. The two default maps are generated and displayed (see Figure 20).

Figure 20 Displaying the maps of the counterpropagation network.
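The equidistant color ranges quoted in the note follow from dividing the value range into equal parts. A small sketch of this arithmetic is given below. The function names are hypothetical, and the limits -7.8 and -5.0 are the approximate values stated in the text, not exact dataset extremes.

```python
from typing import List, Tuple


def equidistant_ranges(
    lowest: float, highest: float, n_colors: int
) -> List[Tuple[float, float]]:
    """Split [lowest, highest] into n_colors ranges of equal width."""
    width = (highest - lowest) / n_colors
    return [(lowest + i * width, lowest + (i + 1) * width) for i in range(n_colors)]


def color_index(value: float, lowest: float, highest: float, n_colors: int) -> int:
    """Index of the color range that a value falls into."""
    if not lowest <= value <= highest:
        raise ValueError("value outside the color-coded range")
    width = (highest - lowest) / n_colors
    # The upper limit itself belongs to the last range.
    return min(int((value - lowest) / width), n_colors - 1)


palette = ["red", "yellow", "green"]
ranges = equidistant_ranges(-7.8, -5.0, len(palette))
for color, (low, high) in zip(palette, ranges):
    print(f"{color}: {low:.2f} to {high:.2f}")
print(palette[color_index(-6.5, -7.8, -5.0, len(palette))])
```

Each range is about 0.93 pk units wide, which agrees with the rounded limits given in the note.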

Step 4: Write and Analyze the Prediction File

In order to write out the pk values predicted by the counterpropagation network, select Write in the menu File in the main menu bar. The dialog box SONNIA Write appears (see Figure 21). Select Prediction File in the drop-down menu Object and enter a file name in the field Files (e.g., steroids31.prd). Confirm with the button OK. The dialog box Prediction appears and suggests the value 12 in the field Input Dimensionality (see Figure 21; this is the number of descriptors of each compound). Confirm with the button Apply.

Figure 21 Writing the prediction file.

Note: The prediction file steroids31.prd is an ASCII file which lists the input Y variable(s) (experimental pk values), the predicted Y variable(s) (predicted pk values) and the name of the compound. The file can be loaded into spreadsheet applications or a standard ASCII text browser for further analysis (see Figure 22 and Figure 23).
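Typical figures of merit for such an analysis are the root mean square error and the coefficient of determination between experimental and predicted values. The sketch below computes both from two plain lists. How the columns of the .prd file are read into these lists is left open, because the exact layout of the file is not specified here, and the numbers in the example are invented.

```python
import math
from typing import Sequence


def rmse(experimental: Sequence[float], predicted: Sequence[float]) -> float:
    """Root mean square error between experimental and predicted values."""
    squared = [(e - p) ** 2 for e, p in zip(experimental, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(experimental: Sequence[float], predicted: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean = sum(experimental) / len(experimental)
    ss_residual = sum((e - p) ** 2 for e, p in zip(experimental, predicted))
    ss_total = sum((e - mean) ** 2 for e in experimental)
    return 1.0 - ss_residual / ss_total


# Invented values for illustration only.
demo_experimental = [-7.5, -6.5, -5.5]
demo_predicted = [-7.0, -6.5, -6.0]
print(rmse(demo_experimental, demo_predicted))
print(r_squared(demo_experimental, demo_predicted))
```

Note that values obtained on the training compounds themselves describe how well the network reproduces its input, not how well it predicts unseen compounds.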

Figure 22 Loading a prediction file into a spreadsheet application (here: MS Excel).

Figure 23 Analyzing the prediction of SONNIA (here: MS Excel).
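The difference between the supervised counterpropagation approach and the unsupervised Kohonen approach can be made concrete with a sketch. In the generic formulation used below, the winning neuron is still determined from the descriptors alone, but every neuron carries an additional output weight that is adapted towards the target value. A prediction is the output weight of the winning neuron. This is a simplified illustration and not SONNIA's implementation: the function names, the 50 epochs, the rate factor of 0.95 and the neighborhood function are assumptions.

```python
import random
from typing import Dict, List, Sequence, Tuple

Position = Tuple[int, int]


def winner(in_w: Dict[Position, List[float]], x: Sequence[float]) -> Position:
    """Neuron whose input weights are closest to the descriptor vector x."""
    return min(
        in_w, key=lambda pos: sum((a - b) ** 2 for a, b in zip(in_w[pos], x))
    )


def train_counterprop(
    inputs: Sequence[Sequence[float]],
    targets: Sequence[float],
    width: int = 5,
    height: int = 3,
    epochs: int = 50,           # assumed value
    rate: float = 0.5,
    rate_factor: float = 0.95,  # assumed value
    seed: int = 0,
) -> Tuple[Dict[Position, List[float]], Dict[Position, float]]:
    rng = random.Random(seed)
    dim = len(inputs[0])
    in_w = {
        (c, r): [rng.random() for _ in range(dim)]
        for c in range(width)
        for r in range(height)
    }
    out_w = {pos: 0.0 for pos in in_w}
    for epoch in range(epochs):
        shrink = 1.0 - epoch / epochs
        span_x, span_y = (width / 2) * shrink, (height / 2) * shrink
        for x, y in zip(inputs, targets):
            win_c, win_r = winner(in_w, x)  # the target y is not used here
            for (c, r), w in in_w.items():
                dx = min(abs(c - win_c), width - abs(c - win_c))    # toroidal
                dy = min(abs(r - win_r), height - abs(r - win_r))
                if dx > span_x or dy > span_y:
                    continue
                influence = rate * (1.0 - max(dx, dy) / (max(span_x, span_y) + 1.0))
                for k in range(dim):
                    w[k] += influence * (x[k] - w[k])
                # Supervised part: the output weight follows the target value.
                out_w[(c, r)] += influence * (y - out_w[(c, r)])
        rate *= rate_factor
    return in_w, out_w


def predict(
    in_w: Dict[Position, List[float]],
    out_w: Dict[Position, float],
    x: Sequence[float],
) -> float:
    return out_w[winner(in_w, x)]


# One invented training pattern with an invented target value.
demo_in, demo_out = train_counterprop([[5.0, 5.0]], [-6.0])
print(predict(demo_in, demo_out, [5.0, 5.0]))
```

Because a prediction is read from a single neuron, a network with N neurons can return at most N different values, which is one reason why the network size matters for quantitative models.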

Tips and Tricks

Preprocessing Data Files

Merging Structure and Property Data

Often, the chemical structure data is stored in an MDL SDFile, whereas additional information related to the chemical structures (e.g., measured or experimental data) is stored in a separate file, e.g., in a table-like formatted ASCII file. A primary key (e.g., a unique name or number of the chemical structures) that is present in both the SDFile and the ASCII file is the only link between the structure and the additional data. In order to merge chemical structure and additional data into a single SDFile, Molecular Networks' tool MN.MERGE can be used.

Figure 24 shows a part of an SDFile (left) and an ASCII file (right) that contains some experimental values (Exp1, Exp2), a categorical value (Class1) and the compound name (CpdName), organized as a table. The primary key is given in the column CpdName. In the corresponding SDFile, it can be present either in the name field (see Figure 24) or in a data field. The command line of MN.MERGE to merge the files is:

    mn.merge -tablefile tablefile.txt -tablekey CpdName -outfile outfile_merged.sdf infile.sdf

Figure 24 Merging SDFiles and data files with MN.MERGE (left: SDFile record of compound_1; right: table with the columns Exp1, Exp2, Class1 and CpdName for compound_1 to compound_31).

The resulting SDFile outfile_merged.sdf will contain the values of Exp1, Exp2, Class1 and CpdName in the SDF data fields "<Exp1>", "<Exp2>", "<Class1>" and "<CpdName>".

Note: Any data field that is already present in the input SDFile is written to the output.
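The principle of such a merge, a lookup by primary key followed by appending data fields to the matching record, can be illustrated in Python. The sketch below is not a replacement for MN.MERGE and does not reproduce its behavior in detail. It assumes a whitespace-separated table with a header row and a key that matches the name line of the SD record. The record is abbreviated and all values are invented.

```python
from typing import Dict


def read_table(table_text: str, key: str) -> Dict[str, Dict[str, str]]:
    """Index the rows of a whitespace-separated table by the key column."""
    rows = [line.split() for line in table_text.strip().splitlines()]
    header, body = rows[0], rows[1:]
    key_column = header.index(key)
    return {row[key_column]: dict(zip(header, row)) for row in body}


def merge_record(record: str, table: Dict[str, Dict[str, str]]) -> str:
    """Append the table columns as data fields to one SD record.

    `record` is the text of a single record without its "$$$$" line; the
    compound name is taken from the first line of the record.
    """
    name = record.splitlines()[0].strip()
    lines = [record.rstrip("\n")]
    for field, value in table.get(name, {}).items():
        lines.extend([f">  <{field}>", value, ""])
    return "\n".join(lines) + "\n$$$$\n"


# Invented table and an abbreviated, hypothetical record.
demo_table = read_table(
    "Exp1 Exp2 Class1 CpdName\n0.5 1.5 A compound_1\n", key="CpdName"
)
demo_record = "compound_1\n  hypothetical\n\nM  END\n"
merged = merge_record(demo_record, demo_table)
print(merged)
```

Records without a matching table row are written back unchanged apart from the record delimiter.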

Standardization and Checking Structural State and Integrity of Structure Files

Chemical structure files may originate from different sources. Therefore, the chemical structures may differ in the way they are coded in their connection table representation, or may even contain errors. For instance, functional groups such as nitro groups may be coded with a pentavalent nitrogen atom or as a charged species, hydrogen atoms may be given implicitly or explicitly, or charges in salts may not be balanced correctly. However, for a corporate compound database or a dataset under investigation it may be mandatory that all chemical structures and their connection table representations comply with a certain standard, i.e., are coded in a consistent and pre-defined fashion.

Molecular Networks' tool MN.CHECK can help to standardize chemical structure data by applying a set of business rules that can be selected by the user. MN.CHECK supports batch mode execution and is able to process large chemical files quickly and efficiently. Furthermore, MN.CHECK can be used to detect and correct errors in the structure coding (e.g., missing charges at counter ions in salts) and to identify and remove duplicate structures in large collections of chemical compounds (based on a 64-bit hash coding technique).

For example, the MN.CHECK command line

    mn.check -hydrogen add -nitrostyle ionic -chargebalance -pedantic -unique -outfile outfile_checked.sdf infile.sdf

will

- read in the file infile.sdf,
- add implicit hydrogen atoms,
- re-code all nitro groups (and similar functional units) as charge pairs (with a tetravalent, positively charged nitrogen atom and a negatively charged oxygen atom or another ligand atom),
- balance charges in salts,
- pedantically check the file formatting and structure coding and write out a message when errors are detected,
- identify and remove duplicate structures, and
- write out the normalized and checked structures to the file outfile_checked.sdf.

Complementary Software

Another helpful tool in this area is Molecular Networks' file format converter MN.CONVERT, which supports over 50 different file formats for chemical structure and reaction information and interconverts them with high conversion rates and reliability. A complete list of all supported file formats can be found at the product page of MN.CONVERT.

2D structure diagrams (2D coordinates) in publishing quality can be generated with Molecular Networks' tool MN.2DCOOR. The tool offers a variety of options and features to customize the layout of 2D structure plots. For instance, structures can be aligned to their main x or y axes or to a template structure provided in a separate file (e.g., to align all structures in a combinatorial library to a predefined orientation of their common scaffold). Further information about MN.2DCOOR can be found at its product page.

Training Parameters of a Neural Network

Network Size

By default, SONNIA suggests a ratio of approximately one neuron per two compounds/patterns (1:2), which usually works well for initial tests. Another possibility is to start with a ratio of 1:1 and to gradually reduce the size in subsequent runs. If the network gets too large, there is a high likelihood that it will only memorize the input data without revealing the actual neighborhood relationships of the data patterns (e.g., by conflict neurons, i.e., neurons with patterns of more than one class, e.g., known actives and unknown). Smaller networks (many patterns per neuron) tend to produce more conflict neurons, which might be of interest for some applications, e.g., for lead hopping. However, in networks that are too small the data has to be compressed into a few neurons. This may lead to conflict neurons that are not very meaningful. A balanced ratio should be achieved.

Another example of a rather high number of patterns per neuron is the visualization of large chemical spaces. Figure 25 shows the projection of about 404,000 chemical compounds from different sources into a Kohonen map of the size 80 x 60 neurons.

Figure 25 Visualization of large chemical spaces with SONNIA. Number of compounds: 404,449 (chemical supplier databases: 139,961; NCI database: 193,339; MDDR: 71,149); number of neurons: 4,800 (80 x 60); number of occupied neurons: 4,799. Color coding: most frequent pattern in neuron, scaled.

Network Topology

SONNIA offers two different types of network topology, a toroidal and a rectangular topology.

Toroidal topology. All neurons have the same neighbor relationship, i.e., eight direct neighbors. This means that in the resulting Kohonen map the neurons at the corners and edges are adjacent to the neurons at the opposite side of the map. This can be illustrated by a torus that is cut two times to obtain a plane (see Figure 26).

30 Tips and Tricks Figure 26 Toroidal topology of a Kohonen neural network. Rectangular topology. The neurons at the corners and the edges form the boundary of the network. Therefore, a neuron at a corner of the network has three only neurons as direct neighbors, an edge neuron five neurons and all other neurons have eight neighbors. Rectangular topologies are better for classification purposes since, e.g., "outliers" are more pushed to the edges and corners. Toroidal topologies are better if the data under investigation represents a "closed" system, e.g., if a molecular surface and its property is mapped into a two-dimensional plane by a Kohonen network. Training and Learning Parameters SONNIA (Network window, see Figure 9) makes some reasonable suggestions for the number of training cycles (epochs) and intervals, i.e., how often the data set is presented to the network before the weights of the neurons are adapted to the input data. Furthermore, the initial spans and steps (the distance in x and y direction in the network to which the weights of the neurons are adapted to a central/winning neuron; this distance is gradually reduced during the training) are set automatically according to the size of the network. Reasonable, new training parameters for span and step can be calculated as following (see Figure 27). Width Span( x) = 2 Span( x) Step( x) = Epochs Height Span( y) = 2 Span( y) Step( y) = Epochs Figure 27 Calculation of training parameters for a neural network. Learning rates (Rate in SONNIA Training window, see Figure 9) of about 0.5 are recommended. In general, it's preferable to train longer (i.e., higher number of epochs) 24

31 Tips and Tricks but with lower learning rates. High learning rates may cause problems if several input patterns compete for one neuron. The rate factor (Rate Factor in SONNIA Training window, see Figure 9) reduces the learning rate after each epoch by multiplying the learning rate with the rate factor. At the beginning of the training In general, Kohonen (or SOM) mapping is quite powerful since you can very quickly do a visual inspection of a high dimensional space and it allows for a rapid assessment and evaluation if the used descriptors are able to reveal trends and patterns in the data. Assessing the Quality of an Unsupervised Classification Basically, there are three different criteria which can be used to assess the quality of a classification done by a Kohonen mapping. These three criteria, visual inspection, occupancy and number of collisions (conflict neurons) are described in the following. Note that all three criteria should be taken into account to support the decision whether a generated Kohonen map shows a "good" classification. Visual Inspection The strength of Kohonen maps is that they can be generated rather quickly and the results can be visually inspected. The visual inspection allows for a rapid assessment and evaluation if the used descriptors are able to reveal trends and patterns in the data ("... human inspection building on the powerful pattern recognition capabilities of the human mind") [7]. A Kohonen map that shows a clear separation of different classes of compounds in a dataset can be regarded as an indicator that there is a relationship between the used descriptor(s) and the property under investigation. Occupancy A well-trained Kohonen network should also show a balanced and even distribution of the patterns (i.e., compounds) over the resulting map as well as a low fraction of unoccupied neurons (shown as white squares in the map). 
The distribution of the patterns and the occupancy of each individual neuron can be checked with an "occupancy map" (menu Maps in the main menu bar of SONNIA, see Figure 28, right map). The occupancy map is color-coded by the number of patterns/compounds that are assigned to each neuron.
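Outside SONNIA, the same occupancy check can be sketched in a few lines of Python. The sketch assumes that each pattern has already been assigned to a winning neuron, given as (x, y) coordinates in the network; the function names and the toy data are illustrative assumptions and not part of SONNIA.

```python
from collections import Counter

def occupancy_map(winners, width, height):
    """Count how many patterns are assigned to each neuron.

    winners: list of (x, y) winning-neuron coordinates, one per pattern.
    Returns a grid (list of rows, one row per y value) of counts.
    """
    counts = Counter(winners)
    return [[counts.get((x, y), 0) for x in range(width)] for y in range(height)]

def unoccupied_fraction(grid):
    """Fraction of neurons without any pattern (white squares in the map)."""
    cells = [count for row in grid for count in row]
    return sum(1 for count in cells if count == 0) / len(cells)

# Hypothetical toy data: 6 patterns on a network 3 neurons wide, 2 high.
winners = [(0, 0), (0, 0), (1, 0), (2, 1), (2, 1), (2, 1)]
grid = occupancy_map(winners, width=3, height=2)
for row in grid:
    print(row)
print("unoccupied fraction:", unoccupied_fraction(grid))
```

A low unoccupied fraction together with similar counts across the grid would correspond to the balanced distribution described above.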

Figure 28 Occupancy map of a Kohonen neural network (right map).

A Kohonen map with an unbalanced occupancy of the neurons (e.g., more than half of the input patterns are located in only 10% of the total number of neurons) may have several causes, e.g.:

- The training of the network was stopped too early: train a newly created network and increase the number of Epochs (adjust the values for Step(x) and Step(y) accordingly).
- The input values of one or a few input patterns are rather different from the rest of the input patterns of the dataset ("outliers"): remove these patterns from your training set and train a newly created network with the reduced dataset.
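The two checks above can be sketched as follows. This is a minimal illustration, not SONNIA functionality: `top_share` and `training_parameters` are hypothetical helper names, the occupancy counts are invented, and the span/step relations are taken from Figure 27 (span = half the network size, step = span divided by the number of epochs).

```python
import math

def top_share(counts, neuron_fraction=0.10):
    """Share of all patterns located in the most occupied neurons.

    counts: flat list with the number of patterns per neuron.
    neuron_fraction: fraction of neurons considered (0.10 = top 10%).
    """
    n_top = max(1, math.ceil(neuron_fraction * len(counts)))
    top = sorted(counts, reverse=True)[:n_top]
    total = sum(counts)
    return sum(top) / total if total else 0.0

def training_parameters(width, height, epochs):
    """Span and step values following the relations in Figure 27."""
    span_x, span_y = width / 2, height / 2
    return {
        "span_x": span_x, "step_x": span_x / epochs,
        "span_y": span_y, "step_y": span_y / epochs,
    }

# Hypothetical 10 x 10 network: one neuron holds 60 of the 100 patterns.
counts = [60] + [1] * 40 + [0] * 59
share = top_share(counts)      # share held by the 10 most occupied neurons
unbalanced = share > 0.5       # "more than half of the patterns in 10% of the neurons"
print(share, unbalanced)

# Retraining with more epochs: the span stays the same, the steps shrink.
print(training_parameters(10, 10, epochs=20))
print(training_parameters(10, 10, epochs=40))
```

Doubling the number of epochs halves Step(x) and Step(y), so the span is still reduced gradually over the whole, longer training run.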

Problems and Help!

If there are any difficulties with the installation of ADRIANA.Code or SONNIA or if any problems occur while running ADRIANA.Code or SONNIA please send all inquiries to the following address:

Molecular Networks GmbH Computerchemie
Henkestr
Erlangen
Germany

or contact us by or by Fax

References

[1] Descriptor Calculation Package ADRIANA.Code, developed and distributed by Molecular Networks GmbH, Erlangen, Germany
[2] Neural Networks Package SONNIA, developed and distributed by Molecular Networks GmbH, Erlangen, Germany
[3] Wagener, M.; Sadowski, J.; Gasteiger, J. Autocorrelation of Molecular Surface Properties for Modeling Corticosteroid Binding Globulin and Cytosolic Ah Receptor Activity by Neural Networks. J. Am. Chem. Soc. 1995, 117,
[4] a) Dalby, A.; Nourse, J. G.; Hounshell, W. D.; Gushurst, A. K. I.; Grier, D. L.; Leland, B. A.; Laufer, J. Description of Several Chemical Structure File Formats Used by Computer Programs Developed at Molecular Design Limited. J. Chem. Inf. Comput. Sci. 1992, 32,
    b) A detailed description of MDL file formats (Mol, SDF and RDF) is available for download as a PDF document at
[5] Sadowski, J.; Gasteiger, J.; Klebe, G. Comparison of Automatic Three-Dimensional Model Builders Using 639 X-Ray Structures. J. Chem. Inf. Comput. Sci. 1994, 34,
[6] 3D Structure Generator CORINA, developed and distributed by Molecular Networks GmbH, Erlangen, Germany
[7] Zupan, J.; Gasteiger, J. Neural Networks in Chemistry and Drug Design. Second Edition, Wiley-VCH, Weinheim, 1999, 380 pages, ISBN

Ligand Scout Tutorials

Ligand Scout Tutorials Ligand Scout Tutorials Step : Creating a pharmacophore from a protein-ligand complex. Type ke6 in the upper right area of the screen and press the button Download *+. The protein will be downloaded and

More information

Tutorial. Getting started. Sample to Insight. March 31, 2016

Tutorial. Getting started. Sample to Insight. March 31, 2016 Getting started March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Getting started

More information

NMR Predictor. Introduction

NMR Predictor. Introduction NMR Predictor This manual gives a walk-through on how to use the NMR Predictor: Introduction NMR Predictor QuickHelp NMR Predictor Overview Chemical features GUI features Usage Menu system File menu Edit

More information

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist How-To Guide Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist HOW TO IMPORT, SKETCH AND EDIT CHEMICALS How to access the ICM Molecular Editor. 1. Click here 2. Start sketching How to sketch

More information

OECD QSAR Toolbox v.4.1. Tutorial on how to predict Skin sensitization potential taking into account alert performance

OECD QSAR Toolbox v.4.1. Tutorial on how to predict Skin sensitization potential taking into account alert performance OECD QSAR Toolbox v.4.1 Tutorial on how to predict Skin sensitization potential taking into account alert performance Outlook Background Objectives Specific Aims Read across and analogue approach The exercise

More information

ArcGIS 9 ArcGIS StreetMap Tutorial

ArcGIS 9 ArcGIS StreetMap Tutorial ArcGIS 9 ArcGIS StreetMap Tutorial Copyright 2001 2008 ESRI All Rights Reserved. Printed in the United States of America. The information contained in this document is the exclusive property of ESRI. This

More information

Dock Ligands from a 2D Molecule Sketch

Dock Ligands from a 2D Molecule Sketch Dock Ligands from a 2D Molecule Sketch March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

Introduction to Spark

Introduction to Spark 1 As you become familiar or continue to explore the Cresset technology and software applications, we encourage you to look through the user manual. This is accessible from the Help menu. However, don t

More information

Preparing Spatial Data

Preparing Spatial Data 13 CHAPTER 2 Preparing Spatial Data Assessing Your Spatial Data Needs 13 Assessing Your Attribute Data 13 Determining Your Spatial Data Requirements 14 Locating a Source of Spatial Data 14 Performing Common

More information

Exercises for Windows

Exercises for Windows Exercises for Windows CAChe User Interface for Windows Select tool Application window Document window (workspace) Style bar Tool palette Select entire molecule Select Similar Group Select Atom tool Rotate

More information

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents Software Search this site Home Announcements An update on mailing lists APBS 1.2.0 released APBS 1.2.1 released APBS 1.3 released New APBS 1.3 Windows Installer PDB2PQR 1.7.1 released PDB2PQR 1.8 released

More information

Search for the Gulf of Carpentaria in the remap search bar:

Search for the Gulf of Carpentaria in the remap search bar: This tutorial is aimed at getting you started with making maps in Remap (). In this tutorial we are going to develop a simple classification of mangroves in northern Australia. Before getting started with

More information

SeeSAR 7.1 Beginners Guide. June 2017

SeeSAR 7.1 Beginners Guide. June 2017 SeeSAR 7.1 Beginners Guide June 2017 Part 1: Basics 1 Type a pdb code and press return or Load your own protein or already existing project, or Just load molecules To begin, let s type 2zff and download

More information

OECD QSAR Toolbox v.4.0. Tutorial on how to predict Skin sensitization potential taking into account alert performance

OECD QSAR Toolbox v.4.0. Tutorial on how to predict Skin sensitization potential taking into account alert performance OECD QSAR Toolbox v.4.0 Tutorial on how to predict Skin sensitization potential taking into account alert performance Outlook Background Objectives Specific Aims Read across and analogue approach The exercise

More information

Space Objects. Section. When you finish this section, you should understand the following:

Space Objects. Section. When you finish this section, you should understand the following: GOLDMC02_132283433X 8/24/06 2:21 PM Page 97 Section 2 Space Objects When you finish this section, you should understand the following: How to create a 2D Space Object and label it with a Space Tag. How

More information

Reaxys Pipeline Pilot Components Installation and User Guide

Reaxys Pipeline Pilot Components Installation and User Guide 1 1 Reaxys Pipeline Pilot components for Pipeline Pilot 9.5 Reaxys Pipeline Pilot Components Installation and User Guide Version 1.0 2 Introduction The Reaxys and Reaxys Medicinal Chemistry Application

More information

Software BioScout-Calibrator June 2013

Software BioScout-Calibrator June 2013 SARAD GmbH BioScout -Calibrator 1 Manual Software BioScout-Calibrator June 2013 SARAD GmbH Tel.: ++49 (0)351 / 6580712 Wiesbadener Straße 10 FAX: ++49 (0)351 / 6580718 D-01159 Dresden email: support@sarad.de

More information

OECD QSAR Toolbox v.3.3. Predicting skin sensitisation potential of a chemical using skin sensitization data extracted from ECHA CHEM database

OECD QSAR Toolbox v.3.3. Predicting skin sensitisation potential of a chemical using skin sensitization data extracted from ECHA CHEM database OECD QSAR Toolbox v.3.3 Predicting skin sensitisation potential of a chemical using skin sensitization data extracted from ECHA CHEM database Outlook Background The exercise Workflow Save prediction 23.02.2015

More information

Agilent MassHunter Quantitative Data Analysis

Agilent MassHunter Quantitative Data Analysis Agilent MassHunter Quantitative Data Analysis Presenters: Howard Sanford Stephen Harnos MassHunter Quantitation: Batch and Method Setup Outliers, Data Review, Reporting 1 MassHunter Quantitative Analysis

More information

Tutorial: Structural Analysis of a Protein-Protein Complex

Tutorial: Structural Analysis of a Protein-Protein Complex Molecular Modeling Section (MMS) Department of Pharmaceutical and Pharmacological Sciences University of Padova Via Marzolo 5-35131 Padova (IT) @contact: stefano.moro@unipd.it Tutorial: Structural Analysis

More information

Assignment 1: Molecular Mechanics (PART 1 25 points)

Assignment 1: Molecular Mechanics (PART 1 25 points) Chemistry 380.37 Fall 2015 Dr. Jean M. Standard August 19, 2015 Assignment 1: Molecular Mechanics (PART 1 25 points) In this assignment, you will perform some molecular mechanics calculations using the

More information

How to Create a Substance Answer Set

How to Create a Substance Answer Set How to Create a Substance Answer Set Select among five search techniques to find substances Since substances can be described by multiple names or other characteristics, SciFinder gives you the flexibility

More information

M E R C E R W I N WA L K T H R O U G H

M E R C E R W I N WA L K T H R O U G H H E A L T H W E A L T H C A R E E R WA L K T H R O U G H C L I E N T S O L U T I O N S T E A M T A B L E O F C O N T E N T 1. Login to the Tool 2 2. Published reports... 7 3. Select Results Criteria...

More information

Computer simulation of radioactive decay

Computer simulation of radioactive decay Computer simulation of radioactive decay y now you should have worked your way through the introduction to Maple, as well as the introduction to data analysis using Excel Now we will explore radioactive

More information

ST-Links. SpatialKit. Version 3.0.x. For ArcMap. ArcMap Extension for Directly Connecting to Spatial Databases. ST-Links Corporation.

ST-Links. SpatialKit. Version 3.0.x. For ArcMap. ArcMap Extension for Directly Connecting to Spatial Databases. ST-Links Corporation. ST-Links SpatialKit For ArcMap Version 3.0.x ArcMap Extension for Directly Connecting to Spatial Databases ST-Links Corporation www.st-links.com 2012 Contents Introduction... 3 Installation... 3 Database

More information

This tutorial is intended to familiarize you with the Geomatica Toolbar and describe the basics of viewing data using Geomatica Focus.

This tutorial is intended to familiarize you with the Geomatica Toolbar and describe the basics of viewing data using Geomatica Focus. PCI GEOMATICS GEOMATICA QUICKSTART 1. Introduction This tutorial is intended to familiarize you with the Geomatica Toolbar and describe the basics of viewing data using Geomatica Focus. All data used in

More information

ISIS/Draw "Quick Start"

ISIS/Draw Quick Start ISIS/Draw "Quick Start" Click to print, or click Drawing Molecules * Basic Strategy 5.1 * Drawing Structures with Template tools and template pages 5.2 * Drawing bonds and chains 5.3 * Drawing atoms 5.4

More information

OECD QSAR Toolbox v.3.3

OECD QSAR Toolbox v.3.3 OECD QSAR Toolbox v.3.3 Step-by-step example on how to predict the skin sensitisation potential of a chemical by read-across based on an analogue approach Outlook Background Objectives Specific Aims Read

More information

How to Make or Plot a Graph or Chart in Excel

How to Make or Plot a Graph or Chart in Excel This is a complete video tutorial on How to Make or Plot a Graph or Chart in Excel. To make complex chart like Gantt Chart, you have know the basic principles of making a chart. Though I have used Excel

More information

Electric Fields and Equipotentials

Electric Fields and Equipotentials OBJECTIVE Electric Fields and Equipotentials To study and describe the two-dimensional electric field. To map the location of the equipotential surfaces around charged electrodes. To study the relationship

More information

DivCalc: A Utility for Diversity Analysis and Compound Sampling

DivCalc: A Utility for Diversity Analysis and Compound Sampling Molecules 2002, 7, 657-661 molecules ISSN 1420-3049 http://www.mdpi.org DivCalc: A Utility for Diversity Analysis and Compound Sampling Rajeev Gangal* SciNova Informatics, 161 Madhumanjiri Apartments,

More information

DiscoveryGate SM Version 1.4 Participant s Guide

DiscoveryGate SM Version 1.4 Participant s Guide Citation Searching in CrossFire Beilstein DiscoveryGate SM Version 1.4 Participant s Guide Citation Searching in CrossFire Beilstein DiscoveryGate SM Version 1.4 Participant s Guide Elsevier MDL 14600

More information

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics Marvin Sketching, viewing and predicting properties with Marvin - features, tips and tricks Gyorgy Pirok Solutions for Cheminformatics The Marvin family The Marvin toolkit provides web-enabled components

More information

OECD QSAR Toolbox v.3.4

OECD QSAR Toolbox v.3.4 OECD QSAR Toolbox v.3.4 Step-by-step example on how to predict the skin sensitisation potential approach of a chemical by read-across based on an analogue approach Outlook Background Objectives Specific

More information

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Lab 1 Uniform Motion - Graphing and Analyzing Motion Lab 1 Uniform Motion - Graphing and Analyzing Motion Objectives: < To observe the distance-time relation for motion at constant velocity. < To make a straight line fit to the distance-time data. < To interpret

More information

OECD QSAR Toolbox v.3.0

OECD QSAR Toolbox v.3.0 OECD QSAR Toolbox v.3.0 Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists Background Objectives Specific Aims Trend analysis The exercise

More information

Application Note. U. Heat of Formation of Ethyl Alcohol and Dimethyl Ether. Introduction

Application Note. U. Heat of Formation of Ethyl Alcohol and Dimethyl Ether. Introduction Application Note U. Introduction The molecular builder (Molecular Builder) is part of the MEDEA standard suite of building tools. This tutorial provides an overview of the Molecular Builder s basic functionality.

More information

Performing a Pharmacophore Search using CSD-CrossMiner

Performing a Pharmacophore Search using CSD-CrossMiner Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0

More information

ON SITE SYSTEMS Chemical Safety Assistant

ON SITE SYSTEMS Chemical Safety Assistant ON SITE SYSTEMS Chemical Safety Assistant CS ASSISTANT WEB USERS MANUAL On Site Systems 23 N. Gore Ave. Suite 200 St. Louis, MO 63119 Phone 314-963-9934 Fax 314-963-9281 Table of Contents INTRODUCTION

More information

Data Mining with the PDF-4 Databases. FeO Non-stoichiometric Oxides

Data Mining with the PDF-4 Databases. FeO Non-stoichiometric Oxides Data Mining with the PDF-4 Databases FeO Non-stoichiometric Oxides This is one of three example-based tutorials for using the data mining capabilities of the PDF-4+ database and it covers the following

More information

Quantification of JEOL XPS Spectra from SpecSurf

Quantification of JEOL XPS Spectra from SpecSurf Quantification of JEOL XPS Spectra from SpecSurf The quantification procedure used by the JEOL SpecSurf software involves modifying the Scofield cross-sections to account for both an energy dependency

More information

OECD QSAR Toolbox v.4.1. Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals

OECD QSAR Toolbox v.4.1. Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals OECD QSAR Toolbox v.4.1 Step-by-step example for predicting skin sensitization accounting for abiotic activation of chemicals Background Outlook Objectives The exercise Workflow 2 Background This is a

More information

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model OECD QSAR Toolbox v.4.1 Step-by-step example for building QSAR model Background Objectives The exercise Workflow of the exercise Outlook 2 Background This is a step-by-step presentation designed to take

More information

Preparing a PDB File

Preparing a PDB File Figure 1: Schematic view of the ligand-binding domain from the vitamin D receptor (PDB file 1IE9). The crystallographic waters are shown as small spheres and the bound ligand is shown as a CPK model. HO

More information

Getting started with BatchReactor Example : Simulation of the Chlorotoluene chlorination

Getting started with BatchReactor Example : Simulation of the Chlorotoluene chlorination Getting started with BatchReactor Example : Simulation of the Chlorotoluene chlorination 2011 ProSim S.A. All rights reserved. Introduction This document presents the different steps to follow in order

More information

VCell Tutorial. Building a Rule-Based Model

VCell Tutorial. Building a Rule-Based Model VCell Tutorial Building a Rule-Based Model We will demonstrate how to create a rule-based model of EGFR receptor interaction with two adapter proteins Grb2 and Shc. A Receptor-monomer reversibly binds

More information

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2)

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2) CityGML XFM Application Template Documentation Bentley Map V8i (SELECTseries 2) Table of Contents Introduction to CityGML 1 CityGML XFM Application Template 2 Requirements 2 Finding Documentation 2 To

More information

NINE CHOICE SERIAL REACTION TIME TASK

NINE CHOICE SERIAL REACTION TIME TASK instrumentation and software for research NINE CHOICE SERIAL REACTION TIME TASK MED-STATE NOTATION PROCEDURE SOF-700RA-8 USER S MANUAL DOC-025 Rev. 1.3 Copyright 2013 All Rights Reserved MED Associates

More information

Urban Canopy Tool User Guide `bo`

Urban Canopy Tool User Guide `bo` Urban Canopy Tool User Guide `bo` ADMS Urban Canopy Tool User Guide Version 2.0 June 2014 Cambridge Environmental Research Consultants Ltd. 3, King s Parade Cambridge CB2 1SJ UK Telephone: +44 (0)1223

More information

Skin Damage Visualizer TiVi60 User Manual

Skin Damage Visualizer TiVi60 User Manual Skin Damage Visualizer TiVi60 User Manual PIONEERS IN TISSUE VIABILITY IMAGING User Manual 3.2 Version 3.2 October 2013 Dear Valued Customer! TiVi60 Skin Damage Visualizer Welcome to the WheelsBridge Skin

More information

v Prerequisite Tutorials GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Time minutes

v Prerequisite Tutorials GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Time minutes v. 10.1 WMS 10.1 Tutorial GSSHA WMS Basics Creating Feature Objects and Mapping Attributes to the 2D Grid Populate hydrologic parameters in a GSSHA model using land use and soil data Objectives This tutorial

More information

Comparing whole genomes

Comparing whole genomes BioNumerics Tutorial: Comparing whole genomes 1 Aim The Chromosome Comparison window in BioNumerics has been designed for large-scale comparison of sequences of unlimited length. In this tutorial you will

More information

Geodatabase Management Pathway

Geodatabase Management Pathway Geodatabase Management Pathway Table of Contents ArcGIS Desktop II: Tools and Functionality 3 ArcGIS Desktop III: GIS Workflows and Analysis 6 Building Geodatabases 8 Data Management in the Multiuser Geodatabase

More information

Calculating Bond Enthalpies of the Hydrides

Calculating Bond Enthalpies of the Hydrides Proposed Exercise for the General Chemistry Section of the Teaching with Cache Workbook: Calculating Bond Enthalpies of the Hydrides Contributed by James Foresman, Rachel Fogle, and Jeremy Beck, York College

More information

OECD QSAR Toolbox v.3.3. Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists

OECD QSAR Toolbox v.3.3. Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists OECD QSAR Toolbox v.3.3 Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists Background Objectives Specific Aims Trend analysis The exercise

More information

General Chemistry Lab Molecular Modeling

General Chemistry Lab Molecular Modeling PURPOSE The objectives of this experiment are PROCEDURE General Chemistry Lab Molecular Modeling To learn how to use molecular modeling software, a commonly used tool in chemical research and industry.

More information

Troubleshooting Replication and Geodata Services. Liz Parrish & Ben Lin

Troubleshooting Replication and Geodata Services. Liz Parrish & Ben Lin Troubleshooting Replication and Geodata Services Liz Parrish & Ben Lin AGENDA: Troubleshooting Replication and Geodata Services Overview Demo Troubleshooting Q & A Overview of Replication Liz Parrish What

More information

Geography 281 Map Making with GIS Project Four: Comparing Classification Methods

Geography 281 Map Making with GIS Project Four: Comparing Classification Methods Geography 281 Map Making with GIS Project Four: Comparing Classification Methods Thematic maps commonly deal with either of two kinds of data: Qualitative Data showing differences in kind or type (e.g.,

More information

OECD QSAR Toolbox v.4.1

OECD QSAR Toolbox v.4.1 OECD QSAR Toolbox v.4.1 Step-by-step example on how to predict the skin sensitisation potential approach of a chemical by read-across based on an analogue approach Outlook Background Objectives Specific

More information

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options for grouping with metabolism

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options for grouping with metabolism OECD QSAR Toolbox v.4.1 Tutorial illustrating new options for grouping with metabolism Outlook Background Objectives Specific Aims The exercise Workflow 2 Background Grouping with metabolism is a procedure

More information

CORINA.direct. User Manual. Version 3.2. Molecular Networks GmbH Computerchemie October 2008

CORINA.direct. User Manual. Version 3.2. Molecular Networks GmbH Computerchemie October 2008 CORINA.direct User Manual Version 3.2 Molecular Networks GmbH Computerchemie October 2008 http://www.molecular-networks.com Molecular Networks GmbH Computerchemie Henkestr. 91 91052 Erlangen Germany Phone:

More information

McIDAS-V Tutorial Displaying Point Observations from ADDE Datasets updated July 2016 (software version 1.6)

McIDAS-V Tutorial Displaying Point Observations from ADDE Datasets updated July 2016 (software version 1.6) McIDAS-V Tutorial Displaying Point Observations from ADDE Datasets updated July 2016 (software version 1.6) McIDAS-V is a free, open source, visualization and data analysis software package that is the

More information

Lightcloud Application

Lightcloud Application Controlling Your Lightcloud System Lightcloud Application Lightcloud Application Navigating the Application Devices Device Settings Organize Control Energy Scenes Schedules Demand Response Power Up State

More information

ncounter PlexSet Data Analysis Guidelines

ncounter PlexSet Data Analysis Guidelines ncounter PlexSet Data Analysis Guidelines NanoString Technologies, Inc. 530 airview Ave North Seattle, Washington 98109 USA Telephone: 206.378.6266 888.358.6266 E-mail: info@nanostring.com Molecules That

More information

Watershed Modeling Orange County Hydrology Using GIS Data

Watershed Modeling Orange County Hydrology Using GIS Data v. 10.0 WMS 10.0 Tutorial Watershed Modeling Orange County Hydrology Using GIS Data Learn how to delineate sub-basins and compute soil losses for Orange County (California) hydrologic modeling Objectives

More information

Tutorials on Library Design E. Lounkine and J. Bajorath (University of Bonn) C. Muller and A. Varnek (University of Strasbourg)

Tutorials on Library Design E. Lounkine and J. Bajorath (University of Bonn) C. Muller and A. Varnek (University of Strasbourg) Tutorials on Library Design E. Lounkine and J. Bajorath (University of Bonn) C. Muller and A. Varnek (University of Strasbourg) The purpose of this tutorial is to generate a library of potential inhibitors

More information

Searching Substances in Reaxys

Searching Substances in Reaxys Searching Substances in Reaxys Learning Objectives Understand that substances in Reaxys have different sources (e.g., Reaxys, PubChem) and can be found in Document, Reaction and Substance Records Recognize

More information

Understanding Your Spectra Module. Agilent OpenLAB CDS ChemStation Edition

Understanding Your Spectra Module. Agilent OpenLAB CDS ChemStation Edition Understanding Your Spectra Module Agilent OpenLAB CDS ChemStation Edition Notices Agilent Technologies, Inc. 1994-2012, 2013 No part of this manual may be reproduced in any form or by any means (including

More information

Introduction to Structure Preparation and Visualization

Introduction to Structure Preparation and Visualization Introduction to Structure Preparation and Visualization Created with: Release 2018-4 Prerequisites: Release 2018-2 or higher Access to the internet Categories: Molecular Visualization, Structure-Based

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

Esri UC2013. Technical Workshop.

Esri UC2013. Technical Workshop. Esri International User Conference San Diego, California Technical Workshops July 9, 2013 CAD: Introduction to using CAD Data in ArcGIS Jeff Reinhart & Phil Sanchez Agenda Overview of ArcGIS CAD Support

More information

www.goldensoftware.com Why Create a Thematic Map? A thematic map visually represents the geographic distribution of data. MapViewer will help you to: understand demographics define sales or insurance territories

More information

C:\Dokumente und Einstellungen \All Users\Anwendungsdaten \Mathematica. C:\Dokumente und Einstellungen \albert.retey\anwendungsdaten \Mathematica

C:\Dokumente und Einstellungen \All Users\Anwendungsdaten \Mathematica. C:\Dokumente und Einstellungen \albert.retey\anwendungsdaten \Mathematica Installation SmartCAE`HeatTransmission` is delivered as either a Microsoft Windows Installer Package (SmartCAEHeatTransmission-1.0.msi) or a ZIP-File. If you have trouble installing the package, please

More information

Geodatabases and ArcCatalog

Geodatabases and ArcCatalog Geodatabases and ArcCatalog Prepared by Francisco Olivera, Ph.D. and Srikanth Koka Department of Civil Engineering Texas A&M University February 2004 Contents Brief Overview of Geodatabases Goals of the

More information

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009 ICM-Chemist-Pro How-To Guide Version 3.6-1h Last Updated 12/29/2009 ICM-Chemist-Pro ICM 3D LIGAND EDITOR: SETUP 1. Read in a ligand molecule or PDB file. How to setup the ligand in the ICM 3D Ligand Editor.

More information

ISSP User Guide CY3207ISSP. Revision C

ISSP User Guide CY3207ISSP. Revision C CY3207ISSP ISSP User Guide Revision C Cypress Semiconductor 198 Champion Court San Jose, CA 95134-1709 Phone (USA): 800.858.1810 Phone (Intnl): 408.943.2600 http://www.cypress.com Copyrights Copyrights

More information

O P E R A T I N G M A N U A L

O P E R A T I N G M A N U A L OPERATING MANUAL WeatherJack OPERATING MANUAL 1-800-645-1061 The baud rate is 2400 ( 8 bits, 1 stop bit, no parity. Flow control = none) To make sure the unit is on line, send an X. the machine will respond

More information

GIS Workshop UCLS_Fall Forum 2014 Sowmya Selvarajan, PhD TABLE OF CONTENTS

GIS Workshop UCLS_Fall Forum 2014 Sowmya Selvarajan, PhD TABLE OF CONTENTS TABLE OF CONTENTS TITLE PAGE NO. 1. ArcGIS Basics I 2 a. Open and Save a Map Document 2 b. Work with Map Layers 2 c. Navigate in a Map Document 4 d. Measure Distances 4 2. ArcGIS Basics II 5 a. Work with

More information

THE CRYSTAL BALL SCATTER CHART

THE CRYSTAL BALL SCATTER CHART One-Minute Spotlight THE CRYSTAL BALL SCATTER CHART Once you have run a simulation with Oracle s Crystal Ball, you can view several charts to help you visualize, understand, and communicate the simulation

More information

OECD QSAR Toolbox v.3.4

OECD QSAR Toolbox v.3.4 OECD QSAR Toolbox v.3.4 Predicting developmental and reproductive toxicity of Diuron (CAS 330-54-1) based on DART categorization tool and DART SAR model Outlook Background Objectives The exercise Workflow

More information

Last updated: Copyright

Last updated: Copyright Last updated: 2012-08-20 Copyright 2004-2012 plabel (v2.4) User s Manual by Bioinformatics Group, Institute of Computing Technology, Chinese Academy of Sciences Tel: 86-10-62601016 Email: zhangkun01@ict.ac.cn,

More information

Virtual Beach Building a GBM Model

Virtual Beach Building a GBM Model Virtual Beach 3.0.6 Building a GBM Model Building, Evaluating and Validating Anytime Nowcast Models In this module you will learn how to: A. Build and evaluate an anytime GBM model B. Optimize a GBM model

More information

Learning ArcGIS: Introduction to ArcCatalog 10.1

Learning ArcGIS: Introduction to ArcCatalog 10.1 Learning ArcGIS: Introduction to ArcCatalog 10.1 Estimated Time: 1 Hour Information systems help us to manage what we know by making it easier to organize, access, manipulate, and apply knowledge to the

More information

Quality Measures Green Light Report Online Management Tool. Self Guided Tutorial

Quality Measures Green Light Report Online Management Tool. Self Guided Tutorial Quality Measures Green Light Report Online Management Tool Self Guided Tutorial 1 Tutorial Contents Overview Access the QM Green Light Report Review the QM Green Light Report Tips for Success Contact PointRight

More information

Introduction to Hartree-Fock calculations in Spartan

Introduction to Hartree-Fock calculations in Spartan EE5 in 2008 Hannes Jónsson Introduction to Hartree-Fock calculations in Spartan In this exercise, you will get to use state of the art software for carrying out calculations of wavefunctions for molecues,

More information

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Table of Contents Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Features

More information
