Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design

Size: px
Start display at page:

Download "Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design"

Transcription

1 J. Chem. Inf. Comput. Sci. 1999, 39, Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design Malcolm J. McGregor and Steven M. Muskal* Affymax Research Institute, 3410 Central Expressway, Santa Clara, California Received November 16, 1998 A new method of rapid pharmacophore fingerprinting (PharmPrint method) has been developed. A basis set of three-point pharmacophores has been constructed by enumerating several distance ranges and pharmacophoric features. Software has been developed to assign pharmacophoric types to atoms in chemical structures, generate multiple conformations, and construct the binary fingerprint according to the pharmacophores that result. The fingerprint is used as a descriptor for developing a quantitative structureactivity relationship (QSAR) model using partial least squares. An example is given using sets of ligands for the estrogen receptor (ER). The result is compared with previously published results on the same data to show the superiority of a full 3D, conformationally flexible approach. The QSAR model can be readily interpreted in structural/chemical terms. Further examples are given using binary activity data and some of our novel in-house compounds, which show the value of the model when crossing compound classes. INTRODUCTION Pharmacophore based screening has become commonplace in the field of computer aided drug design. The pharmacophore concept is based on the kinds of interactions observed in molecular recognition: hydrogen bonding, charge-charge, and hydrophobic interactions. A pharmacophore is a set of functional group types in a spatial arrangement that represents the interactions made in common by a set of small molecule ligands with a protein receptor. Some advantages of the methodology are as follows: (i) it can be used where the only data available are a set of known hits without a knowledge of the structure of the receptor; (ii) the pharmacophore specification is sufficiently general that it can be applied across different classes of ligands, i.e., a pharmacophore can be derived from one class and used to predict activity in another; and (iii) the methodology can be applied to large data sets in high throughput screening (HTS) applications. Thus the method has wide utility. In the usual application of the methodology a single pharmacophore hypothesis, or a small number of them, is derived from a set of ligands with known activity. (The term hypothesis is usually used at this stage to indicate that this is a computational result, as opposed to an empirical one arrived at by experiment or observation of structure complexes.) The hypothesis is then computationally screened across a database of compounds to narrow the selection for biological screening. A measure of the increase in hit rate achieved is desirable. Several software systems are commercially available that support pharmacophoric development and use. Widely used examples are Catalyst 1-3 by Molecular Simulations Inc. 4 and the ChemDiverse module of Chem-X by Chemical Design Ltd. 5 However, a major limitation is that compounds must be registered into the proprietary, closed database system provided by the respective vendors. Pharmacophore fingerprinting is an extension of this approach whereby a basis set of pharmacophores is generated * To whom correspondence should be addressed. Figure 1. Schematic representation of pharmacophoric types (p) and distances (d). by enumerating all pharmacophoric types with a set of distance ranges. The concept has been described previously, 6-9 and applications to structure-activity relationships have been explored with atom pair descriptors. 10 In the present study a basis set of three-point pharmacophores has been generated. Tools have been developed to fingerprint large libraries of compounds with the entire set and incorporated into a program called PharmPrint. The efficiency of the code is such that it can be applied to high-throughput electronic ( in silico ) screening applications, in our case involving combinatorial libraries. The question then arises as to how to derive a function that relates the fingerprints to biological activity. Methods that suggest themselves are neural networks, genetic algorithms, and regression techniques. We chose one of the latter methods, namely, partial least squares (PLS). As a natural outcome of this methodology, we are able to address the issues of chemical diversity and coverage of drug space (in preparation) in addition to activity prediction. METHODOLOGY Fingerprint Generation. Figure 1 shows the general definition of the three-point pharmacophore. The pharmacophoric groups used were the six commonly used in this type of analysis: hydrogen bond acceptor (A) and donor (D), groups with formal negative (N) and positive (P) charges, and hydrophobic (H) and aromatic (R). In addition a seventh /ci980159j CCC: $ American Chemical Society Published on Web 03/19/1999

2 570 J. Chem. Inf. Comput. Sci., Vol. 39, No. 3, 1999 MCGREGOR AND MUSKAL type was added for any atom that is not labeled with any of the first six types (X). Six distance ranges were used: , , , , , and Å. These have been used previously, where distributions of hits were analyzed, 7 and smaller distance bins were assigned to the more populated regions. The three-point pharmacophores were enumerated for all combinations of types and distances and then subjected to two additional constraints: (i) triangle rule, i.e., the length of each side of a triangle cannot exceed the sum of the lengths of the other two sides, otherwise this would produce a geometrically impossible object; (ii) elimination of redundant pharmacophores related by symmetry. The resulting number is Most of the software used for this analysis was written in-house. The exception was the Corina program for 3D structure generation. 11 This was chosen because it runs in batch mode, accepts a variety of standard molecule formats, and has been shown to generate good quality structures. 12 The output from Corina is used as input to the PharmPrint program which was developed in-house. Only heavy atoms are used in computations; if hydrogens are present, they are ignored. The function of the PharmPrint program is 3-fold. (1) It assigns the pharmacophoric groups to atoms. This is done using a substructure search for the relevant fragments, using an atom-by-atom mapping algorithm. 13 The fragments are chosen using heuristics about which substructures contain the pharmacophoric groups. For the most part this is fairly straightforward; e.g., a carboxylate has a negative charge, an aliphatic amine has a positive charge, a hydroxyl is a hydrogen bond donor and acceptor, etc. The most problematic assignment was that of hydrophobic. The following was tried and appears to work well: an atom of type C, Cl, Br, or I is considered hydrophobic if it is further than two bonds removed in 2D connectivity from any atom of type N, O, P, or S-H. (2) It rotates about bonds to generate multiple conformations. This utilizes the quaternion rotation algorithm. 14 Heuristics are used to determine which bonds are rotatable and the angles they can assume. There is a limit to the number of conformations generated (set at 1000); rotations that affect the largest number of atoms are performed first, so that if the limit is reached, then as much of the conformational space is covered as possible. A simple energy function is implemented to eliminate conformations with severe steric overlaps. Bonds in rings are assigned to be nonrotatable and the multiple ring conformation option in Corina is switched on. (3) It builds the pharmacophore fingerprint by measuring distances between pharmacophoric groups. The output from the program is a fingerprint for each structure in the original SD file (SD format is the MDL ASCII molecule format), with an identifier derived from a specified data field. As a binary descriptor, the fingerprint is efficient to deal with computationally. It can be represented in computer memory using one integer for 32 bits in the bitstring. This can be unpacked into one integer or floating point number per bit, but for some calculations it does not need to be unpacked. An example is the Tanimoto coefficient, a measure of bitstring (and therefore molecular) similarity, which can be calculated using bitwise operators in a programming language like C. The computational speed of fingerprinting is dependent on the parameters set and the nature of the structures, in particular the number of total atoms and rotatable bonds. With the MDDR9104 set (below), and with the parameters used, the speed was of the order of 10 structures/min on a Silicon Graphics R10000 processor. Preparation of the MDDR Subset. The MDDR (MDL Drug Data Report) 15 was used as a reference for drug like compounds. It is a database of biologically active compounds with associated data, including activity classes. Version 98.1 contains entries. A subset was prepared using the following criteria. (1) Structures that have a molecular weight in the drug like range of are used. For this and subsequent steps it is important that salts are removed from the original database structure. To this end a program stripsalt was used to remove small-disconnected fragments in SD files. (2) Only structures which consist entirely of atoms from the following list are included: C, N, O, H, S, P, F, Cl, Br, I. This preserves only the types of structures which reflect the chemistry carried out in our laboratories and removes unusual structures such as metal complexes. (3) The compound activity class, as given in the activ_class and activ_index fields in the MDDR, indicates a well-defined target (i.e., individual enzyme or receptor), as opposed to a broad therapeutic class. The file activty.txt, provided by MDL, lists the classes. This file was manually inspected to extract all such classes. (4) The 2D chemical similarity between any structure and all other structures in the final list is below a certain threshold. This eliminates close analogues that might bias the analysis. The measure chosen was the Tanimoto coefficient with the MDL 166 user keys, and the threshold was 0.8. The keys are 2D fragment-based descriptors which are calculated automatically in MDL ISIS databases. 16 (5) Classes that had less than eight members, and compounds that belonged only to those classes, were eliminated. This procedure resulted in 9104 compounds and 152 classes. Partial Least Squares. Partial least squares (PLS) was applied to derive a function that relates the fingerprints to the activity values using code written in-house. The algorithm used is based on the NIPALS algorithm for principle component analysis (PCA). 18 PLS has been previously applied to the analysis of chemical structure, most notably in the CoMFA methodology. 20 The data were mean centered but not variance scaled. The data for the independent variables were entered as 1.0 for a pharmacophore hit or 0.0 for no hit, corresponding to the binary fingerprint. The data for the dependent variables were either the log of the relative binding affinity (RBA) for data sets 1-3 or 1.0/0.0 to represent active or inactive in data set 4. Data Sets. We chose data sets for the estrogen receptor because of the recent therapeutic interest in this class of targets and the fact that there have been several quantitative structure-activity relationship (QSAR) models developed for ER ligands The crystal structure for ER-R has also recently been reported. 29 The data sets are summarized in Table 1 and described in detail below. Data Sets 1 and 2. The first two data sets used for the QSAR analysis were a set of 31 ER ligands with activity

3 PHARMACOPHORE FINGERPRINTING J. Chem. Inf. Comput. Sci., Vol. 39, No. 3, Table 1 set training testing 1 31 literature compounds 30 leave-one-out cross-validation on training set (RBA for human ER-R: ) 2 31 literature compounds 30 leave-one-out cross-validation on training set (RBA for rat ER-β: ) 3 set proprietary heterocycles (RBA for human ER-R: ) 18 heterocycles (distinct from training set) (RBA for human ER-R: ) 4 15 from set 1 (RBA g10.0) MDDR not estrogen 86 proprietary compounds (<1 µm for ER-R) MDDR estrogen compounds MDDR not estrogen compounds values for binding to human ER-R (set 1) and rat ER-β (set 2). 30 These are given as relative binding activity (RBA) compared to the activity of the natural ligand, estradiol (E2), which is given a value of Thus the higher the value, the more active the compound. The structures are illustrated in Figure 2. It can be seen that they are reasonably diverse, spanning several structural classes; some of the structures are quite rigid while others are somewhat flexible. There are two published crystal structures of ER-R with different ligands bound, 29 which cast light on the nature of the protein-ligand interactions. One crystal structure contains in the active site the natural ligand and agonist E2. The other has the antagonist raloxifene. Raloxifene is not part of data sets 1 and 2 but is structurally analogous to compounds 23, 24, 25, and 26 (Figure 2). Raloxifene makes additional interactions compared to E2, involving a positively charged group on a flexible side chain on the ligand. This is accommodated by a conformational shift in the protein that is the structural basis for antagonism in the ER. This indicates that this training set contains ligands with at least two different binding modes. Three different QSAR methods have been previously applied to these data sets 28 that we use for comparison of results. The methods apply PLS to different molecular descriptors: (1) comparative molecular field analysis (CoM- FA), a widely used method based on the calculation of a steric and electrostatic field on a grid around each ligand; (2) The CoDESSA program, which calculates descriptors for 2D and 3D structures and quantum-mechanical properties; and (3) hologram QSAR (HQSAR), which uses as a descriptor a molecular hologram constructed from counts of substructural molecular fragments (2D only). The results for these data sets are presented as r 2 and q 2, comparing the predicted and actual activity values. The q 2 calculation (cross-validated r 2 ) uses the leave-one-out (LOO) procedure. Data Set 3. This makes use of 35 heterocyclic compounds that have shown activity in our ER-R assay. The activity values have been expressed as RBA to make them equivalent to set 1; 17 of these were added to set 1 to give a training set of 48. The remaining 18 compounds were used for testing using the model derived from the training set, and the result is given for these compounds only. Data Set 4. This set was designed so that the method could be tested using binary activity values. The PLS algorithm was run with activity values of either 1.0 or 0.0. Separate training and testing sets, each with actives and inactives, were established as follows. The training set actives were the 15 compounds from set 1 which have a RBA of g10.0 (numbers 2-8, 20-22, 24-27, and 29 in Figure 2). The testing set actives were the compounds in the complete MDDR which Table 2 dataset statistic CoMFA HQSAR CODESSA PharmPrint 1 q r PCs q r PCs q 2 N/A N/A N/A 0.88 PCs N/A N/A N/A 6 have the string ESTROGEN in the activity_class field. This set was pruned to eliminate obvious prodrugs and to exclude compounds in the training set, the resulting number being 250. The MDDR9104 subset was pruned to exclude compounds that had the string ESTROGEN in the activity- _class field. This gave a set of 9040 from which 750 compounds were randomly chosen and included in the training set with activity values of 0.0, i.e., assumed inactive. The remaining 8290 were used for testing. To distinguish these compounds from ones which can be properly claimed to be inactive (as actually tested in the biological assay), we term these the background sets as opposed to inactives. At the training stage the active compounds were duplicated 50 times to give as much overall weight to the active as the background compounds. An additional testing set was established of 86 compounds from our corporate database which have an activity for ER-R of better then 1 µm (the ARI actives ). These were derived from combinatorial libraries, with scaffolds of three different chemical classes, none of which are represented in the training set, and which include most of the heterocyclics in data set 3. RESULTS AND DISCUSSION Table 2 presents the results for the PharmPrint/PLS QSAR on data sets 1-3 as described in Methodology. For comparison, the same statistics are reproduced for the three other QSAR methods previously applied 28 to data sets 1 and 2: comparative molecular field analysis (CoMFA), classical QSAR using the CoDESSA program, and hologram QSAR (HQSAR). It can be seen that for both data sets the r 2 (noncross-validated result) for the PharmPrint/PLS is comparable to the other three methods. With the cross-validated statistic, q 2, this value is higher for the Pharmprint/PLS than for any of the other methods. With set 3, the q 2 is higher still, at This suggests that the phamacophore fingerprints are more successfully generalizing from the data. This may not be surprising since the methodology uses not only 3D features but conformational flexibility as well. As a point of interest, our method initially used only the six pharmacophoric types A, D, H, N, P, R; unlabeled atoms

4 572 J. Chem. Inf. Comput. Sci., Vol. 39, No. 3, 1999 MCGREGOR AND MUSKAL Figure 2. The 31 structures in data sets 1 and 2. were ignored, giving a bitstring of length With this fingerprint it was very difficult to approach the accuracy given in Table 2, the q 2 statistic remaining around 0.60 or below (data not shown). Therefore it is clear that the previously unlabeled atoms, now given a default label X and included in the fingerprint of length , contain important information, probably related to molecular volume. The X type accounts for 49.7% of all atoms in molecules of the MDDR9104 set. In QSAR studies of this type it is considered important to be able to deal with different kinds of activity data. One situation that is common, especially with the results of an

5 PHARMACOPHORE FINGERPRINTING J. Chem. Inf. Comput. Sci., Vol. 39, No. 3, Figure 3. Distribution of binary activity prediction values for testing set 4. Table 3 mean sd % correct MDDR background MDDR estrogen ARI actives initial screening of primary libraries, is that data are available as to which compounds are active or inactive, but reliable IC50 or EC50 data have not been established. Thus training set 4 was established consisting of the 15 compounds from set 1 which have RBA values of g10.0. These were used with activity values of 1.0, ignoring the actual affinity values. It was considered desirable to include as many inactive compounds as possible, as the determinants of activity are not just the pharmacophores that occur more frequently in the active compounds but also ones which occur less frequently compared to a background hit rate. These pharmacophores may have a negative impact on activity when they are present. Since the PharmPrint method is very efficient, it is possible at the training stage to include many or all of the compounds in the MDDR set, the great majority of which can be assumed to be inactive for a particular target, which we term the background compounds. The results are presented graphically in Figure 3 and statistically in Table 3. The 8290 MDDR background compounds in the testing set are clustered close to zero. The 250 MDDR estrogen testing compounds and 86 ARI estrogen compounds are distributed between 0.0 and 1.0. In Figure 3, it is clear that both have a distribution that is clearly distinct from the background compounds. The ARI compounds have a distribution that is somewhat to the left of the MDDR estrogen compounds. This can be interpreted by considering that the MDDR estrogen compounds are generally of the same class as the training set. The ARI compounds, however, are derived from our combinatorial libraries and are of three distinct classes, none of which are represented in the training set. This gives some measure of the predictive ability across different classes of molecules. Table 3 gives the percentage of correctly classified compounds, assuming the MDDR background set is inactive and the MDDR estrogen and ARI compounds are active, and taking an arbitrary discrimination cutoff of 0.2. The results are 89.7, 87.4, and 87.2%, respectively. Given good results, it is also desirable to be able to interpret them in terms of structural features. With some computational methods this may be difficult. However, with the fingerprints, we can look at the weights produced by the Figure 4. Illustration to show a mapping of pharmacophore 1624 from Table 4 onto (i) estradiol (top), the natural ligand, and (ii) diethylstilbestrol (bottom), the most potent compound in data set 1. The hydroxyl groups are both hydrogen bond donors and acceptors (D, A), and the centroid of the aromatic ring is shown. Table 4 distances types rank pharm no. weight A D R D D R D D H D D X D A H H D R D D H D A R A D X D A X X A D A X R A R X X H D X A A X A A H X X X A A X A X X A D PLS analysis. Table 4 presents the 10 highest and 10 lowest pharmacophores rank ordered by the magnitude of the weights for the first principle component from data set 4. Positive weights indicate pharmacophores that are common in the active compounds relative to the background compounds, and negative weights indicate the reverse. For example, the pharmacophore ranked highest, number 1624, can be seen to be a strong feature of the active compounds in the training set. It consists of an aromatic group (R) Å from a hydrogen bond donor (D); this maps to the phenol group that is common to the most active compounds. There is a hydrogen bond acceptor atom (A) 7-10 Å from the ring centroid (R) and Å from the (D) atom. This maps to another hydroxyl or carbonyl group further away. Figure 4 shows how pharmacophore 1624 maps to the molecular structures of estradiol, the natural ligand, and diethylstilbestrol, the most active compound in set 1. This illustrates that when one pharmacophore hits two molecules, it is because of a similar presentation of functionality, even though the compounds may be of different structural classes. Most of the rest of the top scoring pharmacophores are similar to the first, having the same distance ranges and varying only in some of the types. The pharmacophores with

6 574 J. Chem. Inf. Comput. Sci., Vol. 39, No. 3, 1999 MCGREGOR AND MUSKAL the negative weights are more difficult to interpret in terms of the structures of the active compounds but appear to contribute roughly equally in magnitude to the model. CONCLUSIONS Pharmacophore fingerprinting is a promising approach to computer aided drug design. The PharmPrint fingerprint is a compact but information-rich descriptor. It is based on features observed to be important in ligand-receptor interactions, and it takes into account not only 3D structure but multiple conformations. We note the value of the X atom type that might help describe additional skeletal, support feature, and/or overall molecular volume. The QSAR performs well with structures where there is some conformational flexibility and where there are multiple structural classes and binding modes. The QSAR does not require the structure of the receptor, and no assumptions are required about how molecules overlap at the binding site. However, one can envision a pharmacophore description being derived from the binding site of a known receptor structure and a function used to compare that to the molecule fingerprint. This is an area of current investigation for us. The PharmPrint software runs in batch mode, requiring only a single 3D structure as input, and the output is a flat file. It does not require the construction of specialized databases. Thus it integrates well with our chemical information systems and can be triggered automatically as structures or libraries are added. Performance is such that tens of thousands of compounds can be processed each day. It is thus well-suited to our main application of building block selection for the design of focused or targeted combinatorial libraries using in silico screening. We are currently exploring ways to increase the efficiency of the calculation; these include parallelization strategies and ways to take advantage of the redundancy in enumerated combinatorial libraries where the same building block appears many times on the same scaffold. We will also be reporting on the use of this descriptor for what we consider the complementary problem to targeted designsprimary library designswhich brings up the questions of molecular diversity and coverage of drug space. ACKNOWLEDGMENT We would like to thank Charlie Peng for supplying the code for the rotation algorithm and Charles Hart and Ron Hale for the biology and chemistry, respectively. REFERENCES AND NOTES (1) Sprague, P. W. Automated Chemical Hypothesis Generation and Database Searching with Catalyst. PerspectiVes in Drug DiscoVery and Design; Müller, K., Ed.; ESCOM Science Publishers B. V.: Leiden, The Netherlands, 1995; Vol. 3, pp (2) Barnum, D.; Greene, J.; Smellie, A.; Sprague, P. Identification of Common Functional Configurations among Molecules. J. Chem. Inf. Comput. Sci. 1996, 36, (3) Greene, J.; Kahn, S.; Savoj, H.; Sprague, P.; Teig, S. Chemical Function Queries for 3D Database Search. J. Chem. Inf. Comput. Sci. 1994, 34, (4) Molecular Simulations Inc., San Diego, CA. (5) Chemical Design Ltd., Oxfordshire, U.K. (6) Good, A. C.; Kuntz, I. D. Investigating the extension of pairwise distance pharmacophore measures to triplet-based descriptors. J. Comput.-Aided Mol. Des. 1995, 9, 373. (7) Pickett, S. D.; Mason, J. S.; McLay, I. M. Diversity Profiling and Design Using 3D Pharmacophores: Pharmacophore-Derived Queries (PDQ). J. Chem. Inf. Comput. Sci. 1996, 36, (8) Mason, J. S.; Pickett, S. D. Partition-based selection. Perspect. Drug DiscoVery Des. 1997, 7/8, (9) Pickett, S. D.; Luttmann, C.; Guerin, V.; Laoui, A.; James, E. DIVSEL and COMPLIBsStrategies for the design and comparison of combinatorial libraries using pharmacophoric descriptors. J. Chem. Inf. Comput. Sci. 1998, 38, (10) Chen, X.; Rusinko, A.; Young, S. S. Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors. J. Chem. Inf. Comput. Sci. 1998, 38, (11) Gasteiger, J.; Rudolph, C.; Sadowski, J. Automatic Generation of 3D- Atomic Coordinates for Organic Molecules. Tetrahedron Comput. Methodol. 1990, 3, (12) Sadowski, J.; Gasteiger, J.; Klebe, G. Comparison of Automatic Three- Dimensional Model Builders Using 639 X-ray Structures. J. Chem. Inf. Comput. Sci. 1994, 34, (13) Gluck, D. J. A chemical structure storage and search system developed at Du Pont. J. Chem. Doc. 1965, 5, (14) Shoemake, K. Animating rotation with quaternion curves. SIGGRAPH 1985, 19, (15) MDL Information Systems, Inc., San Leandro, CA. (16) McGregor, M. J.; Pallai, P. V. Clustering of Large Databases of Compounds: Using the MDL Keys as Structural Descriptors. J. Chem. Inf. Comput. Sci. 1997, 37, (17) Lindberg, W.; Persson, J.-A.; Wold, S. Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and lignin sulfonate. Anal. Chem. 1983, 55, (18) Geladi, P.; Kowalski, B. R. Partial least-squares regression: A tutorial. Anal. Chim. Acta 1986, 185, (19) Wold, S.; Sjöström, M.; Eriksson, L. Partial least squares projections to latent structures (PLS) in chemistry. Encyclopedia of Computational Chemistry; John Wiley & Sons: New York, 1998; pp (20) Cramer, R. D., III; Patterson, D. E.; Bunce, J. D. Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. J. Am. Chem. Soc. 1988, 110, (21) Williams, C. L.; Stancel, G. M Estrogens and Progestins. In Goodman and Gillman s The Pharmacological Basis of Therapeutics, 9th ed.; Hardman, J. G., Limbird, L. E., Eds.; McGraw-Hill: New York, 1996; pp (22) Tong, W.; Perkins, R.; Strelitz, R.; Collantes, E. R.; Keenan, S.; Welsh, W. J.; Branham, W. S.; Sheehan, D. M. Quantitative structure-activity relationships (QSARs) for estrogen binding to the estrogen receptor: Predictions across species. EnViron. Health Perspect. 1997, 105, (23) Tong, W.; Perkins, R.; Xing, L.; Welsh, W. J.; Sheehan, D. M. QSAR models for binding of estrogenic compounds to estrogen receptor R and β subtypes. Endocrinology 1997, 138, (24) Waller, C. L.; Minor, D. L.; Mckinney, J. D. Examination of the estrogen receptor binding affinities of polychlorinated hydroxybiphenyls using three-dimensional quantitative structure-activity relationships. EnViron. Health Perspect. 1996, 103, (25) Bradbury, S. P.; Mekenyan, O. G.; Ankley, G. T. Quantitative structure-activity relationships for polychlorinated hydroxybiphenyl estrogen receptor binding affinity: An assessment of conformer flexibility. EnViron. Toxicol. Chem. 1996, 15, (26) Gantchev, T. G.; Ali, H.; van Lier, J. E. Quantitative Structure- Activity Relationships/Comparative Molecular Field Analysis (QSAR/ CoMFA) for Receptor-Binding Properties of Halogenated Estradiol Derivatives. J. Med. Chem. 1994, 37, (27) Waller, C. L.; Oprea, T. I.; Chae, K.; Park, H.-K.; Korach, K. S.; Laws, S. C.; Wiese, T. E.; Kelce, W. R.; Gray, L. E., Jr. Ligand- Based Identification of Environmental Estrogens. Chem. Res. Toxicol. 1996, 9, (28) Tong, W.; Lowis, D. R.; Perkins, R.; Chen, Y.; Welsh, W. J.; Goddette, D. W.; Heritage, T. W.; Sheehan, D. M. Evaluation of Quantitative Structure-Activity Relationship Methods for Large-Scale Prediction of Chemicals Binding to Estrogen Receptor. J. Chem. Inf. Comput. Sci. 1998, 38, (29) Brzozowski, A. M.; Pike, A. C. W.; Dauter, Z.; Hubbard, R. E.; Bonn, T.; Engstrom, O.; Ohman, L.; Greene, G. L.; Gustafsson, J.-A.; Carlquist, M. Molecular basis of agonism and antagonism in the estrogen receptor. Nature (London) 1997, 389, (30) Kuiper, G. G. J. M.; Carlsson, B.; Grandien, K.; Enmark, E.; Haeggblad, J.; Nilsson, S.; Gustafsson, J.-A. Comparison of the ligand binding specificity and transcript tissue distribution of estrogen receptors R and β. Endocrinology 1997, 138, CI980159J

An Integrated Approach to in-silico

An Integrated Approach to in-silico An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals

More information

In silico pharmacology for drug discovery

In silico pharmacology for drug discovery In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors. Robert D. Clark 2004 Tripos, Inc.

Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors. Robert D. Clark 2004 Tripos, Inc. Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors Robert D. Clark Tripos, Inc. bclark@tripos.com 2004 Tripos, Inc. Outline Background o history o mechanics Finding appropriate

More information

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen Overview GRIND GRid-INDependent Descriptors

More information

Similarity Search. Uwe Koch

Similarity Search. Uwe Koch Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance

More information

5.1. Hardwares, Softwares and Web server used in Molecular modeling

5.1. Hardwares, Softwares and Web server used in Molecular modeling 5. EXPERIMENTAL The tools, techniques and procedures/methods used for carrying out research work reported in this thesis have been described as follows: 5.1. Hardwares, Softwares and Web server used in

More information

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Computational chemical biology to address non-traditional drug targets. John Karanicolas Computational chemical biology to address non-traditional drug targets John Karanicolas Our computational toolbox Structure-based approaches Ligand-based approaches Detailed MD simulations 2D fingerprints

More information

Pacific Symposium on Biocomputing 6: (2001)

Pacific Symposium on Biocomputing 6: (2001) Molecular Fingerprinting on the SIMD Parallel Processor Kestrel Eric Rice and Richard Hughey Department of Computer Engineering University of California, Santa Cruz, CA 95064 felrice,rphg@cse.ucsc.edu

More information

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a Retrieving hits through in silico screening and expert assessment M.. Drwal a,b and R. Griffith a a: School of Medical Sciences/Pharmacology, USW, Sydney, Australia b: Charité Berlin, Germany Abstract:

More information

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017 Using Phase for Pharmacophore Modelling 5th European Life Science Bootcamp March, 2017 Phase: Our Pharmacohore generation tool Significant improvements to Phase methods in 2016 New highly interactive interface

More information

Receptor Based Drug Design (1)

Receptor Based Drug Design (1) Induced Fit Model For more than 100 years, the behaviour of enzymes had been explained by the "lock-and-key" mechanism developed by pioneering German chemist Emil Fischer. Fischer thought that the chemicals

More information

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells

More information

Priority Setting of Endocrine Disruptors Using QSARs

Priority Setting of Endocrine Disruptors Using QSARs Priority Setting of Endocrine Disruptors Using QSARs Weida Tong Manager of Computational Science Group, Logicon ROW Sciences, FDA s National Center for Toxicological Research (NCTR), U.S.A. Thanks for

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Workshop 1.2 Regulatory application of SAR/QSAR for priority setting of endocrine disruptors: A perspective*

Workshop 1.2 Regulatory application of SAR/QSAR for priority setting of endocrine disruptors: A perspective* Pure Appl. Chem., Vol. 75, Nos. 11 12, pp. 2375 2388, 2003. 2003 IUPAC Workshop 1.2 Regulatory application of SAR/QSAR for priority setting of endocrine disruptors: A perspective* Weida Tong 1,, Hong Fang

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

Generating Small Molecule Conformations from Structural Data

Generating Small Molecule Conformations from Structural Data Generating Small Molecule Conformations from Structural Data Jason Cole cole@ccdc.cam.ac.uk Cambridge Crystallographic Data Centre 1 The Cambridge Crystallographic Data Centre About us A not-for-profit,

More information

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Table of Contents Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Features

More information

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver

More information

Docking. GBCB 5874: Problem Solving in GBCB

Docking. GBCB 5874: Problem Solving in GBCB Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular

More information

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes

More information

Performing a Pharmacophore Search using CSD-CrossMiner

Performing a Pharmacophore Search using CSD-CrossMiner Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0

More information

DivCalc: A Utility for Diversity Analysis and Compound Sampling

DivCalc: A Utility for Diversity Analysis and Compound Sampling Molecules 2002, 7, 657-661 molecules ISSN 1420-3049 http://www.mdpi.org DivCalc: A Utility for Diversity Analysis and Compound Sampling Rajeev Gangal* SciNova Informatics, 161 Madhumanjiri Apartments,

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Statistical concepts in QSAR.

Statistical concepts in QSAR. Statistical concepts in QSAR. Computational chemistry represents molecular structures as a numerical models and simulates their behavior with the equations of quantum and classical physics. Available programs

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Design and Synthesis of the Comprehensive Fragment Library

Design and Synthesis of the Comprehensive Fragment Library YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of

More information

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

The PhilOEsophy. There are only two fundamental molecular descriptors

The PhilOEsophy. There are only two fundamental molecular descriptors The PhilOEsophy There are only two fundamental molecular descriptors Where can we use shape? Virtual screening More effective than 2D Lead-hopping Shape analogues are not graph analogues Molecular alignment

More information

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships QSAR/QSPR modeling Alexandre Varnek Faculté de Chimie, ULP, Strasbourg, FRANCE QSAR/QSPR models Development Validation

More information

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013 Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and

More information

Universities of Leeds, Sheffield and York

Universities of Leeds, Sheffield and York promoting access to White Rose research papers Universities of Leeds, Sheffield and York http://eprints.whiterose.ac.uk/ This is an author produced version of a paper published in Quantitative structure

More information

Data Quality Issues That Can Impact Drug Discovery

Data Quality Issues That Can Impact Drug Discovery Data Quality Issues That Can Impact Drug Discovery Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry,

More information

Virtual Screening: How Are We Doing?

Virtual Screening: How Are We Doing? Virtual Screening: How Are We Doing? Mark E. Snow, James Dunbar, Lakshmi Narasimhan, Jack A. Bikker, Dan Ortwine, Christopher Whitehead, Yiannis Kaznessis, Dave Moreland, Christine Humblet Pfizer Global

More information

Clustering Ambiguity: An Overview

Clustering Ambiguity: An Overview Clustering Ambiguity: An Overview John D. MacCuish Norah E. MacCuish 3 rd Joint Sheffield Conference on Chemoinformatics April 23, 2004 Outline The Problem: Clustering Ambiguity and Chemoinformatics Preliminaries:

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

Functional Group Fingerprints CNS Chemistry Wilmington, USA

Functional Group Fingerprints CNS Chemistry Wilmington, USA Functional Group Fingerprints CS Chemistry Wilmington, USA James R. Arnold Charles L. Lerman William F. Michne James R. Damewood American Chemical Society ational Meeting August, 2004 Philadelphia, PA

More information

Quantitative Structure-Activity Relationship Models for Prediction of Estrogen Receptor Binding Affinity of Structurally Diverse Chemicals

Quantitative Structure-Activity Relationship Models for Prediction of Estrogen Receptor Binding Affinity of Structurally Diverse Chemicals Iowa State University From the SelectedWorks of Steven P. Bradbury 2003 Quantitative Structure-Activity Relationship Models for Prediction of Estrogen Receptor Binding Affinity of Structurally Diverse

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

ENERGY MINIMIZATION AND CONFORMATION SEARCH ANALYSIS OF TYPE-2 ANTI-DIABETES DRUGS

ENERGY MINIMIZATION AND CONFORMATION SEARCH ANALYSIS OF TYPE-2 ANTI-DIABETES DRUGS Int. J. Chem. Sci.: 6(2), 2008, 982-992 EERGY MIIMIZATI AD CFRMATI SEARC AALYSIS F TYPE-2 ATI-DIABETES DRUGS R. PRASAA LAKSMI a, C. ARASIMA KUMAR a, B. VASATA LAKSMI, K. AGA SUDA, K. MAJA, V. JAYA LAKSMI

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application

More information

Prediction of Estrogen Receptor Binding for 58,000 Chemicals Using an Integrated System of a Tree-Based Model with Structural Alerts

Prediction of Estrogen Receptor Binding for 58,000 Chemicals Using an Integrated System of a Tree-Based Model with Structural Alerts Articles Prediction of Estrogen Receptor Binding for 58,000 Chemicals Using an ntegrated System of a Tree-Based Model with Structural Alerts Huixiao Hong, 1 Weida Tong, 1 Hong Fang, 1 Leming Shi, 2 Qian

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

Virtual screening in drug discovery

Virtual screening in drug discovery Virtual screening in drug discovery Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery

More information

Fast similarity searching making the virtual real. Stephen Pickett, GSK

Fast similarity searching making the virtual real. Stephen Pickett, GSK Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation

More information

Chemical library design

Chemical library design Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites Introduction J. Andrew Surface Hampden-Sydney College / Virginia Commonwealth University In the past several decades

More information

Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds

Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds Dr James Chisholm,* Dr John Barnard, Dr Julian Hayward, Dr Matthew Segall*, Mr Edmund Champness*, Dr Chris Leeding,* Mr Hector

More information

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening J. Chem. Inf. Model. 2008, 48, 1773 1791 1773 Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening I-Jen Chen* and Nicolas

More information

has its own advantages and drawbacks, depending on the questions facing the drug discovery.

has its own advantages and drawbacks, depending on the questions facing the drug discovery. 2013 First International Conference on Artificial Intelligence, Modelling & Simulation Comparison of Similarity Coefficients for Chemical Database Retrieval Mukhsin Syuib School of Information Technology

More information

Interactive Feature Selection with

Interactive Feature Selection with Chapter 6 Interactive Feature Selection with TotalBoost g ν We saw in the experimental section that the generalization performance of the corrective and totally corrective boosting algorithms is comparable.

More information

Jonathan S. Mason,, Isabelle Morize, Paul R. Menard,*, Daniel L. Cheney, Christopher Hulme, and Richard F. Labaudiniere

Jonathan S. Mason,, Isabelle Morize, Paul R. Menard,*, Daniel L. Cheney, Christopher Hulme, and Richard F. Labaudiniere J. Med. Chem. 1999, 42, 3251-3264 3251 New 4-Point Pharmacophore Method for Molecular Similarity and Diversity Applications: Overview of the Method and Applications, Including a Novel Approach to the Design

More information

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015 Ignasi Belda, PhD CEO HPC Advisory Council Spain Conference 2015 Business lines Molecular Modeling Services We carry out computational chemistry projects using our selfdeveloped and third party technologies

More information

Using AutoDock for Virtual Screening

Using AutoDock for Virtual Screening Using AutoDock for Virtual Screening CUHK Croucher ASI Workshop 2011 Stefano Forli, PhD Prof. Arthur J. Olson, Ph.D Molecular Graphics Lab Screening and Virtual Screening The ultimate tool for identifying

More information

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction PostDoc Journal Vol. 2, No. 3, March 2014 Journal of Postdoctoral Research www.postdocjournal.com QSAR Study of Thiophene-Anthranilamides Based Factor Xa Direct Inhibitors Preetpal S. Sidhu Department

More information

1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding Affinity to Estrogen Receptor 1.2.Other related models:

1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding Affinity to Estrogen Receptor 1.2.Other related models: QMRF identifier (ECB Inventory): QMRF Title: QSAR for Relative Binding Affinity to Estrogen Receptor Printing Date:Feb 16, 2010 1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding

More information

Spacer conformation in biologically active molecules*

Spacer conformation in biologically active molecules* Pure Appl. Chem., Vol. 76, No. 5, pp. 959 964, 2004. 2004 IUPAC Spacer conformation in biologically active molecules* J. Karolak-Wojciechowska and A. Fruziński Institute of General and Ecological Chemistry,

More information

Supplementary information

Supplementary information Electronic Supplementary Material (ESI) for MedChemComm. This journal is The Royal Society of Chemistry 2017 Supplementary information Identification of steroid-like natural products as potent antiplasmodial

More information

DOCKING TUTORIAL. A. The docking Workflow

DOCKING TUTORIAL. A. The docking Workflow 2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization

More information

An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection

An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection American Journal of Applied Sciences 8 (4): 368-373, 2011 ISSN 1546-9239 2010 Science Publications An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection

More information

Implementation of novel tools to facilitate fragment-based drug discovery by NMR:

Implementation of novel tools to facilitate fragment-based drug discovery by NMR: Implementation of novel tools to facilitate fragment-based drug discovery by NMR: Automated analysis of large sets of ligand-observed NMR binding data and 19 F methods Andreas Lingel Global Discovery Chemistry

More information

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft Pose and affinity prediction by ICM in D3R GC3 Max Totrov Molsoft Pose prediction method: ICM-dock ICM-dock: - pre-sampling of ligand conformers - multiple trajectory Monte-Carlo with gradient minimization

More information

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA Pharmacophore Model Development for the Identification of Novel Acetylcholinesterase Inhibitors Edwin Kamau Dept Chem & Biochem Kennesa State Uni ersit Kennesa GA 30144 Dept. Chem. & Biochem. Kennesaw

More information

Machine learning for ligand-based virtual screening and chemogenomics!

Machine learning for ligand-based virtual screening and chemogenomics! Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:

More information

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor

More information

Ultra High Throughput Screening using THINK on the Internet

Ultra High Throughput Screening using THINK on the Internet Ultra High Throughput Screening using THINK on the Internet Keith Davies Central Chemistry Laboratory, Oxford University Cathy Davies Treweren Consultants, UK Blue Sky Objectives Reduce Development Failures

More information

Three-dimensional molecular descriptors and a novel QSAR method

Three-dimensional molecular descriptors and a novel QSAR method Journal of Molecular Graphics and Modelling 21 (2002) 161 170 Three-dimensional molecular descriptors and a novel QSAR method Scott A. Wildman 1, Gordon M. Crippen College of Pharmacy, University of Michigan,

More information

molecules ISSN

molecules ISSN Molecules 2004, 9, 1004-1009 molecules ISSN 1420-3049 http://www.mdpi.org Performance of Kier-Hall E-state Descriptors in Quantitative Structure Activity Relationship (QSAR) Studies of Multifunctional

More information

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes

More information

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences Fragment based lead discovery - introduction György M. Keserű H2020 FRAGET etwork Hungarian Academy of Sciences www.fragnet.eu Hit discovery from screening Druglike library Fragment library Large molecules

More information

OECD QSAR Toolbox v.3.4

OECD QSAR Toolbox v.3.4 OECD QSAR Toolbox v.3.4 Predicting developmental and reproductive toxicity of Diuron (CAS 330-54-1) based on DART categorization tool and DART SAR model Outlook Background Objectives The exercise Workflow

More information

Practical QSAR and Library Design: Advanced tools for research teams

Practical QSAR and Library Design: Advanced tools for research teams DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In

More information

Ping-Chiang Lyu. Institute of Bioinformatics and Structural Biology, Department of Life Science, National Tsing Hua University.

Ping-Chiang Lyu. Institute of Bioinformatics and Structural Biology, Department of Life Science, National Tsing Hua University. Pharmacophore-based Drug design Ping-Chiang Lyu Institute of Bioinformatics and Structural Biology, Department of Life Science, National Tsing Hua University 96/08/07 Outline Part I: Analysis The analytical

More information

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No. 12 1801 Ligand-based QSAR Studies on the Indolinones Derivatives as Inhibitors of the Protein Tyrosine

More information

Similarity methods for ligandbased virtual screening

Similarity methods for ligandbased virtual screening Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening

More information

Author Index Volume

Author Index Volume Perspectives in Drug Discovery and Design, 20: 289, 2000. KLUWER/ESCOM Author Index Volume 20 2000 Bradshaw,J., 1 Knegtel,R.M.A., 191 Rose,P.W., 209 Briem, H., 231 Kostka, T., 245 Kuhn, L.A., 171 Sadowski,

More information

Identifying Interaction Hot Spots with SuperStar

Identifying Interaction Hot Spots with SuperStar Identifying Interaction Hot Spots with SuperStar Version 1.0 November 2017 Table of Contents Identifying Interaction Hot Spots with SuperStar... 2 Case Study... 3 Introduction... 3 Generate SuperStar Maps

More information

CSD. Unlock value from crystal structure information in the CSD

CSD. Unlock value from crystal structure information in the CSD CSD CSD-System Unlock value from crystal structure information in the CSD The Cambridge Structural Database (CSD) is the world s most comprehensive and up-todate knowledge base of crystal structure data,

More information

Identification of proteins by enzyme digestion, mass

Identification of proteins by enzyme digestion, mass Method for Screening Peptide Fragment Ion Mass Spectra Prior to Database Searching Roger E. Moore, Mary K. Young, and Terry D. Lee Beckman Research Institute of the City of Hope, Duarte, California, USA

More information

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Approach for andling of Protonation and Tautomer Ambiguities in Docking Experiments Ingo Dramburg BioSolve IT Gmb An der

More information

Quantification of free ligand conformational preferences by NMR and their relationship to the bioactive conformation

Quantification of free ligand conformational preferences by NMR and their relationship to the bioactive conformation Quantification of free ligand conformational preferences by NMR and their relationship to the bioactive conformation Charles Blundell charles.blundell@c4xdiscovery.com www.c4xdiscovery.com Rigid: single

More information

BioSolveIT. A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities

BioSolveIT. A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities Ingo Dramburg BioSolve IT Gmb An der Ziegelei 75 53757

More information

Structural Bioinformatics (C3210) Molecular Docking

Structural Bioinformatics (C3210) Molecular Docking Structural Bioinformatics (C3210) Molecular Docking Molecular Recognition, Molecular Docking Molecular recognition is the ability of biomolecules to recognize other biomolecules and selectively interact

More information

In Silico Investigation of Off-Target Effects

In Silico Investigation of Off-Target Effects PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics

More information

Mining Molecular Fragments: Finding Relevant Substructures of Molecules

Mining Molecular Fragments: Finding Relevant Substructures of Molecules Mining Molecular Fragments: Finding Relevant Substructures of Molecules Christian Borgelt, Michael R. Berthold Proc. IEEE International Conference on Data Mining, 2002. ICDM 2002. Lecturers: Carlo Cagli

More information

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-aturwissenschaftlichen Fakultät der Rheinischen

More information

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No other information available

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No other information available QMRF identifier (JRC Inventory):To be entered by JRC QMRF Title: Insubria QSAR PaDEL-Descriptor model for prediction of NitroPAH mutagenicity. Printing Date:Jan 20, 2014 1.QSAR identifier 1.1.QSAR identifier

More information

bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012

bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012 bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012 Outline Machine Learning Cheminformatics Framework QSPR logp QSAR mglur 5 CYP

More information

1. Some examples of coping with Molecular informatics data legacy data (accuracy)

1. Some examples of coping with Molecular informatics data legacy data (accuracy) Molecular Informatics Tools for Data Analysis and Discovery 1. Some examples of coping with Molecular informatics data legacy data (accuracy) 2. Database searching using a similarity approach fingerprints

More information