SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE

Size: px
Start display at page:

Download "SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE"

Transcription

1 ! SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE HENTABLI HAMZA UNIVERSITI TEKNOLOGI MALAYSIA

2 I hereby declare that I have read this thesis and in my opinion this thesis is sufficient in terms of scope and quality for the award of the degree of Master of Science (Computer Science) Signature : Name : Naomie Salim Date : May 26, 2014

3 i SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE HENTABLI HAMZA A thesis submitted in fulfilment of the requirements for the award of the degree of Master of Science (Computer Science) Faculty of Computing Universiti Teknologi Malaysia MAY 2014

4 I declare that this thesis entitled Shape-Based Two Dimensional Descriptor for Searching Molecular Database is the result of my own research except as cited in the references. The thesis has not been accepted for any degree and is not concurrently submitted in candidature of any other degree. Signature : Name : Hentabli Hamza Date : May 26, 2014

5 iii This research study is dedicated to my wife, to my mother, and in the memory of my father If we knew what it was we were doing, it would not be called research, would it? Albert Einstein Research is to see what everybody else has seen, and to think what nobody else has thought Albert Szent-Gyorgyi

6 iv ACKNOWLEDGEMENT Alhamdulillah, all praises to Allah S.W.T., The Most Greatest and The Most Merciful for His guidance and blessing, because without him I cant finished this research. I also wish to express my gratitude to my thesis supervisor, Prof. Dr. Naomie Salim for her enthusiastic guidance, valuable help, encouragement and patience for all the aspect of the thesis progress. Her numerous comments, criticisms and suggestion during the preparation of this project are greatly appreciated. Also, for her patience on problems that occurred during the process of completing this thesis. Furthermore, special thanks go to Dr. Faisal Saeed, for many days of scientific discussions, this work would not have been possible without his support. Many thanks goes to the people who supported me in write this thesis, especially amer tawfiq, Dr.Ali ahmed, Dr.Djamel. I also would like to thank all my friends that give me support and help during the writing of the thesis. Their support and help always give me the motivation and energy I need to finish the thesis. My appreciation also extended to all academic and non-academic member of the Faculty of Computing for their warm heart cooperation during my stay in Universiti Teknologi Malaysia. My heartfelt acknowledgement are expressed to my family, especially My mother and Wife, without their guidance, support, encouragement and advises I may never have overcome this long journey in my studies. When I felt down, their love always give me the strength to face the challenges of the research. I would also like to thank people that directly or indirectly help me in finishing the thesis. Thank you very much.

7 v ABSTRACT Biological functions of compounds can be predicted from similarity of their chemical structures to discover new compounds for drug development. Molecular similarity can also be used to infer unknown functions and side effects of existing drugs. A multitude of molecular similarity methods based on different molecular representations have been used to perform virtual screenings. The molecules are transformed into descriptors to create a chemical database which allows mathematical manipulation and searching of the chemical information contained in the molecules. In this research, a new Shape based Descriptor of Molecule (SBDM) was developed based on the 2-dimensional shape of a chemical compound. The outline shape of a molecule is split into parts that are related in graph connectivity. The first atom in the molecule is determined using the Morgan algorithm. The molecular features, such as atom name, bond type, angle and rings are represented using specific symbols based on some specification rules. Subsequent atoms are scanned in a clockwise direction with respect to the first atom. The scan is repeated until the first atom is reached again. Two similarity measures were used to evaluate the performance of the molecular descriptors, which are the Basic Local Alignment Search Tool (BLAST) and the Tanimoto coefficient. The performance of the SBDM is compared with six standard molecular descriptors. Simulation of virtual screening experiments with the MDL Drug Data Report database show the superiority of the shape-based descriptor, with % and % in terms of average recall rates for the top of 1 % and 5 % retrieved molecules, respectively, compared to the six standard descriptors mentioned earlier.

8 vi ABSTRAK Fungsi biologi sebatian boleh diramal dari persamaan struktur kimia mereka untuk meneroka sebatian baru untuk pembangunan ubat-ubatan. Persamaan molekul juga boleh digunakan untuk menilai fungsi yang tidak diketahui serta kesan sampingan ubat-ubatan sedia ada. Satu kaedah persamaan molekul berdasarkan perwakilan molekul yang berbeza telah digunakan untuk melaksanakan pemeriksaan maya. Molekul-molekul tersebut diubah menjadi penghurai untuk mewujudkan pangkalan data kimia yang membenarkan manipulasi matematik dan pencarian maklumat kimia yang terkandung di dalam molekul-molekul tersebut. Dalam kajian ini, satu Penghurai Molekul baru berasaskan Bentuk (SBDM) dibangunkan berdasarkan bentuk 2-dimensi sebatian kimia. Bentuk rangka molekul dipisahkan kepada bahagian-bahagian yang berkaitan dalam satu hubungan graf. Atom pertama dalam molekul ditentukan dengan menggunakan algoritma Morgan. Ciri-ciri molekul, seperti nama atom, jenis ikatan, sudut dan pergelangan diwakili menggunakan simbol-simbol tertentu berdasarkan beberapa peraturan spesifikasi. Atom-atom yang seterusnya diimbas mengikut arah jam bersandarkan atom pertama. Imbasan ini diulang sehingga atom pertama dicapai semula. Dua ukuran persamaan digunakan untuk menilai prestasi penghurai molekul, iaitu Alat Pencarian Penjajaran Tempatan Asas (BLAST) dan pekali Tanimoto. Prestasi SBDM dibandingkan dengan enam penghurai molekul piawai. Simulasi eksperimen pemeriksaan maya menggunakan pangkalan data Laporan Data Ubatan MDL menunjukkan keunggulan penghurai berdasarkan bentuk, di mana pencapaian 19.32% dan 34.13% dari segi purata kadar ingatan, masing-masing untuk 1% dan 5% molekul tertinggi yang dicapai semula, berbanding dengan enam penghurai piawai yang dinyatakan sebelum ini.

9 vii TABLE OF CONTENTS CHAPTER TITLE PAGE DECLARATION DEDICATION ACKNOWLEDGEMENT ABSTRACT ABSTRAK TABLE OF CONTENTS LIST OF TABLES LIST OF FIGURES ii iii iv v vi vii xi xiii 1 INTRODUCTION Introduction Problem Background Problem Statement Research Question Research Aim Objectives Research Scope Significance and justification of study Organization of the report Summary 7 2 LITERATURE REVIEW Introduction Chemical Database Chemical Compounds Molecular Similarity and Molecular searching Representation of Chemical Structures Fingerprint Descriptors 13

10 viii String Representation and Canonical SMILES Graph Representation Molecular Representation D Descriptors D Descriptors D descriptors Searching in Chemical Database Structure Searching Substructure Searching Similarity Searching Molecular Descriptors for Similarity Searching Similarity Coefficients Distance Coefficients Association Coefficients Correlation Coefficients Probabilistic Coefficients Shape Representation and Description Techniques Classifications of Shape Representation Techniques Chain Code Representation Syntactic Analysis Discussion Summary 45 3 RESEARCH METHODOLOGY Introduction Research Design General Research Framework Molecular Dataset Preparation Generating Shape-based Descriptors Language for Writing Descriptors of Outline Shape of Molecules D Fingerprint Descriptors of Outline Shape of Molecules Canonical Representation for Descriptors of Outline Shape of Molecules Generating Other Molecular Descriptors 57

11 ix Extended Connectivity Fingerprints (ECFP) Properties of ECFP Applications Representation and Generation The Generation Process Evaluate the Proposed Descriptors Structure-based approaches Ligand-based approaches Evaluation Measures of Similarity Performance Tanimoto Coefficient Recall Calculation Kendall rank correlation coefficient τ Basic Local Alignment Search Tool Summary 75 4 LANGUAGE FOR WRITING DESCRIPTORS OF OUT- LINE SHAPE OF MOLECULES Introduction Materials and Methods Specification Rules Experimental Design Validation of the Implemented System Results and Discussion Summary D FINGERPRINT DESCRIPTORS OF OUTLINE SHAPE OF MOLECULES Introduction Materials and Methods Experimental Design Results and Discussion Summary CANONICAL REPRESENTATIONS FOR DESCRIP- TORS OF OUTLINE SHAPE OF MOLECULES Introduction Materials and Methods 116

12 x 6.3 Experimental Design Validation of the implemented System Results and Discussion Summary CONCLUSION AND FUTURE WORK Conclusion Research Contributions Future Works 139 REFERENCES 140

13 xi LIST OF TABLES TABLE NO. TITLE PAGE 2.1 Example of SMILES structure of different molecules illustrating the design rules of SMILES-sturcture Examples of descriptors classified according to dimensionality, demonstrated on the saccharin molecule Comparison of the three type of searching method in chemical database Examples of distance coefficients Examples of association coefficients Examples of correlation coefficients MDDR Activity Classes for DS1 Data Set MDDR Activity Classes for DS2 Data Set MDDR Activity Classes for DS3 Data Set Interpretation of the correlation coefficients size for positive values Validation of the implemented System Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Rankings of various types of descriptors Based on Kendall W Test Results Numbers of Shaded Cells for Mean Recall of Actives molecules List of atoms name that contain two characters and their new labeling Sample of Lingo-DOSM descriptor Fragmentation of LWDOSM Molecule A 102

14 xii 5.4 Fragmentation of LWDOSM Molecule B Intermolecular Similarity Calculation for the molecule A and B Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5% for data set DS1 comparing with LWDOSM Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5 % for data set DS2 comparing with LWDOSM Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5 % for data set DS3 comparing with LWDOSM Rankings of various types of descriptors Based on Kendall W Test Results: Top 1% and 5% Numbers of Shaded Cells for Mean Recall of Actives Using Different De-scriptors: DS1-DS3 Top 1% and 5% Validation of the implemented System Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5% for data set DS1 comparing with LWDOSM and Lingo-DOSM Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5% for data set DS2 comparing with LWDOSM and Lingo-DOSM Retrieval results of top 1% for data set DS Retrieval results of top 5% for data set DS Retrieval results of top 1% and 5% for data set DS3 comparing DS2 comparing with LWDOSM and Lingo- DOSM Rankings of various types of descriptors based on Kendall W Test Results Numbers of Shaded Cells for Mean Recall of Actives Using Different Descriptors: DS1-DS3 Top 1% and 5% 135

15 xiii LIST OF FIGURES FIGURE NO. TITLE PAGE 2.1 Three main step for molecular descriptor The chemical structure of aspirin Fragment code representation for three molecules Examples of the SMILES Strings for a variety of molecules Examples of the Wisswesser Line Notation Strings of molecule Examples of the InChI Strings descriptor of molecule Examples of Canonical versus non-canonical SMILES Generation of SMILES: A)Break cycles, B)write as branches off a main backbone C)spanning tree D) Final Canonical SMILES The connection table for aspirin in the MDL format (hydrogen-suppressed form), along with its 2D structure Representations of a molecular structure in 1D, 2D, and 3D Classification of shape representation and description techniques (a) The directions of the eight connected chain code (K=8). (b) A sample object, a square, and (c) chain code presentation of the square Structural description of chromosome shape The general research operational framework Examples of low diversity molecules in MDDR dataset Examples of high diversity molecules in MDDR dataset The mean pairwise similarity (MPS) across each set of active molecules Framework for generating of Language for Writing Descriptors of Outline Shape of Molecules. 55

16 xiv 3.6 2D Fingerprint Descriptors of Outline Shape of Molecules Canonical Representation for Descriptors of Outline Shape of Molecules Generating ALOGP using Pipeline Pilot software Molecular Descriptor generation Illustration of the effect of iterative updating for a selected atom in a sample molecule ECFP generation process Generation of the fixed-length bit string Schematic representation of virtual screening approaches within early-phase drug discovery An example workflow for 3D shape-based flexible alignment: 1) 2D input structures; 2) 3D conformer is generated and the shape is colored by atomic types; 3) the volume intersection, which maximized during the alignment, is shown along with the resulting pose Overview of the Ligand Based Virtual Screening (LBVS) process. (A) Query and database compounds are encoded by pharmacophore patterns or descriptor vectors. (B) Chemical similarity between query and database compounds is quantified. (C) The library compounds are ranked according to their similarity with the query Tanimoto in similarity searching Generating word seeds Examples of how alignment seeds work. Larger word sizes usually mean that there are fewer regions to evaluate (e.g., part a versus part b). Once an alignment is seeded, BLAST extends the alignment according to a threshold (T), which is set by the user. When performing a BLAST query, the computer extends words with a neighborhood score greater than T (part c) The external and internal visiting movement in 2D graph The representation of internal regions (cyclic structures) The Angle Direction The disconnect parts representation LWDOSM calculation process The method to establish the q-letter query word list 99

17 xv 5.3 LINGO-DOSM fragmentation The adjacency matrix of the Graph (G) to represented the aspirin molecule Example of the Morgan algorithm Example of canonical enumerations for atoms Graphical representation of Tree T extracted from aspirin Molecule Process of generating the shape descriptor (SBMD) 123

18 CHAPTER 1 INTRODUCTION 1.1 Introduction Chemoinformatics is the collection, representation and organization of chemical data to create chemical information in which it can be applied to create chemical knowledge. In pharmaceutical and agrochemical industry, Chemoinformatics has been used for identification of novel compounds with useful and commercially valuable biological properties. The drug discovery process is very complex and it is a multi-disciplinary task with many stages to be performed in a long time. However, the molecule that has the potential to become drugs may cause unexpected long-term side effects and the drug discovery process can take about 12 years and the costs may be up to $350 million per drug [1]. The high costs of bringing a drug into the market have increased the pressure on the pharmaceutical industries. Therefore, attention is given to research and development in order to develop faster and more effective way to produce chemical compounds that can react to the disease and also produce antibodies towards the disease. This has encouraged the study of chemoinformatics and drug discovery as one of the new areas in Malaysia s research and development (R&D) [1]. The primary task of the similarity calculation and molecular searching in chemical database is to find and quantify the relationship between structures of compounds with the hope that the quantity represents similarity in physical, chemical, or biological properties. Thus, an appropriate description (molecular descriptor)

19 2 of chemical structure for the property to be represented is an essential task. Molecular descriptors are numerical values (or vectors of numbers) that characterize properties of compounds. They can be generated from a machine-readable structure representation like a 2D connection table or a set of experimental or calculated 3D coordinates. Molecular descriptors should be effective (can differentiate between different compounds) and efficient (fast to calculate) by playing a vital role in similarity calculations especially when a large database is used. However, there is a conflict between these two requirements since the most effective descriptors tend to be the least efficient to calculate, and vice versa. Molecular descriptors can be classified into 1D, 2D and 3D descriptors. Molecule representations based on the real shape of the molecules are important because it represents the physical interaction between molecules, especially protein-ligand bindings, instead most of the descriptors are developed based on the molecule features or topological indices. 1.2 Problem Background In organic chemistry research the chemists synthesize novel compounds that exhibit the desired properties and perform better than existing compounds. The commercial objective is to sell these new and superior compounds directly or in new products containing them. During the process from the initial creation of a new compound until it is commercialized, a lot of data related to it is generated such as physiochemical properties, analytical and toxicological data. All this data needs to be stored and scientists must be able to search it by chemical structure [2]. The chemist performs a chemical reaction and enters the relevant chemical structures, the exact procedure and outcome into this electronic lab notebook (ELN) so that other chemists can find it and repeat the reaction if necessary. After this new compound is synthesized, a new entry for it is generated in the chemical registration system. Unfortunately there is not much scientific literature concerning how and what such registration systems do [2]. The main purpose is to assign each compound a unique in-house reference number that then can be used to cross-reference it. There are many commercial software solutions for ELNs and chemical registrations such as those from PerkinElmer (CambridgeSoft), Accelrys, ChemAxon or Dotmatics to name a few. However these solutions may be costly especially if they also require a commercial relational database management system (RDBMS). An exception is

20 3 ChemAxon Compound Registration [2] [3] [4] which runs on MySQL. Another issue is that they are closed-source and if such systems are used in the context of scientific experiments, the experiment is not fully reproducable by other scientists unless they also have access to a license for the same system. Because these systems need to be highly configurable, they are also very complex. Configuring and administrating such a system requires a lot of very specific expertise. Consequentially, it can be advantageous to build your own solution especially for organizations that already have an in-house software development department [5, 6, 7]. A custom solution does not have to be highly configurable as it can be tailored to your needs. This includes administrative interfaces for user management and specific tasks like informing all users about new features. Such tools can greatly reduce administrative overhead [2]. The foundation of a chemical information system is the ability to represent molecules in a computer and to compare a molecule s structure with another. Molecular comparison has been used in the early chemical information systems, e.g. structure and substructure searching [8, 9]. Structure searching involves searching a chemical database for a particular query structure with aims to retrieve all molecules with an exact match to the query structure whereas substructure searching retrieves all molecules that contain the query structure as a subgraph. The equivalence (similarity) between two structures can be achieved by using a graph and subgraph isomorphism algorithms. Isomorphism algorithms are time consuming because it is a combinatorial problem. Various isomorphism algorithms have been developed for efficient performance but they are still too slow for large chemical databases [10, 11, 12, 13], However, structure and substructure searching were later complemented by another searching mechanism called similarity searching. Similarity searching methods may be the simplest tools for ligand based virtual screening. The basic idea in similarity searching is the similar property principle, which states that structurally similar molecules will exhibit similar physicochemical and biological properties [14]. Over the years, many ways of measuring the structural similarity of molecules have been introduced [15, 16]. 2D similarity methods can be divided into two classes, the first class is the graph-based similarity methods and the second class is the fingerprint-based similarity methods. The graph-based similarity methods directly compare the molecular structures with each other and identify the similar (or common) substructures. These methods relate parts of one molecule to parts of the other molecule, they generates a mapping or alignment between molecules. Maximum common sub-graph method (MCS) is an example of the graph-based

21 4 similarity methods. Another example of the graph-based similarity methods is the feature trees. The feature trees were introduced by Rarey and Dixon [17], which are the most abstract way of representing a molecule by means of a graph. A feature tree represents hydrophobic fragments and functional groups of the molecule and the way these groups are linked together. Each node in the tree is labelled with a set of features representing chemical properties of the part of the molecule corresponding to the node. The comparison of feature trees is based on matching subtrees of two feature trees onto each other. Feature trees allow similarity searching to be performed against large database, when combined with a fast mapping algorithms [18]. However, the most common similarity approaches use molecules characterized by fingerprints that encode the presence of fragments features in a molecule. The similarity between two molecules is then computed using the number of sub-structural fragments that is common to a pair of structures and a simple association coefficient [19]. 1.3 Problem Statement The shape similarity between two molecules can be determined by comparing the shapes of those molecules; find the overlap volume between them and then use similarity measure (e.g. Tanimoto) to calculate the similarity between the molecules. However, most of the works in shape-based similarity approaches depended on the 3D molecular shape [20, 21]. Recently, shape-based similarity approaches have been used more. The shape comparison program Rapid Overlay of Chemical Structures (ROCS) [22] is used to perceive similarity between molecules based on their 3D shape. The objective of this approach is to find molecules with similar bioactivity to a target molecule but with different chemotypes, i.e., scaffold hopping. However, a disadvantage of 3D similarity methods is that the conformational properties of the molecules should be considered and therefore these methods are more computationally intensive than methods based on 2D structure representation. The complexity increases considerably if conformational flexibility is taken into account. There are many 2D structure representations in a numerical form integer or real. The simplest 2D descriptors are based on simple counts of features such as hydrogen donors, hydrogen bond acceptors, ring systems (such as aromatic rings) and rotatable bonds, whereas the complex 2D descriptors are computed from complex mathematical equations such as 2D fingerprints and topological indices. Topological indices are integer or real value numbers (single value) that represent the constitution of the molecules and can be calculated from the 2D graph representation of molecules and may contain

22 5 additional property information about the molecule [23, 24, 25]. They characterize molecular structures according to their size, degree of branching and overall shape where the structural diagram of molecules is considered as a mathematical graph, but not the contour of molecule shape. In this thesis, we introduced a new shape-based molecular descriptor that have been inspired by research in information retrieval on the use of contour based shape descriptor for image retrieval systems [26, 27, 28]. Shape-based molecular descriptor is a new method to obtain rough description of 2D molecular structure from its 2D outline shape of its 2D diagram. shape-based molecular descriptor is a textual descriptor that allows rigorous structure specification by the use of a very small and natural grammar. 1.4 Research Question This study will focus on the following questions: 1. How can we develop new descriptor based on 2D shape of molecule to be used in similarity searching? 2. How can this descriptor be modeled to represent the molecule in fingerprint format? 3. How can graph theory be applied to canonical representation of molecule for similarity searching? 1.5 Research Aim Molecular descriptors play a fundamental role in chemistry, pharmaceutical sciences, environmental protection policy, and health researches. The molecules are transformed into numbers, allowing some mathematical treatment of the chemical information contained in the molecule. The aim of this research is to propose and develop a new descriptor for chemical compounds based on their 2D shape used in

23 6 similarity searching and molecular retrieval. 1.6 Objectives The objectives of this research are as follows: 1. To develop a new descriptor based on 2D molecular structure shape for similarity searching. 2. To develop a new fingerprint descriptor for shape based representation of chemical compound. 3. To develop canonical representation based on 2D shape of molecule and graph theory for molecular similarity searching. 1.7 Research Scope In order to achieve the objectives stated above, the scope of this study is limited to the following: 1. Develop the new descriptor based on the shape of molecules extracted from 2D connection table representation. 2. To evaluate the new descriptor and compared with the six standard descriptors (ALOGP, MACCS, EPFP4, CDKFP, PCFP, GRFP), then evaluate using Tanimoto coefficient. 3. The database aimed to be used in this study is only limited to chemical data from MDL Drug Data Report (MDDR).

24 7 1.8 Significance and justification of study The field of molecular descriptors is strongly interdisciplinary and involves a mass of different theories. For the definition of molecular descriptors, knowledge of algebra, graph theory, information theory, computational chemistry, theories of organic reactivity and physical chemistry is usually required. This study is going to produce a descriptor based on shape of molecules, which can be used in similarity searching in chemical database for drug design and discovery. 1.9 Organization of the report This thesis will be organized into seven Chapters. Chapter 2 will describe the relevant literature review, the fundamentals of chemoinformatics from the molecule descriptors and the molecule searching. Chapter 3 introduces the methodology that is used to build up the proposed descriptor. Chapter 4 introduces the development of the first model labeled as Language for Writing Descriptor or Outline Shape of Molecule (LWDOSM). The development of the second model 2D Fingerprint Descriptors of Outline Shape of Molecules using the stepwise fragmentation of the first language is presented in Chapter 5 and finally Chapter 6 presents the third and the last model of our work shape-based molecular descriptor SBDM. Chapter 7 discusses and concludes this thesis, highlights the contribution and finding of this work, and provide suggestions and recommendations for future research Summary The molecular descriptor is the final result of a logic and mathematical procedure which transforms chemical information encoded within a symbolic representation of a molecule into a useful number or the result of some standardized experiment. This Chapter gives an overview of the problems involved in the representation of chemical database that store the molecular structures. It highlights the importance of the problem, especially for similarity searching and drug discovery.

25 REFERENCES 1. Barry Werth. Billion Dollar Molecule: The Quest for the Perfect Drug. SimonandSchuster. com, Norbert Haider. Functionality pattern matching as an efficient complementary structure/reaction search tool: an open-source approach. Molecules, 15(8): , Bing Xia, Zheng-Fu Tai, Yu-Cheng Gu, Bang-Jing Li, Li-Sheng Ding, and Yan Zhou. Mymoldb: A micromolecular database solution with open source and free components. Journal of Computational Chemistry, 32(13): , Bruno Bienfait and Peter Ertl. Jsme: a free molecule editor in javascript. Journal of cheminformatics, 5(1):1 6, John J Irwin and Brian K Shoichet. Zinc-a free database of commercially available compounds for virtual screening. Journal of chemical information and modeling, 45(1): , Stuart Armstrong, Garrett M Morris, Paul W Finn, Raman Sharma, Loris Moretti, Richard I Cooper, and W Graham Richards. Electroshape: fast molecular similarity calculations incorporating shape, chirality and electrostatics. Journal of computer-aided molecular design, 24(9): , Bahram Hemmateenejad, Saeed Yousefinejad, and Ahmad Reza Mehdipour. Novel amino acids indices based on quantum topological molecular similarity and their application to qsar study of peptides. Amino acids, 40(4): , Craig and Ebert. Eleven years of structure seraching using the skf (smith,

26 141 kline and french) fragmentaton codes. J. Chem. Doc., 9: , G. G. Vander Stouw, P. M. Elliott, and A. C. Isenberg. Automated conversion of chemical substance names to atom-bond connection tables. Journal of Chemical Documentation, 14(4): , P. Willett, J. M. Barnard, and G. M. Downs. Chemical similarity searching. J. Chem. Inf. Comput. Sci., 38: , ito Kei, Tanaka Nobuya, and Fujita Shinsaku. Development and application of xym2mol system for convertingstructural data by xym notation into connection tables. Journal of Computer Chemistry, Japan, 4(2):79 88, Eleanor M. Ricketts, John Bradshaw, Mike Hann, Fiona Hayes, Neil Tanna, and David M. Ricketts. Comparison of conformations of small molecule structures from the protein data bank with those generated by concord, cobra, chemdbs-3d, and converter and those extracted from the cambridge structural database. Journal of Chemical Information and Computer Sciences, 33: , William Fisanick, Kevin P. Cross, and Andrew Rusinko. Similarity searching on cas registry substances. 1. global molecular property and generic atom triangle geometric searching. Journal of Chemical Information and Computer Sciences, 32: , P. G. Dittmar, N. A. Farmer, W. Fisanick, R. C. Haines, and J. Mockus. The cas online search system. 1. general system design and selection, generation, and use of search screens. Journal of Chemical Information and Computer Sciences, 23:93 102, Raymond E. Carhart, Dennis H. Smith, and R. Venkataraghavan. Atom pairs as molecular features in structure-activity studies: definition and applications. Journal of Chemical Information and Computer Sciences, 25:64 73, Ash and E Janet. Communication, storage, and retrieval of chemical information. E. Horwood; Halsted Press, R.E. Tarjan. Graph algorithms in chemical computation. R.E. Christoffersen. American Chemical Society Symposium Series, 46:1 20, 1977.

27 H. L. Morgan. The generation of a unique machine description for chemical structures-a technique developed at chemical abstracts service. Journal of Chemical Documentation, 5: , M. Garey and D. Johnson. The rectilinear steiner tree problem is npcomplete. SIAM Journal on Applied Mathematics, 32(4): , Barnard and M. John. Substructure searching methods: Old and new. Journal of Chemical Information and Computer Sciences, 33: , P.M. Dean and R. A. Lewis. Molecular Diversity in Drug Design. Kluwer,Amsterdam, P. Willett. Structural biology in drug metabolism and drug discovery. Biochemical Society Transactions, 31(part 3), P. Willett. Similarity-based virtual screening using 2d fingerprints. Drug discovery today, 11(23): , Renxiao Wang and Shaomeng Wang. How does consensus scoring work for virtual library screening? an idealized computer experiment. Journal of Chemical Information and Computer Sciences, 41: , P. H. A. Sneath and R. R. Sokal. Numerical taxonomy. The principles and practice of numerical classification. San Francisco, Theodosios Pavlidis. Algorithms for shape analysis of contours and waveforms. Pattern Analysis and Machine Intelligence, IEEE Transactions on, PAMI-2(4): , july H. Freeman and A Saghri. Generalized chain codes for planar curves. Proceedings of the 4th International Joint Conference on Pattern Recognition, pages , H Freeman. On the encoding of arbitrary geometric configurations. Electronic Computers, IRE Transactions on, EC-10: , june Mark A Johnson and Gerald M Maggiora. Concepts and applications of molecular similarity. Wiley New York, 1990.

28 Amos Tversky. Features of similarity. Psychological review, 84: , Dennis H Rouvray. Definition and role of similarity concepts in the chemical and physical sciences. Journal of chemical information and computer sciences, 32(6): , Thomas R Hagadone. Molecular substructure similarity searching: efficient retrieval in two-dimensional structure databases. Journal of chemical information and computer sciences, 32: , David Weininger. Smiles, a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences, 28(1):31 36, David Weininger, Arthur Weininger, and Joseph L Weininger. Smiles 2. algorithm for generation of unique smiles notation. Journal of Chemical Information and Computer Sciences, 29(2):97 101, Ramaswamy Nilakantan, Norman Bauman, J. Scott Dixon, and R. Venkataraghavan. Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors. Journal of Chemical Information and Computer Sciences, 27(2):82 85, Harry Wiener. Structural determination of paraffin boiling points. Journal of the American Chemical Society, 69(1):17 20, Balaban and T. Alexandru. Applications of graph theory in chemistry. Journal of Chemical Information and Computer Sciences, 25(3): , Richard A. Lewis, Jonathan S. Mason, and Iain M. McLay. Similarity measures for rational set selection and analysis of combinatorial libraries:the diverse property-derived (dpd) approach. Journal of Chemical Information and Computer Sciences, 37: , Dmitrii Filimonov, Vladimir Poroikov, Yulia Borodina, and Tatyana Gloriozova. Chemical similarity assessment through multilevel neighborhoods of atoms: Definition and comparison with the other descriptors. Journal of

29 144 Chemical Information and Computer Sciences, 39: , Sibson R Jardine, N. Mathematical taxonomy. John Wiley and Sons, NY, Zdenek Hubalek. Coefficients of association and similarity, based on binary (presence-absence) data: An evaluation. Biological Reviews, 57: , D. Ellis, J. Furner-Hines, and P Willett. Measuring the degree of similarity between objects in text-retrieval systems. Perspect. Inf. Manage., 3: , G. W. Adamson and J. A Bush. A comparison of the performance of some similarity and dissimilarity measures in the automatic classification of chemical structures. J. Chem. Inf. Comput. Sci., 15:55 70, Zhang Dengsheng and Lu Guojun. Review of shape representation and description techniques. Pattern Recognition, 37(1):1 19, Bradley D. Christie, Burton A. Leland, and James G. Nourse. Structure searching in chemical databases by direct lookup methods. Journal of Chemical Information and Computer Sciences, 33: , W. Fisanick, A. H. Lipkus, and A Rusinko. Similarity searching on cas registry substances. 2. 2d structural similarity. J. Chem. Inf. Comput. Sci., 34: , John Figueras. Substructure search by set reduction. Journal of Chemical Documentation, 12: , Edward H. Sussenguth. A graph-theoretic algorithm for matching chemical structures. Journal of Chemical Documentation, 5:36 43, Annette Von Scholley. A relaxation algorithm for generic chemical structure screening. Journal of Chemical Information and Computer Sciences, 24: , David J. Wild and Peter Willett. Similarity searching in files of

30 145 three-dimensional chemical structures. alignment of molecular electrostatic potential fields with a genetic algorithm. Journal of Chemical Information and Computer Sciences, 36: , A Barth. Status and future development of reaction databases and online retrieval systems. J. Chem. Inf. Comput. Sci., 30: , P. Sheridan Robert and K. Kearsley Simon. Why do we need so many chemical similarity search methods? Drug Discovery Today, 7(17): , Nina Nikolova and Joanna Jaworska. Approaches to measure chemical similarity a review. QSAR & Combinatorial Science, 22(9-10): , AnaG. Maldonado, J.P. Doucet, Michel Petitjean, and Bo-Tao Fan. Molecular similarity and diversity in chemoinformatics: From theory to applications. Molecular Diversity, 10:39 79, Matthias Rarey and J.Scott Dixon. Feature trees: A new molecular similarity measure based on tree matching. Journal of Computer-Aided Molecular Design, 12: , P. J. Hansen and P. C Jurs. Chemical applications of graph theory. J. Chem. Ed., 65: , J. A. GRANT, M. A. GALLARDO, and B. T. PICKUP. A fast method of molecular shape comparison: A simple application of a gaussian description of molecular shape. Journal of Computational Chemistry, 17: , Jennings and Mike Tennant. Selection of molecules based on shape and electrostatic similarity: Proof of concept of electroforms. Journal of Chemical Information and Modeling, 47(5): , G. Madhavi Sastry, Steven L. Dixon, and Woody Sherman. Rapid shapebased ligand alignment and virtual screening method based on atom/featurepair similarities and volume overlap scoring. Journal of Chemical Information and Modeling, 51(10): , 2011.

31 Kirstin Moffat, Valerie J. Gillet, Martin Whittle, Gianpaolo Bravi, and Andrew R. Leach. A comparison of field-based similarity searching methods: Catshape, fbss, and rocs. Journal of Chemical Information and Modeling, 48(4): , Jayashree Srinivasan, Angelo Castellino, Erin K. Bradley, John E. Eksterowicz, Peter D. J. Grootenhuis, Santosh Putta, and Robert V. Stanton. Evaluation of a novel shape-based computational filter for lead evolution: Application to thrombin inhibitors. Journal of Medicinal Chemistry, 45(12): , Thomas S. Rush, J. Andrew Grant, Lidia Mosyak, and Anthony Nicholls. A shape-based 3-d scaffold hopping method and its application to a bacterial protein protein interaction. Journal of Medicinal Chemistry, 48(5): , Wendy A. Warr. Representation of chemical structures. Wiley Interdisciplinary Reviews: Computational Molecular Science, 1(4): , B Kier Lemont and H Hall Lowell. Molecular connectivity: intermolecular accessibility and encounter simulation. Journal of Molecular Graphics and Modelling, 20(1):76 83, H. Lowell, Hall, and B. Kier Lemont. Issues in representation of molecular structure: The development of molecular connectivity. Journal of Molecular Graphics and Modelling, 20(1):4 18, Randi Milan. The connectivity index 25 years after. Journal of Molecular Graphics and Modelling, 20(1):19 35, Thierry Kogej, Ola Engkvist, Niklas Blomberg, and Sorel Muresan. Multifingerprint based similarity searches for targeted class compound selection. Journal of Chemical Information and Modeling, 46(3): , Slimane Larabi, Saliha Bouagar, FelixMiguel Trespaderne, and Eusebiodela- Fuente Lopez. Lwdos: Language for writing descriptors of outline shapes. In Josef Bigun and Tomas Gustavsson, editors, Image Analysis, volume 2749

32 147 of Lecture Notes in Computer Science, pages Springer Berlin Heidelberg, Slimane Larabi. Textual description of shapes. Journal of Visual Communication and Image Representation, 20(8): , Ammar Abdo and Naomie Salim. Similarity-based virtual screening using bayesian inference network: Enhanced search using 2d fingerprints and multiple reference structures. QSAR & Combinatorial Science, 28(6-7): , Ammar Abdo and Naomie Salim. Similarity-based virtual screening with a bayesian inference network. ChemMedChem, 4(2): , Forty Liz, Smith Daniel, Jones Lisa, Jones Ian, Caesar Sian, Fraser Christine, Gordon-Smith Katherine, and Craddock Nick. Identifying hypomanic features in major depressive disorder using the hypomania checklist (hcl-32). Journal of Affective Disorders, 114(13):68 73, He Ningning, Wang Xiaoqi, Kim Nayoung, Lim Jong-Seok, and Yoon Sukjoon. 3d shape-based analysis of cell line-specific compound response in cancers. Journal of Molecular Graphics and Modelling, 43(0):41 46, Li Guo-Bo, Yang Ling-Ling, Shan Feng, Zhou Jian-Ping, Huang Qi, Xie Huan-Zhang, Li Lin-Li, and Yang Sheng-Yong. Discovery of novel mglur1 antagonists: A multistep virtual screening approach based on an {SVM} model and a pharmacophore hypothesis significantly increases the hit rate and enrichment factor. Bioorganic & Medicinal Chemistry Letters, 21(6): , T. Maalouf Fadi, Brent David, Clark Luke, Tavitian Lucy, Munnell McHugh Rebecca, J. Sahakian Barbara, and L. Phillips Mary. Neurocognitive impairment in adolescent major depressive disorder: State vs. trait illness markers. Journal of Affective Disorders, 133(3): , T. Maalouf Fadi, Clark Luke, Tavitian Lucy, J. Sahakian Barbara, Brent David, and L. Phillips Mary. Bias to negative emotions: A depression state-dependent marker in adolescent major depressive disorder. Psychiatry Research, 198(1):28 33, 2012.

33 Z Shi, XH Ma, C Qin, J Jia, YY Jiang, CY Tan, and YZ Chen. Combinatorial support vector machines approach for virtual screening of selective multitarget serotonin reuptake inhibitors from large compound libraries. Journal of Molecular Graphics and Modelling, 32:49 66, J. Hert, P. Willett, D. J. Wilton, P. Acklin, K. Azzaoui, E. Jacoby, and A Schuffenhauer. New methods for ligand-based virtual screening: use of data fusion and machine learning to enhance the effectiveness of similarity searching. J. Chem. Inf. Model., 46: , David Rogers and Mathew Hahn. Extended-connectivity fingerprints. Journal of chemical information and modeling, 50(5): , Ye Hu, Eugen Lounkine, and Jurgen Bajorath. Improving the search performance of extended connectivity fingerprints through activity-oriented feature filtering and application of a bit-density-dependent similarity function. ChemMedChem, 4(4): , Andreas Bender, Jeremy L Jenkins, Meir Glick, Zhan Deng, James H Nettles, and John W Davies. improve retrieval rates in virtual screening and define orthogonal bioactivity space. Journal of chemical information and modeling, 46(6): , Dragos Horvath. A virtual screening approach applied to the search for trypanothione reductase inhibitors. Journal of medicinal chemistry, 40: , Dik-Lung Ma, Daniel Shiu-Hin Chan, and Chung-Hang Leung. Drug repositioning by structure-based virtual screening. Chemical Society Reviews, 42(5): , Claudio N Cavasotto. Homology models in docking and high-throughput docking. Current topics in medicinal chemistry, 11(12): , Peter Ripphausen, Britta Nisius, Lisa Peltason, and Jurgen Bajorath. Quo vadis, virtual screening? a comprehensive survey of prospective applications. Journal of medicinal chemistry, 53(24): , Tiejun Cheng, Qingliang Li, Zhigang Zhou, Yanli Wang, and Stephen H

34 149 Bryant. Structure-based virtual screening for drug discovery: a problemcentric review. The AAPS journal, 14(1): , Herve Abdi. The kendall rank correlation coefficient. Encyclopedia of Measurement and Statistics. Sage, Thousand Oaks, CA, pages , Miroslav Chraska. Metody pedagogickeho vyzkumu. Grada Publishing as, Stephen F Altschul, Warren Gish, Webb Miller, Eugene W Myers, and David J Lipman. Basic local alignment search tool. Journal of molecular biology, 215(3): , Chris Oehmen and Jarek Nieplocha. Scalablast: A scalable implementation of blast for high-performance data-intensive bioinformatics analysis. Parallel and Distributed Systems, IEEE Transactions on, 17(8): , Christopher S Oehmen and Douglas J Baxter. Scalablast 2.0: rapid and robust blast calculations on multiprocessor systems. Bioinformatics, 29(6): , Christiam Camacho, George Coulouris, Vahram Avagyan, Ning Ma, Jason Papadopoulos, Kevin Bealer, and Thomas Madden. Blast: architecture and applications. BMC bioinformatics, 10(1): , Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman. Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Research, 25: , Chun Wei Yap. Padel-descriptor: An open source software to calculate molecular descriptors and fingerprints. Journal of Computational Chemistry, 32(7): , Ammar Abdo, Beining Chen, Christoph Mueller, Naomie Salim, and Peter Willett. Ligand-based virtual screening using bayesian networks. Journal of Chemical Information and Modeling, 50(6): , 2010.

35 Robert D. Brown and Yvonne C. Martin. Use of structurea activity data to compare structure-based clustering methods and descriptors for use in compound selection. Journal of Chemical Information and Computer Sciences, 36: , Hans Matter and Thorsten Peter. Comparing 3d pharmacophore triplets and 2d fingerprints for selecting diverse compound subsets. Journal of Chemical Information and Computer Sciences, 39: , S. C. Basak, V. R. Magnuson, G. J. Niemi, and R. R Regal. Determining structural similarity of chemicals using graph-theoretic indices. Discrete Appl. Math., 19:17 30, David Vidal, Michael Thormann, and Miquel Pons. Lingo, an efficient holographic text based method to calculate biophysical properties and intermolecular similarities. Journal of Chemical Information and Modeling, 45(2): , Matthias Rarey and Martin Stahl. Similarity searching in large combinatorial chemistry spaces. Journal of Computer-Aided Molecular Design, 15: , 2001.

SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE

SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE 1.2411.25! SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE HENTABLI HAMZA UNIVERSITI TEKNOLOGI MALAYSIA I hereby declare that I have read this thesis and in my opinion this thesis

More information

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM i FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM A thesis submitted in fulfilment of the requirements for the award of the degree of Doctor of Philosophy (Computer

More information

An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection

An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection American Journal of Applied Sciences 8 (4): 368-373, 2011 ISSN 1546-9239 2010 Science Publications An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection

More information

COMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA

COMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA http://www.ftsm.ukm.my/apjitm Asia-Pacific Journal of Information Technology and Multimedia Jurnal Teknologi Maklumat dan Multimedia Asia-Pasifik Vol. 7 No. 1, June 2018: 91-98 e-issn: 2289-2192 COMPARISON

More information

has its own advantages and drawbacks, depending on the questions facing the drug discovery.

has its own advantages and drawbacks, depending on the questions facing the drug discovery. 2013 First International Conference on Artificial Intelligence, Modelling & Simulation Comparison of Similarity Coefficients for Chemical Database Retrieval Mukhsin Syuib School of Information Technology

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Similarity Search. Uwe Koch

Similarity Search. Uwe Koch Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance

More information

JCICS Major Research Areas

JCICS Major Research Areas JCICS Major Research Areas Chemical Information Text Searching Structure and Substructure Searching Databases Patents George W.A. Milne C571 Lecture Fall 2002 1 JCICS Major Research Areas Chemical Computation

More information

DATA FUSION APPROACHES IN LIGAND-BASED VIRTUAL SCREENING: RECENT DEVELOPMENTS OVERVIEW

DATA FUSION APPROACHES IN LIGAND-BASED VIRTUAL SCREENING: RECENT DEVELOPMENTS OVERVIEW DATA FUSION APPROACHES IN LIGAND-BASED VIRTUAL SCREENING: RECENT DEVELOPMENTS OVERVIEW Mubarak Himmat 1, Naomie Salim 1, Ali Ahmed 1, 2, and Mohammed Mumtaz Al-Dabbagh 1 1 Faculty of Computing, University

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

Bioisosteres in Medicinal Chemistry

Bioisosteres in Medicinal Chemistry Edited by Nathan Brown Bioisosteres in Medicinal Chemistry VCH Verlag GmbH & Co. KGaA Contents List of Contributors Preface XV A Personal Foreword XI XVII Part One Principles 1 Bioisosterism in Medicinal

More information

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015 Ignasi Belda, PhD CEO HPC Advisory Council Spain Conference 2015 Business lines Molecular Modeling Services We carry out computational chemistry projects using our selfdeveloped and third party technologies

More information

Universities of Leeds, Sheffield and York

Universities of Leeds, Sheffield and York promoting access to White Rose research papers Universities of Leeds, Sheffield and York http://eprints.whiterose.ac.uk/ This is an author produced version of a paper published in Organic & Biomolecular

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

Similarity methods for ligandbased virtual screening

Similarity methods for ligandbased virtual screening Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

An Integrated Approach to in-silico

An Integrated Approach to in-silico An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals

More information

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics

More information

COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR

COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR A dissertation submitted in partial fulfillment of the requirements for the award of the degree of Master of Science

More information

Mining Molecular Fragments: Finding Relevant Substructures of Molecules

Mining Molecular Fragments: Finding Relevant Substructures of Molecules Mining Molecular Fragments: Finding Relevant Substructures of Molecules Christian Borgelt, Michael R. Berthold Proc. IEEE International Conference on Data Mining, 2002. ICDM 2002. Lecturers: Carlo Cagli

More information

Practical QSAR and Library Design: Advanced tools for research teams

Practical QSAR and Library Design: Advanced tools for research teams DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In

More information

EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA

EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA A thesis submitted in fulfilment of the requirements for the award of the degree of Master of Engineering (Electrical) Faculty of Electrical

More information

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Computational chemical biology to address non-traditional drug targets. John Karanicolas Computational chemical biology to address non-traditional drug targets John Karanicolas Our computational toolbox Structure-based approaches Ligand-based approaches Detailed MD simulations 2D fingerprints

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor

More information

Research Article. Chemical compound classification based on improved Max-Min kernel

Research Article. Chemical compound classification based on improved Max-Min kernel Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(2):368-372 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Chemical compound classification based on improved

More information

Molecular Similarity Searching Using Inference Network

Molecular Similarity Searching Using Inference Network Molecular Similarity Searching Using Inference Network Ammar Abdo, Naomie Salim* Faculty of Computer Science & Information Systems Universiti Teknologi Malaysia Molecular Similarity Searching Search for

More information

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017 Using Phase for Pharmacophore Modelling 5th European Life Science Bootcamp March, 2017 Phase: Our Pharmacohore generation tool Significant improvements to Phase methods in 2016 New highly interactive interface

More information

Universities of Leeds, Sheffield and York

Universities of Leeds, Sheffield and York promoting access to White Rose research papers Universities of Leeds, Sheffield and York http://eprints.whiterose.ac.uk/ This is an author produced version of a paper published in Statistical Analysis

More information

Hit Finding and Optimization Using BLAZE & FORGE

Hit Finding and Optimization Using BLAZE & FORGE Hit Finding and Optimization Using BLAZE & FORGE Kevin Cusack,* Maria Argiriadi, Eric Breinlinger, Jeremy Edmunds, Michael Hoemann, Michael Friedman, Sami Osman, Raymond Huntley, Thomas Vargo AbbVie, Immunology

More information

Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data

Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data 2 Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data Peter Willett Abstract This chapter reviews the techniques available for quantifying the effectiveness

More information

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM i FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM A thesis submitted in fulfilment of the requirements for the award of the degree of Doctor of Philosophy (Computer

More information

MOLECULAR DIVERSITY IN DRUG DESIGN

MOLECULAR DIVERSITY IN DRUG DESIGN MOLECULAR DIVERSITY IN DRUG DESIGN MOLECULAR DIVERSITY IN DRUG DESIGN Edited by PHILIP M. DEAN and RICHARD A. LEWIS KLUWER ACADEMIC PUBLISHERS NEW YORK / BOSTON / DORDRECHT / LONDON / MOSCOW ebook ISBN:

More information

INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA

INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI A Thesis submitted in

More information

MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI. UNIVERSITI TEKNOLOGI MALAYSIAi

MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI. UNIVERSITI TEKNOLOGI MALAYSIAi 1 MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI UNIVERSITI TEKNOLOGI MALAYSIAi ii MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI

More information

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen Overview GRIND GRid-INDependent Descriptors

More information

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software CSD CSD-Enterprise Access the CSD and ALL CCDC application software CSD-Enterprise brings it all: access to the Cambridge Structural Database (CSD), the world s comprehensive and up-to-date database of

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Cheminformatics analysis and learning in a data pipelining environment

Cheminformatics analysis and learning in a data pipelining environment Molecular Diversity (2006) 10: 283 299 DOI: 10.1007/s11030-006-9041-5 c Springer 2006 Review Cheminformatics analysis and learning in a data pipelining environment Moises Hassan 1,, Robert D. Brown 1,

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver

More information

Fast similarity searching making the virtual real. Stephen Pickett, GSK

Fast similarity searching making the virtual real. Stephen Pickett, GSK Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation

More information

SYNTHESIS AND CHARACTERIZATION OF MESO-SUBSTITUTED PORPHYRIN MUHAMMAD TAHIR MUHAMMAD

SYNTHESIS AND CHARACTERIZATION OF MESO-SUBSTITUTED PORPHYRIN MUHAMMAD TAHIR MUHAMMAD SYNTHESIS AND CHARACTERIZATION OF MESO-SUBSTITUTED PORPHYRIN MUHAMMAD TAHIR MUHAMMAD A dissertation submitted in fulfilment of the requirements for the award of degree of Master of Science (Chemistry)

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

A Framework For Genetic-Based Fusion Of Similarity Measures In Chemical Compound Retrieval

A Framework For Genetic-Based Fusion Of Similarity Measures In Chemical Compound Retrieval A Framework For Genetic-Based Fusion Of Similarity Measures In Chemical Compound Retrieval Naomie Salim Faculty of Computer Science and Information Systems Universiti Teknologi Malaysia naomie@fsksm.utm.my

More information

MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA

MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR

More information

Structure-Based Drug Discovery An Overview

Structure-Based Drug Discovery An Overview Structure-Based Drug Discovery An Overview Edited by Roderick E. Hubbard University of York, Heslington, York, UK and Vernalis (R&D) Ltd, Abington, Cambridge, UK RSC Publishing Contents Chapter 1 3D Structure

More information

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös KIME-based scoring functions in Muse 3.0 KIME User Group Meeting 2013 Fabian Bös Certara Mission: End-to-End Model-Based Drug Development Certara was formed by acquiring and integrating Tripos, Pharsight,

More information

Solved and Unsolved Problems in Chemoinformatics

Solved and Unsolved Problems in Chemoinformatics Solved and Unsolved Problems in Chemoinformatics Johann Gasteiger Computer-Chemie-Centrum University of Erlangen-Nürnberg D-91052 Erlangen, Germany Johann.Gasteiger@fau.de Overview objectives of lecture

More information

Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility

Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Dr. Wendy A. Warr http://www.warr.com Warr, W. A. A Short Review of Chemical Reaction Database Systems,

More information

POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN

POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN A thesis submitted in fulfilment of the requirements for the award of the

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of

More information

The shortest path to chemistry data and literature

The shortest path to chemistry data and literature R&D SOLUTIONS Reaxys Fact Sheet The shortest path to chemistry data and literature Designed to support the full range of chemistry research, including pharmaceutical development, environmental health &

More information

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-aturwissenschaftlichen Fakultät der Rheinischen

More information

HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN

HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN A dissertation submitted in partial fulfilment of the requirements for the award of the degree of Master of Engineering

More information

ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI

ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI A project report submitted in partial fulfilment of the requirements for the award of the degree

More information

SmallWorld: Efficient Maximum Common Subgraph Searching of Large Chemical Databases

SmallWorld: Efficient Maximum Common Subgraph Searching of Large Chemical Databases SmallWorld: Efficient Maximum Common Subgraph Searching of Large Chemical Databases Roger Sayle, Jose Batista and Andrew Grant NextMove Software, Cambridge, UK AstraZeneca R&D, Alderley Park, UK 2d chemical

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

Introduction. Chemical Structure Graphs. Whitepaper

Introduction. Chemical Structure Graphs. Whitepaper Whitepaper Exact match searching Introduction Institutions find it valuable to maintain a registry of all the chemical substances that have prepared. The purpose of this registry is to determine whether

More information

Design and Synthesis of the Comprehensive Fragment Library

Design and Synthesis of the Comprehensive Fragment Library YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,

More information

Integrated Cheminformatics to Guide Drug Discovery

Integrated Cheminformatics to Guide Drug Discovery Integrated Cheminformatics to Guide Drug Discovery Matthew Segall, Ed Champness, Peter Hunt, Tamsin Mansley CINF Drug Discovery Cheminformatics Approaches August 23 rd 2017 Optibrium, StarDrop, Auto-Modeller,

More information

Imago: open-source toolkit for 2D chemical structure image recognition

Imago: open-source toolkit for 2D chemical structure image recognition Imago: open-source toolkit for 2D chemical structure image recognition Viktor Smolov *, Fedor Zentsev and Mikhail Rybalkin GGA Software Services LLC Abstract Different chemical databases contain molecule

More information

bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012

bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012 bcl::cheminfo Suite Enables Machine Learning-Based Drug Discovery Using GPUs Edward W. Lowe, Jr. Nils Woetzel May 17, 2012 Outline Machine Learning Cheminformatics Framework QSPR logp QSAR mglur 5 CYP

More information

ELIMINATION OF RAINDROPS EFFECTS IN INFRARED SENSITIVE CAMERA AHMAD SHARMI BIN ABDULLAH

ELIMINATION OF RAINDROPS EFFECTS IN INFRARED SENSITIVE CAMERA AHMAD SHARMI BIN ABDULLAH ELIMINATION OF RAINDROPS EFFECTS IN INFRARED SENSITIVE CAMERA AHMAD SHARMI BIN ABDULLAH A project report submitted in partial fulfillment of the requirements for the award of the degree of Master of Engineering

More information

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction

More information

Chemical Similarity Searching

Chemical Similarity Searching J. Chem. Inf. Comput. Sci. 1998, 38, 983-996 983 Chemical Similarity Searching Peter Willett* Krebs Institute for Biomolecular Research and Department of Information Studies, University of Sheffield, Sheffield

More information

FINITE ELEMENT METHOD FOR TWO-DIMENSIONAL ELASTICITY PROBLEM HANIMAH OTHMAN

FINITE ELEMENT METHOD FOR TWO-DIMENSIONAL ELASTICITY PROBLEM HANIMAH OTHMAN FINITE ELEMENT METHOD FOR TWO-DIMENSIONAL ELASTICITY PROBLEM HANIMAH OTHMAN A dissertation submitted in partial fulfilment of the requirements for the award of the degree of Master of Science (Mathematics)

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan Database Speaks Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan Email: liuu@chem.sinica.edu.tw 1 OUTLINES -- Personal experiences Publication types Secondary publication

More information

COMBINATORIAL CHEMISTRY: CURRENT APPROACH

COMBINATORIAL CHEMISTRY: CURRENT APPROACH COMBINATORIAL CHEMISTRY: CURRENT APPROACH Dwivedi A. 1, Sitoke A. 2, Joshi V. 3, Akhtar A.K. 4* and Chaturvedi M. 1, NRI Institute of Pharmaceutical Sciences, Bhopal, M.P.-India 2, SRM College of Pharmacy,

More information

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications

More information

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK Molecular Modelling Computational Chemistry Demystified Peter Bladon Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK John E. Gorton Gorton Systems, Glasgow, UK Robert B. Hammond Institute

More information

DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI

DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI A project report submitted in partial fulfillment of the requirements for the award of the degree of Master of Engineering

More information

MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA

MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK A thesis submitted in fulfilment

More information

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today Sign up for FREE GPU Test Drive on remotely hosted clusters www.nvidia.com/gputestd rive Shape Searching

More information

DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA

DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI A project

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

Introducing a Bioinformatics Similarity Search Solution

Introducing a Bioinformatics Similarity Search Solution Introducing a Bioinformatics Similarity Search Solution 1 Page About the APU 3 The APU as a Driver of Similarity Search 3 Similarity Search in Bioinformatics 3 POC: GSI Joins Forces with the Weizmann Institute

More information

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering Fragment based drug discovery in teams of medicinal and computational chemists Carsten Detering BioSolveIT Quick Facts Founded in 2001 by the developers of FlexX ~20 people Core expertise: docking, screening,

More information

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput

More information

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 Computational Methods and Drug-Likeness Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 The Problem Drug development in pharmaceutical industry: >8-12 years time ~$800m costs >90% failure

More information

In silico generation of novel, drug-like chemical matter using the LSTM deep neural network

In silico generation of novel, drug-like chemical matter using the LSTM deep neural network In silico generation of novel, drug-like chemical matter using the LSTM deep neural network Peter Ertl Novartis Institutes for BioMedical Research, Basel, CH September 2018 Neural networks in cheminformatics

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich KNIME UGM, Berlin, February 2015 The Schrödinger Extensions

More information

De Novo molecular design with Deep Reinforcement Learning

De Novo molecular design with Deep Reinforcement Learning De Novo molecular design with Deep Reinforcement Learning @olexandr Olexandr Isayev, Ph.D. University of North Carolina at Chapel Hill olexandr@unc.edu http://olexandrisayev.com About me Ph.D. in Chemistry

More information

Using AutoDock for Virtual Screening

Using AutoDock for Virtual Screening Using AutoDock for Virtual Screening CUHK Croucher ASI Workshop 2011 Stefano Forli, PhD Prof. Arthur J. Olson, Ph.D Molecular Graphics Lab Screening and Virtual Screening The ultimate tool for identifying

More information

October 6 University Faculty of pharmacy Computer Aided Drug Design Unit

October 6 University Faculty of pharmacy Computer Aided Drug Design Unit October 6 University Faculty of pharmacy Computer Aided Drug Design Unit CADD@O6U.edu.eg CADD Computer-Aided Drug Design Unit The development of new drugs is no longer a process of trial and error or strokes

More information

Receptor Based Drug Design (1)

Receptor Based Drug Design (1) Induced Fit Model For more than 100 years, the behaviour of enzymes had been explained by the "lock-and-key" mechanism developed by pioneering German chemist Emil Fischer. Fischer thought that the chemicals

More information

DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M.

DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M. ii DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M. MOHAMED A thesis submitted in partial fulfillment of the requirements

More information

The Case for Use Cases

The Case for Use Cases The Case for Use Cases The integration of internal and external chemical information is a vital and complex activity for the pharmaceutical industry. David Walsh, Grail Entropix Ltd Costs of Integrating

More information

Using Self-Organizing maps to accelerate similarity search

Using Self-Organizing maps to accelerate similarity search YOU LOGO Using Self-Organizing maps to accelerate similarity search Fanny Bonachera, Gilles Marcou, Natalia Kireeva, Alexandre Varnek, Dragos Horvath Laboratoire d Infochimie, UM 7177. 1, rue Blaise Pascal,

More information

SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM

SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM iii SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM A project report submitted in fulfilment of the requirements

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

The Electronic Representation of Chemical Structures: beyond the low hanging fruit

The Electronic Representation of Chemical Structures: beyond the low hanging fruit The Electronic Representation of Chemical Structures: beyond the low hanging fruit How Accelrys Plans to Address the Remaining Challenges in Structure Representation and Searching: Chemically Modified

More information

WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE AIRFOIL MAHDI HOZHABRI NAMIN UNIVERSITI TEKNOLOGI MALAYSIA

WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE AIRFOIL MAHDI HOZHABRI NAMIN UNIVERSITI TEKNOLOGI MALAYSIA WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE AIRFOIL MAHDI HOZHABRI NAMIN UNIVERSITI TEKNOLOGI MALAYSIA WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE

More information

Targeting protein-protein interactions: A hot topic in drug discovery

Targeting protein-protein interactions: A hot topic in drug discovery Michal Kamenicky; Maria Bräuer; Katrin Volk; Kamil Ödner; Christian Klein; Norbert Müller Targeting protein-protein interactions: A hot topic in drug discovery 104 Biomedizin Innovativ patientinnenfokussierte,

More information