FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

Similar documents
FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

SHAPE-BASED TWO DIMENSIONAL DESCRIPTOR FOR SEARCHING MOLECULAR DATABASE

COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR

EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA

WIND TUNNEL TEST TO INVESTIGATE TRANSITION TO TURBULENCE ON WIND TURBINE AIRFOIL MAHDI HOZHABRI NAMIN UNIVERSITI TEKNOLOGI MALAYSIA

MATHEMATICAL MODELLING OF UNSTEADY BIOMAGNETIC FLUID FLOW AND HEAT TRANSFER WITH GRAVITATIONAL ACCELERATION

POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN

INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA

ROCK SHAFT RESISTANCE OF BORED PILES SOCKETED INTO DECOMPOSED MALAYSIA GRANITE RAJA SHAHROM NIZAM SHAH BIN RAJA SHOIB UNIVERSITI TEKNOLOGI MALAYSIA

OPTIMIZATION OF CHROMIUM, NICKEL AND VANADIUM ANALYSIS IN CRUDE OIL USING GRAPHITE FURNACE ATOMIC ABSORPTION SPECTROSCOPY NURUL HANIS KAMARUDIN

EFFECTS OF Ni 3 Ti (DO 24 ) PRECIPITATES AND COMPOSITION ON Ni-BASED SUPERALLOYS USING MOLECULAR DYNAMICS METHOD

DEVELOPMENT OF PROCESS-BASED ENTROPY MEASUREMENT FRAMEWORK FOR ORGANIZATIONS MAHMOOD OLYAIY

NUMERICAL INVESTIGATION OF TURBULENT NANOFLUID FLOW EFFECT ON ENHANCING HEAT TRANSFER IN STRAIGHT CHANNELS DHAFIR GIYATH JEHAD

MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA

OPTICAL TWEEZER INDUCED BY MICRORING RESONATOR MUHAMMAD SAFWAN BIN ABD AZIZ

ARTIFICIAL NEURAL NETWORK AND KALMAN FILTER APPROACHES BASED ON ARIMA FOR DAILY WIND SPEED FORECASTING OSAMAH BASHEER SHUKUR

NUMERICAL SIMULATIONS ON THE EFFECTS OF USING VENTILATION FAN ON CONDITIONS OF AIR INSIDE A CAR PASSENGER COMPARTMENT INTAN SABARIAH BINTI SABRI

DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI

MULTISTAGE ARTIFICIAL NEURAL NETWORK IN STRUCTURAL DAMAGE DETECTION GOH LYN DEE

ROOFTOP MAPPING AND INFORMATION SYSTEM FOR RAINWATER HARVESTING SURAYA BINTI SAMSUDIN UNIVERSITI TEKNOLOGI MALAYSIA

A COMPUTATIONAL FLUID DYNAMIC FRAMEWORK FOR MODELING AND SIMULATION OF PROTON EXCHANGE MEMBRANE FUEL CELL HAMID KAZEMI ESFEH

EMPIRICAL STRENQTH ENVELOPE FOR SHALE NUR 'AIN BINTI MAT YUSOF

MATHEMATICAL MODELING FOR TSUNAMI WAVES USING LATTICE BOLTZMANN METHOD SARA ZERGANI. UNIVERSITI TEKNOLOGI MALAYSIAi

DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA

ENHANCED COPY-PASTE IMAGE FORGERY DETECTION BASED ON ARCHIMEDEAN SPIRAL AND COARSE-TO-FINE APPROACH MOHAMMAD AKBARPOUR SEKEH

A STUDY ON THE CHARACTERISTICS OF RAINFALL DATA AND ITS PARAMETER ESTIMATES JAYANTI A/P ARUMUGAM UNIVERSITI TEKNOLOGI MALAYSIA

COMMUTATIVITY DEGREES AND RELATED INVARIANTS OF SOME FINITE NILPOTENT GROUPS FADILA NORMAHIA BINTI ABD MANAF UNIVERSITI TEKNOLOGI MALAYSIA

INDIRECT TENSION TEST OF HOT MIX ASPHALT AS RELATED TO TEMPERATURE CHANGES AND BINDER TYPES AKRIMA BINTI ABU BAKAR

A GENERALIZED POWER-LAW MODEL OF BLOOD FLOW THROUGH TAPERED ARTERIES WITH AN OVERLAPPING STENOSIS HUDA SALMI BINTI AHMAD

BOUNDARY INTEGRAL EQUATION WITH THE GENERALIZED NEUMANN KERNEL FOR COMPUTING GREEN S FUNCTION FOR MULTIPLY CONNECTED REGIONS

MONTE CARLO SIMULATION OF NEUTRON RADIOGRAPHY 2 (NUR-2) SYSTEM AT TRIGA MARK II RESEARCH REACTOR OF MALAYSIAN NUCLEAR AGENCY

SYNTHESIS AND CHARACTERIZATION OF MESO-SUBSTITUTED PORPHYRIN MUHAMMAD TAHIR MUHAMMAD

ULTIMATE STRENGTH ANALYSIS OF SHIPS PLATE DUE TO CORROSION ZULFAQIH BIN LAZIM

CHARACTERISTICS OF SOLITARY WAVE IN FIBER BRAGG GRATING MARDIANA SHAHADATUL AINI BINTI ZAINUDIN UNIVERSITI TEKNOLOGI MALAYSIA

MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA

DENSITY FUNCTIONAL THEORY SIMULATION OF MAGNETISM DUE TO ATOMIC VACANCIES IN GRAPHENE USING SIESTA

DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M.

PREDICTION OF FREE FATTY ACID IN CRUDE PALM OIL USING NEAR INFRARED SPECTROSCOPY SITI NURHIDAYAH NAQIAH BINTI ABDULL RANI

SEDIMENTATION RATE AT SETIU LAGOON USING NATURAL. RADIOTRACER 210 Pb TECHNIQUE

OPTIMAL CONTROL BASED ON NONLINEAR CONJUGATE GRADIENT METHOD IN CARDIAC ELECTROPHYSIOLOGY NG KIN WEI UNIVERSITI TEKNOLOGI MALAYSIA

SOLUBILITY MODEL OF PALM OIL EXTRACTION FROM PALM FRUIT USING SUB-CRITICAL R134a NUR SYUHADA BINTI ABD RAHMAN

SYNTHESIS AND CATALYTIC PERFORMANCE OF PALLADIUM(II) HYDRAZONE COMPLEXES IN SUZUKI-MIYAURA CROSS-COUPLING REACTION

SYNTHESIS AND CHARACTERIZATION OF POLYACRYLAMIDE BASED HYDROGEL CONTAINING MAGNESIUM OXIDE NANOPARTICLES FOR ANTIBACTERIAL APPLICATIONS

UNIVERSITI PUTRA MALAYSIA

SHADOW AND SKY COLOR RENDERING TECHNIQUE IN AUGMENTED REALITY ENVIRONMENTS HOSHANG KOLIVAND UNIVERSITI TEKNOLOGI MALAYSIA

HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN

INFLUENCES OF GROUNDWATER, RAINFALL, AND TIDES ON BEACH PROFILES CHANGES AT DESARU BEACH FARIZUL NIZAM BIN ABDULLAH

COMPUTATIONAL STUDY OF PROTON TRANSFER IN RESTRICTED SULFONIC ACID FOR PROTON EXCHANGE MEMBRANE FUEL CELL SITI NADIAH BINTI MD AJEMAN

ANALYTICAL SOLUTIONS OF DISSIPATIVE HEAT TRANSFER ON THE PERISTALTIC FLOW OF NON-NEWTONIAN FLUIDS IN ASYMMETRIC CHANNELS

GENERATING MULTI-LEVEL OF DETAILS FOR THREE-DIMENSIONAL BUILDING MODELUSINGTERRESTRIAL LASER SCANNINGDATA RIZKA AKMALIA

COMPUTATIONAL STUDY OF TURBULENT UNCONFINED SWIRL FLAMES ANIS ATHIRAH BINTI MOHD AZLI

STABILITY OF TRIAXIAL WEAVE FABRIC COMPOSITES EMPLOYING FINITE ELEMENT MODEL WITH HOMOGENIZED CONSTITUTIVE RELATION NORHIDAYAH RASIN

ADSORPTION OF ARSENATE BY HEXADECYLPYRIDINIUM BROMIDE MODIFIED NATURAL ZEOLITE MOHD AMMARUL AFFIQ BIN MD BUANG UNIVERSITI TEKNOLOGI MALAYSIA

HIGH RESOLUTION DIGITAL ELEVATION MODEL GENERATION BY SEMI GLOBAL MATCHING APPROACH SITI MUNIRAH BINTI SHAFFIE

ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI

ANOLYTE SOLUTION GENERATED FROM ELECTROCHEMICAL ACTIVATION PROCESS FOR THE TREATMENT OF PHENOL

ADSORPTION AND STRIPPING OF ETHANOL FROM AQUEOUS SOLUTION USING SEPABEADS207 ADSORBENT

ISOLATION AND CHARACTERIZATION OF NANOCELLULOSE FROM EMPTY FRUIT BUNCH FIBER FOR NANOCOMPOSITE APPLICATION

MICROWAVE SYNTHESIS OF SODALITE FROM COAL FLY ASH AS SOLID BASE CATALYST FOR KNOEVENAGEL REACTION MOHD HILMI BIN MOHAMED

SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM

CLASSIFICATION OF MULTIBEAM SNIPPETS DATA USING STATISTICAL ANALYSIS METHOD LAU KUM WENG UNIVERSITITEKNOLOGI MALAYSIA

FLOOD MAPPING OF NORTHERN PENINSULAR MALAYSIA USING SAR IMAGES HAFSAT SALEH DUTSENWAI

MECHANICAL PROPERTIES OF CALCIUM CARBONATE AND COCONUT SHELL FILLED POLYPROPYLENE COMPOSITES MEHDI HEIDARI UNIVERSITI TEKNOLOGI MALAYSIA

AHMED MOKHTAR ALBSHIR BUDIEA

RAINFALL SERIES LIM TOU HIN

UNIVERSITI SAINS MALAYSIA. CPT115 Mathematical Methods for Computer Science [Kaedah Matematik bagi Sains Komputer]

COMPUTATIONAL MODELLING AND SIMULATION OF BIOMAGNETIC FLUID FLOW IN A STENOTIC AND ANEURYSMAL ARTERY NURSALASAWATI BINTI RUSLI

PRODUCTION OF POLYHYDROXYALKANATE (PHA) FROM WASTE COOKING OIL USING PSEUDOMONAS OLEOVORANS FARZANEH SABBAGH MOJAVERYAZDI

SYNTHESIS AND CHARACTERIZATION OF SUPRAMOLECULAR POLYMER BASED ON LINOLEIC ACID OF SUNFLOWER OIL MILI PURBAYA UNIVERSITI TEKNOLOGI MALAYSIA

THE DEVELOPMENT OF PORTABLE SENSOR TO DETERMINE THE FRESHNESS AND QUALITY OF FRUITS USING CAPACITIVE TECHNIQUE LAI KAI LING

MORPHOLOGICAL, GENETIC AND BIOLOGICAL STUDIES OF RED PALM WEEVIL, Rhynchophorus spp. (COLEOPTERA: CURCULIONIDAE) IN TERENGGANU, MALAYSIA

EFFECT OF GRAPHENE OXIDE (GO) IN IMPROVING THE PERFORMANCE OF HIGH VOLTAGE INSULATOR NUR FARAH AIN BINTI ISA UNIVERSITI TEKNOLOGI MALAYSIA

UNIVERSITI TEKNOLOGI MALAYSIA

EFFECT OF ROCK MASS PROPERTIES ON SKIN FRICTION OF ROCK SOCKET. YUSLIZA BINTI ALIAS UNIVERSITI TEKNOLOGI MALAYSIA

UNIVERSITI PUTRA MALAYSIA GENERATING MUTUALLY UNBIASED BASES AND DISCRETE WIGNER FUNCTION FOR THREE-QUBIT SYSTEM

Abstract. Glow discharges are used in a large number of applications, and it is one of the most

UNIVERSITI PUTRA MALAYSIA VARIABLE STEP VARIABLE ORDER BLOCK BACKWARD DIFFERENTIATION FORMULAE FOR SOLVING STIFF ORDINARY DIFFERENTIAL EQUATIONS

INTEGRATED MALAYSIAN METEOROLOGICAL DATA ATMOSPHERIC DISPERSION SOFTWARE FOR AIR POLLUTANT DISPERSION SIMULATION

SYNTHESIS, CHARACTERIZATION AND CATALYTIC ACTIVITY OF PALLADIUM(II) AROYLHYDRAZONE COMPLEXES IN MIZOROKI- HECK REACTION NUR HAFIZAH BT HASSAN

ERODABLE DAM BREACHING PATTERNS DUE TO OVERTOPPING NOR AIN BINTI MAT LAZIN UNIVERSITI TEKNOLOGI MALAYSIA

ENGINEERING PROPERTIES OF OLDER ALLUVIUM BADEE ABDULQAWI HAMOOD ALSHAMERI. Universiti Teknologi Malaysia

CPT115 Mathematical Methods for Computer Sciences [Kaedah Matematik bagi Sains Komputer]

UNIVERSITI SAINS MALAYSIA. CCS513 Computer Vision and Image Analysis [Penglihatan Komputer dan Analisis Imej]

Jawab soalan mengikut arahan yang diberikan dalam setiap bahagian. Questions should be answered according to the instructions given in each section.

AMPLIFICATION OF PARTIAL RICE FLORIGEN FROM MALAYSIAN UPLAND RICE CULTIVAR HITAM AND WAI ABDULRAHMAN MAHMOUD DOGARA UNIVERSITI TEKNOLOGI MALAYSIA

ELECTRODEPOSITION OF CARBOXYLATED MULTIWALL CARBON NANOTUBE ON GRAPHITE REINFORCEMENT CARBON FOR VOLTAMMETRY DETECTION OF CADMIUM

INSTRUCTION: This section consists of FOUR (4) structured questions. Answer TWO (2) questions. only.

SPECTROSCOPIC PROPERTIES OF Nd:YAG LASER PUMPED BY FLASHLAMP AT VARIOUS TEMPERATURES AND INPUT ENERGIES SEYED EBRAHIM POURMAND

Bab 7 : Carian. proses untuk memeriksa satu atau serangkaian unsur untuk mendapatkan data yang dicari. Carian / gelintaran

INSTRUCTION: This section consists of TWO (2) essay questions. Answer ALL questions.

A COMPARISO OF MODAL FLEXIBILITY METHOD A D MODAL CURVATURE METHOD I STRUCTURAL DAMAGE DETECTIO OOR SABRI A ZAHARUDI

DOSIMETRIC PROPERTIES OF GERMANIUM DOPED CALCIUM BORATE GLASS FOR USE AS PHOTON DOSIMETER

PREPARATION AND CHARACTERISATION OF KENAF BAST FIBRE REINFORCED RECYCLED POLYPROPYLENE/RECYCLED POLYAMIDE 6 COMPOSITES

SYNTHESIS AND LUMINESCENT PROPERTIES OF BIMETALLIC GOLD(I) AND SILVER(I) PYRAZOLATE COMPLEXES NURUL HUSNA BINTI SABRAN UNIVERSITI TEKNOLOGI MALAYSIA

UNSTEADY MAGNETOHYDRODYNAMICS FLOW OF A MICROPOLAR FLUID WITH HEAT AND MASS TRANSFER AURANGZAIB MANGI UNIVERSITI TEKNOLOGI MALAYSIA

SPRAY DRIED PRODIGIOSIN FROM Serratia marcescens AS A FOOD COLORANT SHAHLA NAMAZKAR UNIVERSITI TEKNOLOGI MALAYSIA

EFFECT OF FINES CONTENT AND PLASTICITY ON LIQUEFACTION SUSCEPTIBILITY OF SAND MATRIX SOILS TAN CHOY SOON

SOLAR RADIATION EQUATION OF TIME PUNITHA A/P MARIMUTHOO

INSTRUCTION: This section consists of FOUR (4) structured questions. Answer ALL questions.

Jawab soalan mengikut arahan yang diberikan dalam setiap bahagian. Answer the questions according to the instructions given in each section.

Transcription:

i FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM A thesis submitted in fulfilment of the requirements for the award of the degree of Doctor of Philosophy (Computer Science) Faculty of Computing Universiti Teknologi Malaysia FEBRUARY 2013

iii DEDICATION To my beloved father and mother, my wife and my sons

iv ACKNOWLEDGMENTS In the Name of Allah, Most Gracious, Most Merciful All praise and thanks are due to Allah, and peace and blessings be upon his messenger, Mohammed (peace be upon him). I am indebted to my advisor Professor Dr. Naomie Salim, for the outstanding motivation, guidance, support, and knowledge she has provided throughout the course of this work. She introduced me to the field of chemoinformatics and without her guidance and advice this study would not have been possible. She has been incredibly wise, helpful, understanding, and generous throughout the process. She has truly been a mentor and I owe here my deepest thanks. I have made many friends during my time in UTM and I thank them for their support and encouragement. Also I am extremely grateful to Dr. Ammar Abdo for his help and knowledge. A lot of information useful to the work was found via the World-Wide Web; I thank those who made their materials available by means of this medium and those who kindly answered back to my roll-calls of help sent over the World-Wide Web. I am extremely grateful to The Karary University for their generous financial support during this study. Finally, I would like to thank my parents, wife and my sons, Mohammed and Ayman, for their patience, encouragement, support and understanding.

v ABSTRACT Based on the molecular similarity principle, functionally similar molecules are sought by searching molecular databases for structurally similar molecules to be used in rational drug design. The conventional 2-dimentional similarity methods are the most used methods to measure similarity of molecules, including fragments that are not related to the biological activity of a molecule. The most common methods among the 2-dimentional similarity methods are the vector space model and the Bayesian networks, which are based on mutual independence between fragments. However, these methods do not consider the importance of fragments. In this thesis, four reweighting approaches are proposed to identify the important fragments. The first approach is based on reweighting the important fragments, where a set of active reference structures are used to reweight the fragments in the reference structure. Secondly, a statistically supervised features selection and minifingerprint to select only the important fragments are applied. In this approach, searching is carried out by using sub-fragments that represent the important ones. Thirdly, a similarity coefficient based on mutually dependent fuzzy correlation coefficient is used. The last approach combined the best two out of the three approaches which are reweighting factors and fragment selection based on statistically supervised features selection. The proposed approaches were tested on the MDL Data Drug Report standard data set. The overall results of this research showed that the proposed fragment reweighting approaches outperformed the conventional industry-standard Tanimoto-based similarity search approach.

vi ABSTRAK Berdasarkan prinsip persamaan molekul, molekul yang sama fungsi diperolehi dengan mencari molekul yang berstruktur sama dari pangkalan data molekul bagi kegunaan reka bentuk ubat secara rasional. Kaedah persamaan 2- dimensi konvensional telah digunakan secara paling meluas untuk mengukur kesamaan molekul termasuk fragmen yang tidak berkaitan dengan aktiviti biologi sesuatu molekul. Kaedah yang paling biasa digunakan antara kaedah-kaedah persamaan 2-dimensi adalah model ruang vektor dan rangkaian Bayesian yang berasaskan fragmen saling-bebas. Walau bagaimanapun, kaedah-kaedah ini tidak mengambil kira kepentingan fragmen. Dalam tesis ini, empat kaedah bobot semula telah dicadangkan untuk mengenal pasti fragmen-fragmen yang penting. Keadah pertama adalah berdasarkan bobot semula fragmen yang penting, iaitu satu set struktur rujukan aktif telah digunakan untuk bobot semula fragmen dalam struktur rujukan. Kedua, pemilihan ciri terselia secara statistik dan cap jari mini untuk memilih fragmen-fragmen yang penting telah digunakan. Dalam kaedah ini, pencarian dijalankan dengan menggunakan sub-fragmen yang penting. Ketiga, satu pekali persamaan berasaskan koefisien korelasi kabur yang saling bersandar telah digunakan. Kaedah terakhir menggabungkan dua daripada tiga kaedah terbaik iaitu faktor pemberatan semula dan pemilihan fragmen berdasarkan pemilihan ciri terselia secara statistik. Kaedah-kaedah yang dicadangkan telah diuji pada set data piawai MDL Drug Data Report. Keputusan keseluruhan kajian ini menunjukkan bahawa kaedah-kaedah bobot semula fragmen yang dicadangkan mengatasi kaedah piawai konvensional di dalam industri ini iaitu carian persamaan berasaskan Tanimoto.