Graph Kernels in Chemoinformatics

Similar documents
Kernels for small molecules

Research Article. Chemical compound classification based on improved Max-Min kernel

UniStra activities within the BigChem project:

Machine learning for ligand-based virtual screening and chemogenomics!

Taking into account interaction between stereocenters in a graph kernel framework

Linear Classifiers (Kernels)

Chapter 6: Classification

Multiple Kernel Self-Organizing Maps

HYPERGRAPH BASED SEMI-SUPERVISED LEARNING ALGORITHMS APPLIED TO SPEECH RECOGNITION PROBLEM: A NOVEL APPROACH

Discriminative Direction for Kernel Classifiers

Learning Methods for Linear Detectors

Support Vector Machine (SVM) and Kernel Methods

Outline. Motivation. Mapping the input space to the feature space Calculating the dot product in the feature space

Rchemcpp Similarity measures for chemical compounds. Michael Mahr and Günter Klambauer. Institute of Bioinformatics, Johannes Kepler University Linz

Statistical learning theory, Support vector machines, and Bioinformatics

Support Vector Machine (SVM) and Kernel Methods

Support Vector Machine (SVM) & Kernel CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2012

Package Rchemcpp. August 14, 2018

Machine Learning Concepts in Chemoinformatics

Data Mining in Bioinformatics Day 9: Graph Mining in Chemoinformatics

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Support Vector Machines.

CIS 520: Machine Learning Oct 09, Kernel Methods

Kernel Methods. Konstantin Tretyakov MTAT Machine Learning

CS798: Selected topics in Machine Learning

Kernel Methods. Konstantin Tretyakov MTAT Machine Learning

Kernel Methods & Support Vector Machines

Beyond the Point Cloud: From Transductive to Semi-Supervised Learning

Support Vector Machine (SVM) and Kernel Methods

Support Vector Machines (SVM) in bioinformatics. Day 1: Introduction to SVM

Support Vector Machines

The Representor Theorem, Kernels, and Hilbert Spaces

Linear vs Non-linear classifier. CS789: Machine Learning and Neural Network. Introduction

Kernels for Multi task Learning

A NETFLOW DISTANCE BETWEEN LABELED GRAPHS: APPLICATIONS IN CHEMOINFORMATICS

Course 10. Kernel methods. Classical and deep neural networks.

Translation invariant classification of non-stationary signals

Linear Classifiers as Pattern Detectors

PAC-learning, VC Dimension and Margin-based Bounds

Machine Learning : Support Vector Machines

Kernel Methods. Barnabás Póczos

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Kaggle.

A Linear Programming Approach for Molecular QSAR analysis

Kernel Learning via Random Fourier Representations

Improved Algorithms for Enumerating Tree-like Chemical Graphs with Given Path Frequency

Support Vector Machine

Recognition of On-Line Handwritten Commutative Diagrams

Spectral Generative Models for Graphs

Multiple kernel learning for multiple sources

A GENERAL FORMULATION FOR SUPPORT VECTOR MACHINES. Wei Chu, S. Sathiya Keerthi, Chong Jin Ong

The Laplacian PDF Distance: A Cost Function for Clustering in a Kernel Feature Space

Support Vector Machines

Kernel Methods. Foundations of Data Analysis. Torsten Möller. Möller/Mori 1

molecules ISSN

High-dimensional test for normality

SmallWorld: Efficient Maximum Common Subgraph Searching of Large Chemical Databases

SUPPORT VECTOR REGRESSION WITH A GENERALIZED QUADRATIC LOSS

Analysis of Multiclass Support Vector Machines

Spectral Clustering. by HU Pili. June 16, 2013

Mining Molecular Fragments: Finding Relevant Substructures of Molecules

Artificial Intelligence (AI) Common AI Methods. Training. Signals to Perceptrons. Artificial Neural Networks (ANN) Artificial Intelligence

Kernel Methods. Jean-Philippe Vert Last update: Jan Jean-Philippe Vert (Mines ParisTech) 1 / 444

Generalization to a zero-data task: an empirical study

Kronecker Decomposition for Image Classification

CS 188: Artificial Intelligence Spring Announcements

SPECTRAL CLUSTERING AND KERNEL PRINCIPAL COMPONENT ANALYSIS ARE PURSUING GOOD PROJECTIONS

Support Vector Machine & Its Applications

Support Vector Machines. CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington

Support Vector Machines

Approximate Kernel PCA with Random Features

Kernel methods, kernel SVM and ridge regression

Kernels and the Kernel Trick. Machine Learning Fall 2017

Kernel methods and the exponential family

Foundation of Intelligent Systems, Part I. SVM s & Kernel Methods

Classifier Complexity and Support Vector Classifiers

hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference

Support vector machines, Kernel methods, and Applications in bioinformatics

Neural Networks. Prof. Dr. Rudolf Kruse. Computational Intelligence Group Faculty for Computer Science

Few-shot learning with KRR

Pattern Recognition and Machine Learning. Perceptrons and Support Vector machines

Drawing Conclusions from Data The Rough Set Way

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Random Feature Maps for Dot Product Kernels Supplementary Material

Learning sets and subspaces: a spectral approach

Linear Classifiers IV

ML (cont.): SUPPORT VECTOR MACHINES

Kernel Methods in Machine Learning

Modeling High-Dimensional Discrete Data with Multi-Layer Neural Networks

Algorithm-Independent Learning Issues

Comparison and Enumeration of Chemical Graphs

SVMs: Non-Separable Data, Convex Surrogate Loss, Multi-Class Classification, Kernels

Inferring the Causal Decomposition under the Presence of Deterministic Relations.

Interactive Feature Selection with

A short introduction to supervised learning, with applications to cancer pathway analysis Dr. Christina Leslie

Chapter 9. Support Vector Machine. Yongdai Kim Seoul National University

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

Data Mining in the Chemical Industry. Overview of presentation

Support Vector Machines. Introduction to Data Mining, 2 nd Edition by Tan, Steinbach, Karpatne, Kumar

Neural networks and support vector machines

Transcription:

in hemoinformatics Luc Brun o authors: Benoit Gaüzère(MIVIA), Pierre Anthony Grenier(GREY), Alessia aggese(mivia), Didier Villemin (LMT) Bordeaux, November, 20 2014

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 1 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 1 / 32

Motivation yclic similarity onclusion tructural Pattern Recognition Rich description of objects Poor properties of graph s space does not allow to readily generalize/combine sets of graphs tatistical Pattern Recognition Global description of objects Numerical spaces with many mathematical properties (metric, vector space,... ). Motivation Analyse large famillies of structural and numerical objects using a unified framework based on pairwise similarity. L. Brun (GREY) eminar Bordeaux, November, 20 2014 2 / 32

hemoinformatics yclic similarity onclusion Usages of computer science for chemistry Prediction of molecular properties Prediction of biological activities Toxicity, cancerous... Prediction of physico-chemical properties. Boiling point, LogP, ability to capture 2... creening : ynthesis of molecule, Test and validation of the synthesized molecules Virtual creening : election of a limited set of candidate molecules, ynthesis and test of the selected molecules, Gain of Time and Money L. Brun (GREY) eminar Bordeaux, November, 20 2014 3 / 32

Molecular representation yclic similarity onclusion Molecular Graph: Vertice atoms. Edges Atomic bonds. l l H 3 H 3 P P H 3 L. Brun (GREY) eminar Bordeaux, November, 20 2014 4 / 32

Molecular representation yclic similarity onclusion Molecular Graph: Vertice atoms. Edges Atomic bonds. l l H 3 H 3 P P H 3 imilarity principle [Johnson and Maggiora, 1990] imilar molecules have similar properties. Design of similarity measures between molecular graphs. L. Brun (GREY) eminar Bordeaux, November, 20 2014 4 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 4 / 32

Kernel Theory yclic similarity onclusion Kernel k : X X R ymmetric: k(x i, x j ) = k(x j, x i ), For any dataset D = {x 1,..., x n }, Gram s matrix K R n n is defined by K i,j = k(x i, x j ). emi-definite positive: α R n, α T Kα 0 L. Brun (GREY) eminar Bordeaux, November, 20 2014 5 / 32

Kernel Theory yclic similarity onclusion Kernel k : X X R ymmetric: k(x i, x j ) = k(x j, x i ), For any dataset D = {x 1,..., x n }, Gram s matrix K R n n is defined by K i,j = k(x i, x j ). emi-definite positive: α R n, α T Kα 0 onnection with Hilbert spaces, reproducing kernel Hilbert paces [Aronszajn, 1950]: Relationship scalar product kernel : k(x, x ) = Φ(x), Φ(x ) H. L. Brun (GREY) eminar Bordeaux, November, 20 2014 5 / 32

yclic similarity onclusion Graph kernels in chemoinformatics Graph kernel k : G G R. imilarity measure between molecular graphs Natural encoding of a molecule Implicit embedding in H, Machine learning methods (VM, KRR,... ). L. Brun (GREY) eminar Bordeaux, November, 20 2014 6 / 32

yclic similarity onclusion Graph kernels in chemoinformatics Graph kernel k : G G R. imilarity measure between molecular graphs Natural encoding of a molecule Implicit embedding in H, Machine learning methods (VM, KRR,... ). Natural connection between molecular graphs and machine learning methods Definition of a similarity measure as a kernel L. Brun (GREY) eminar Bordeaux, November, 20 2014 6 / 32

yclic similarity onclusion Kernels based on bags of patterns Graph kernels based on bags of patterns (1) Extraction of a set of patterns from graphs, (2) omparison between patterns, (3) omparison between bags of patterns. L. Brun (GREY) eminar Bordeaux, November, 20 2014 7 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 7 / 32

Treelets yclic similarity onclusion et of labeled subtrees with at most 6 nodes : Encode branching points Take labels into account. Pertinent from a chemical point of view limited set of substructures Linear complexity (in (nd 5 )). L. Brun (GREY) eminar Bordeaux, November, 20 2014 8 / 32

enumaration yclic similarity onclusion Labelling ode encoding the labels, Defined on each of the 14 structure of treelet, omputation in constant time. H 3 H 3 H 3 L. Brun (GREY) eminar Bordeaux, November, 20 2014 9 / 32

enumaration yclic similarity onclusion Labelling ode encoding the labels, Defined on each of the 14 structure of treelet, omputation in constant time. H 3 H 3 H 3 anonical code : type of treelet, } canonical code : G 9-11111 Labels. t t code(t) = code(t ) L. Brun (GREY) eminar Bordeaux, November, 20 2014 9 / 32

Definition of the Treelet kernel yclic similarity onclusion ounting treelet s occurences f t (G) : Number of occurences of treelet t in G. L. Brun (GREY) eminar Bordeaux, November, 20 2014 10 / 32

Definition of the Treelet kernel yclic similarity onclusion ounting treelet s occurences f t (G) : Number of occurences of treelet t in G. Treelet kernel k T (G, G ) = k(f t (G), f t (G )) t T (G) T (G ) T (G): et of treelets of G. k(.,.): Kernel between real numbers. L. Brun (GREY) eminar Bordeaux, November, 20 2014 10 / 32

Definition of the Treelet kernel yclic similarity onclusion ounting treelet s occurences f t (G) : Number of occurences of treelet t in G. Treelet kernel k T (G, G ) = k t (G, G ) T (G) : et of treelets of G. k t (G, G ) = k(f t (G), f t (G ). k t (.,.) : imilarity according to t. t T (G) T (G ) L. Brun (GREY) eminar Bordeaux, November, 20 2014 10 / 32

Pattern selection yclic similarity onclusion Prediction problems: Different properties Different causes. L. Brun (GREY) eminar Bordeaux, November, 20 2014 11 / 32

Pattern selection yclic similarity onclusion Prediction problems: Different properties Different causes. hemist approach: Pattern relevant for a precise property A priori selection by a chemical expert L. Brun (GREY) eminar Bordeaux, November, 20 2014 11 / 32

Pattern selection yclic similarity onclusion Prediction problems: Different properties Different causes. hemist approach: Pattern relevant for a precise property A priori selection by a chemical expert Machine learning approach: Parcimonious set of relevant treelets election a posteriori induced by data. L. Brun (GREY) eminar Bordeaux, November, 20 2014 11 / 32

Multiple kernel learning yclic similarity onclusion ombination of kernels [Rakotomamonjy et al., 2008] : k(g, G ) = t T k t (G, G ) L. Brun (GREY) eminar Bordeaux, November, 20 2014 12 / 32

Multiple kernel learning yclic similarity onclusion ptimal ombination of kernels [Rakotomamonjy et al., 2008] : k W (G, G ) = t T w t k t (G, G ) T : et of treelets extracted from the learning dataset. w t [0, 1] : Measure the influence of t Weighing of each sub kernel election of relevant treelets Weighs computed for a given property. L. Brun (GREY) eminar Bordeaux, November, 20 2014 12 / 32

Experiments yclic similarity onclusion Prediction of the boiling point of acyclic molecules: 183 acyclic molecules : Learning set: 80 % Validation set: 10 % 10 Test set: 10 % election of optimal parameters using a grid search RME obtained on the 10 test sets. L. Brun (GREY) eminar Bordeaux, November, 20 2014 13 / 32

Experiments yclic similarity onclusion Prediction of the boiling point of acyclic molecules: Méthode RME ( ) Descriptor based approach [herqaoui et al., 1994]* 5,10 Empirical embedding [Riesen, 2009] 10,15 Gaussian kernel based on the graph edit distance [Neuhaus and Bunke, 2007] 10,27 Kernel on paths [Ralaivola et al., 2005] 12,24 Kernel on random walks [Kashima et al., 2003] 18,72 Kernel on tree patterns [Mahé and Vert, 2009] 11,02 Weisfeiler-Lehman Kernel[hervaszide, 2012] 14,98 Treelet kernel 6,45 Treelet kernel with MKL 4,22 * leave-one-out L. Brun (GREY) eminar Bordeaux, November, 20 2014 14 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 14 / 32

yclic molecular information yclic similarity onclusion ycles of a molecule Have a large influence on molecular properties. Identifies some molecular s families. N (a) Benzène (b) Indole (c) yclohexane yclic information must be taken into account. L. Brun (GREY) eminar Bordeaux, November, 20 2014 15 / 32

Relevant ycles yclic similarity onclusion Enumeration of all cycles is NP complete ycles of a graph form a vector space on Z 2Z. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 16 / 32

Relevant ycles yclic similarity onclusion Enumeration of relevant cycles [Vismara, 1995]: Union of all cyclic bases with a minimal size, Unique for each graph, Enumeration in polynomial time, Kernel on relevant cycles [Horváth, 2005]. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 17 / 32

Relevant cycles Graph yclic similarity onclusion Graph of relevant cycles G (G) [Vismara, 1995] : Representation of the cyclic system, Vertices: Relevant cycles, Edges: intersection between relevant cycles. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 18 / 32

Relevant cycles Graph yclic similarity onclusion Graph of relevant cycles G (G) [Vismara, 1995] : Representation of the cyclic system, Vertices: Relevant cycles, Edges: intersection between relevant cycles. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 18 / 32

Relevant cycles Graph yclic similarity onclusion Graph of relevant cycles G (G) [Vismara, 1995] : Representation of the cyclic system, Vertices: Relevant cycles, Edges: intersection between relevant cycles. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 18 / 32

Relevant cycles Graph yclic similarity onclusion imilarity between relevant cycle graphs Enumeration of relevant cycle graph s treelets. Adjacency relationship between relevant cycles. Extracted Patterns : HN HN H olely cyclic information. L. Brun (GREY) eminar Bordeaux, November, 20 2014 19 / 32

Relevant cycle Hypergraph yclic similarity onclusion onnexion between cycles and acyclic parts Idea : Molecular Representation: Global, Explicit cyclic Information Addition of acyclic parts to the relevant cycle graph. HN HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 20 / 32

Relevant cycle Hypergraph yclic similarity onclusion onnexion between cycles and acyclic parts Idea : Molecular Representation: Global, Explicit cyclic Information Addition of acyclic parts to the relevant cycle graph. HN HN HN H HN H L. Brun (GREY) eminar Bordeaux, November, 20 2014 20 / 32

Relevant cycle Hypergraph yclic similarity onclusion Problem : An atomic bound can connect an atom to two cycles. N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 21 / 32

Relevant cycle Hypergraph yclic similarity onclusion onnexion between cycles and acyclic parts hypergraph representation of the molecule. Adjacency relationship: oriented hyper-edges. N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 22 / 32

Relevant cycle Hypergraph yclic similarity onclusion onnexion between cycles and acyclic parts hypergraph representation of the molecule. Adjacency relationship: oriented hyper-edges. N N Hypergraph comparison. L. Brun (GREY) eminar Bordeaux, November, 20 2014 22 / 32

yclic similarity onclusion Treelets on relevant cycle hypergraphs Adaptation of the treelet kernel: (1) enumeration of the relevant cycle hypergraph s treelets without hyper-edges. N L. Brun (GREY) eminar Bordeaux, November, 20 2014 23 / 32

yclic similarity onclusion Treelets on relevant cycle hypergraphs Adaptation of the treelet kernel: (1) enumeration of the relevant cycle hypergraph s treelets without hyper-edges. N L. Brun (GREY) eminar Bordeaux, November, 20 2014 23 / 32

yclic similarity onclusion Treelets on relevant cycle hypergraphs Adaptation of the treelet kernel: (1) enumeration of the relevant cycle hypergraph s treelets without hyper-edges. N N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 23 / 32

yclic similarity onclusion Treelets on relevant cycle hypergraphs Adaptation of the treelet kernel: (1) enumeration of the relevant cycle hypergraph s treelets without hyper-edges. (2) ontraction of cycles incident to each hyper-edge. onstruction of the reduced relevant cycle hypergraph G R (G). N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 23 / 32

yclic similarity onclusion Treelets on relevant cycle hypergraphs Adaptation of the treelet kernel: (1) enumeration of the relevant cycle hypergraph s treelets without hyper-edges. (2) ontraction of cycles incident to each hyper-edge. onstruction of the reduced relevant cycle hypergraph G R (G). (3) Extraction of treelets containing an initial hyper-edge N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 23 / 32

yclic similarity onclusion Kernel on relevant cycle hypergraphs Bag of patterns: T R (G) = T 1 T 2 T 1 : et of treelets obtained from the hypergraph without hyper-edges. T 2 : et of treelets extracted from G R (G). N N N N N L. Brun (GREY) eminar Bordeaux, November, 20 2014 24 / 32

yclic similarity onclusion Kernel on relevant cycle hypergraphs Bag of patterns: T R (G) = T 1 T 2 T 1 : et of treelets obtained from the hypergraph without hyper-edges. T 2 : et of treelets extracted from G R (G). Kernel design: k T (G, G ) = t T R (G) T R (G ) w t k t (f t (G), f t (G )) L. Brun (GREY) eminar Bordeaux, November, 20 2014 24 / 32

Experiments yclic similarity onclusion Predictive Toxicity hallenge Molecular graph labelled and composed of cycles. 340 molecules for each dataset 4 classes of animals (MM, FM, MR, FR) : Learning set 310 molécules, } 10 Test set 34 molecules, Number of molecules correctly classified on the 10 test sets. Méthode MM FM MR FR Treelet kernel (TK) 208 205 209 212 Relevant cycle kernel 209 207 202 228 TK on relevant cycle graph (T) 211 210 203 232 TK on relevant cycle hypergraph (TH) 217 224 207 233 L. Brun (GREY) eminar Bordeaux, November, 20 2014 25 / 32

Experiments yclic similarity onclusion Predictive Toxicity hallenge Méthode MM FM MR FR Treelet kernel (TK) 208 205 209 212 Relevant cycle kernel 209 207 202 228 TK on relevant cycle graph (T) 211 210 203 232 TK on relevant cycle hypergraph (TH) 217 224 207 233 TK + MKL 218 224 224 250 T + MKL 216 213 212 237 TH + MKL 225 229 215 239 L. Brun (GREY) eminar Bordeaux, November, 20 2014 26 / 32

Experiments yclic similarity onclusion Méthode MM FM MR FR Treelet kernel (TK) 208 205 209 212 Relevant cycle kernel 209 207 202 228 TK on relevant cycle graph (T) 211 210 203 232 TK on relevant cycle hypergraph (TH) 217 224 207 233 TK + MKL 218 224 224 250 TH + MKL 225 229 215 239 ombo TK - TH 225 230 224 252 Gaussian kernel on graph edit distance [Neuhaus and Bunke, 2007] 223 212 194 234 Laplacien kernel 207 186 209 203 Kernel on paths [Ralaivola et al., 2005]* 223 225 226 234 Weisfeiler-Lehman Kernel [hervaszide, 2012] 138 154 221 242 * leave-one-out L. Brun (GREY) eminar Bordeaux, November, 20 2014 27 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 27 / 32

onclusion yclic similarity onclusion General problem: Kernel between molecular graphs including cyclic and acyclic patterns in order to predict molecular properties. : yclic Information: Encoding the relative positioning of atoms R 1 R 1 R 3 Taking into account stereoisometry. R 2 R 3 R 2 L. Brun (GREY) eminar Bordeaux, November, 20 2014 28 / 32

Plan yclic similarity onclusion 1 2 3 4 yclic similarity 5 onclusion 6 L. Brun (GREY) eminar Bordeaux, November, 20 2014 28 / 32

Kernel between shapes yclic similarity onclusion a) le graphe ompute skeletons Encode skeletons by graphs, Kernel between bags of treelets. b) la segmentation induite L. Brun (GREY) eminar Bordeaux, November, 20 2014 29 / 32

Kernel on strings yclic similarity onclusion Decompose the scene into zones, ompute a string encoding the sequence of traversed zones, Define a kernel between strings, luster strings. 60 50 40 30 20 10 2 4 5 6 8 9 7 10 1 3 12 11 14 15 13 28 27 16 29 1917 23 21 26 30 2524 22 2018 L. Brun (GREY) eminar Bordeaux, November, 20 2014 30 / 32

Bibliographie I yclic similarity onclusion Aronszajn, N. (1950). Theory of reproducing kernels. Transactions of the American Mathematical ociety, 68(3):337 404. herqaoui, D., Villemin, D., Mesbah, A., ense, J. M., and Kvasnicka, V. (1994). Use of a Neural Network to Determine the Normal Boiling Points of Acyclic Ethers, Peroxides, Acetals and their ulfur Analogues. J. hem. oc. Faraday Trans., 90:2015 2019. Horváth, T. (2005). yclic pattern kernels revisited. In pringer-verlag, editor, Proceedings of the 9th Pacific-Asia onference on Knowledge Discovery and Data Mining, volume 3518, pages 791 801. L. Brun (GREY) eminar Bordeaux, November, 20 2014 30 / 32

Bibliographie II yclic similarity onclusion Johnson, M. A. and Maggiora, G. M., editors (1990). oncepts and Applications of Molecular imilarity. Wiley. Kashima, H., Tsuda, K., and Inokuchi, A. (2003). Marginalized Kernels Between Labeled Graphs. Machine Learning. Mahé, P. and Vert, J.-p. (2009). Graph kernels based on tree patterns for molecules. Machine Learning, 75(1)(eptember 2008):3 35. Neuhaus, M. and Bunke, H. (2007). Bridging the gap between graph edit distance and kernel machines, volume 68 of eries in Machine Perception and Artificial Intelligence. World cientific Publishing. L. Brun (GREY) eminar Bordeaux, November, 20 2014 31 / 32

Bibliographie III yclic similarity onclusion Rakotomamonjy, A., Bach, F., anu,., and Grandvalet, Y. (2008). implemkl. Journal of Machine Learning Research, 9:2491 2521. Ralaivola, L., wamidass,. J., aigo, H., and Baldi, P. (2005). Graph kernels for chemical informatics. Neural networks : the official journal of the International Neural Network ociety, 18(8):1093 110. Riesen, K. (2009). lassification and lustering of Vector pace Embedded Graphs. PhD thesis, Institut für Informatik und angewandte Mathematik Universität Bern. hervaszide, N. (2012). calable. PhD thesis, Universität Tübingen. L. Brun (GREY) eminar Bordeaux, November, 20 2014 32 / 32

Bibliographie IV yclic similarity onclusion Vismara, P. (1995). Reconnaissance et représentation d éléments structuraux pour la description d objets complexes. Application à l élaboration de stratégies de synthèse en chimie organique. PhD thesis, Université Montpellier II. L. Brun (GREY) eminar Bordeaux, November, 20 2014 33 / 32