On the Mining of Numerical Data with Formal Concept Analysis

Size: px
Start display at page:

Download "On the Mining of Numerical Data with Formal Concept Analysis"

Transcription

1 On the Mining of Numerical Data with Formal Concept Analysis Thèse de doctorat en informatique Mehdi Kaytoue 22 April 2011 Amedeo Napoli Sébastien Duplessis

2 Somewhere... in a temperate forest... N 2 / 40

3 Context A biological problem : How does symbiosis work at the cellular level? Analyse biological processes Find genes involved in symbiosis Choose a model for understanding symbiosis: Laccaria bicolor Analysing Gene Expression Data (GED) F. Martin et al. The Genome of Laccaria Bicolor Provides Insights into Mycorrhizal Symbiosis. In Nature., / 40

4 Context Gene expression data (GED) A numerical dataset, or data-table with genes in rows biological situations in columns expression value of a gene in row for the situation in column. A row denotes the expression profile of a gene (GEP) Biological hypothesis m 1 m 2 m 3 g g g g g A group of genes having a similar expression profile interact together within the same biological process 4 / 40

5 Context With very large datasets... Gene expression data of Laccaria bicolor 22,294 genes 3 types of biological situations reflecting cells of the organism in various stages of its biological cycle: free living mycelium symbiotic tissues fruiting bodies Attribute values ranged in [0, 65000] 5 / 40

6 Context Knowledge discovery in databases An iterative and interactive process U. Fayyad, G. Piatetsky-Shapiro and P. Smyth The KDD process for Extracting Useful Knowledge from Volumes of Data. In Commun. ACM., / 40

7 Context Mining gene expression data Extracting (maximal) rectangles in numerical data A set of genes co-expressed in some biological situations Local patterns: biological processes may be activated in some situations only Overlapping patterns: a gene may be involved in several biological process m 1 m 2 m 3 m 4 m 5 g g g g Biclustering: A difficult problem relying on heuristics R. Peeters The Maximum Edge Biclique Problem is NP-Complete. In Discrete Applied Math., vol. 131, no. 3., / 40

8 Context Core of the thesis Mining gene expression data with formal concept analysis Turning GED into binary, encoding over/under expression Bringing the problem into well-known settings Allowing a complete and mathematically well defined approach Exploiting algorithms and tools m 1 m 2 m 3 m 4 m 5 g g g g m 1 m 2 m 3 m 4 m 5 g g g g Can we work with FCA directly on numerical data? 8 / 40

9 Context Core of the thesis Mining gene expression data with formal concept analysis Turning GED into binary, encoding over/under expression Bringing the problem into well-known settings Allowing a complete and mathematically well defined approach Exploiting algorithms and tools m 1 m 2 m 3 m 4 m 5 g g g g m 1 m 2 m 3 m 4 m 5 g 1 g 2 g 3 g 4 Can we work with FCA directly on numerical data? 8 / 40

10 Context Outline 1 Context 2 Formal Concept Analysis 3 Contributions Interval pattern structures Introducing similarity A KDD-oriented discussion 4 Conclusion and perspectives 9 / 40

11 Formal Concept Analysis A binary table as a formal context Given by (G, M, I ) with G a set of objects M a set of attributes I a binary relation between objects and attributes: (g, m) I means that object g owns attribute m m 1 m 2 m 3 g 1 g 2 g 3 g 4 g 5 G = {g 1,..., g 5 } M = {m 1, m 2, m 3 } (g 1, m 3 ) I B. Ganter and R. Wille Formal Concept Analysis. In Springer, Mathematical foundations., / 40

12 Formal Concept Analysis A maximal rectangle as a formal concept A Galois connection to characterize formal concepts A = {m M g A G : (g, m) I } B = {g G m B M : (g, m) I } (A, B) is a concept with extent A = B and intent B = A {g 3 } = {m 2, m 3 } {m 2, m 3 } = {g 3, g 4, g 5 } m 1 m 2 m 3 g 1 g 2 g 3 g 4 g 5 ({g 3, g 4, g 5 }, {m 2, m 3 }) is a formal concept 11 / 40

13 Formal Concept Analysis Concept lattice Ordered set of concepts... (A 1, B 1 ) (A 2, B 2 ) A 1 A 2 ( B 2 B 1 ) ({g 1, g 5 }, {m 1, m 3 }) ({g 1, g 2, g 5 }, {m 1 })... with interesting properties Maximality of concepts as rectangles Overlapping of concepts Specialization/generalisation hierarchy Synthetic representation of the data without loss of information 12 / 40

14 Formal Concept Analysis Handling numerical data with FCA? Initial problem Extracting groups of genes with similar numerical values Conceptual scaling (discretization or binarization) An object has an attribute if its value lies in a predefined interval m 1 m 2 m 3 g g g g g m 1, [4, 5] m 2, [4, 7] m 3, [5, 6] g 1 g 2 g 3 g 4 g 5 Different scalings: different interpretations of the data General problem of the thesis How to directly build a concept lattice from numerical data? 13 / 40

15 1 Context 2 Formal Concept Analysis 3 Contributions Interval pattern structures Introducing similarity A KDD-oriented discussion 4 Conclusion and perspectives

16 Contributions Interval pattern structures How to handle complex descriptions An intersection as a similarity operator behaves as similarity operator {m 1, m 2 } {m 1, m 3 } = {m 1 } induces an ordering relation N O = N N O {m 1 } {m 1, m 2 } = {m 1 } {m 1 } {m 1, m 2 } has the properties of a meet in a semi lattice, a commutative, associative and idempotent operation c d = c c d A. Tversky Features of similarity. In Psychological Review, 84 (4), / 40

17 Contributions Interval pattern structures Pattern structure Given by (G, (D, ), δ) G a set of objects (D, ) a semi-lattice of descriptions or patterns δ a mapping such as δ(g) D describes object g A Galois connection A = δ(g) g A for A G d = {g G d δ(g)} for d (D, ) B. Ganter and S. O. Kuznetsov Pattern Structures and their Projections. In International Conference on Conceptual Structures, / 40

18 Contributions Interval pattern structures Numerical data are pattern structures Interval pattern structures m 1 m 2 m 3 g g g g g {g 1, g 2 } = g {g 1,g 2 } δ(g) = 5, 7, 6 6, 8, 4 = [5, 6], [7, 8], [4, 6] [5, 6], [7, 8], [4, 6] = {g G [5, 6], [7, 8], [4, 6] δ(g)} = {g 1, g 2, g 5 } ({g 1, g 2, g 5 }, [5, 6], [7, 8], [4, 6] ) is a (pattern) concept 17 / 40

19 Contributions Interval pattern structures Interval pattern concept lattice Lowest concepts: few objects, small intervals Highest concepts: many objects, large intervals Concept/pattern overwhelming 18 / 40

20 Contributions Interval pattern structures Links with conceptual scaling Interordinal scaling [Ganter & Wille] A scale to encode intervals of attribute values m 1 4 m 1 5 m 1 6 m 1 4 m 1 5 m Equivalent concept lattice Example ({g 1, g 2, g 5 }, {m 1 6, m 1 4, m 1 5,...,... }) ({g 1, g 2, g 5 }, [5, 6],...,... ) Why should we use pattern structures as we have scaling? Processing a pattern structure is more efficient 19 / 40

21 Contributions Introducing similarity Outline 1 Context 2 Formal Concept Analysis 3 Contributions Interval pattern structures Introducing similarity A KDD-oriented discussion 4 Conclusion and perspectives 20 / 40

22 Contributions Introducing similarity Introducing a similarity relation Grouping in a same concept objects having similar values? A natural similarity relation on numbers a θ b a b θ e.g Similarity operator in pattern structures [4,6] [4,5] [5,6] How to consider a similarity relation w.r.t. a distance? 21 / 40

23 Contributions Introducing similarity Introducing a similarity relation Grouping in a same concept objects having similar values? A natural similarity relation on numbers a θ b a b θ e.g Similarity operator in pattern structures [4,6] [4,5] [5,6] θ = How to consider a similarity relation w.r.t. a distance? 21 / 40

24 Contributions Introducing similarity Introducing a similarity relation Grouping in a same concept objects having similar values? A natural similarity relation on numbers a θ b a b θ e.g Similarity operator in pattern structures [4,6] [4,5] [5,6] θ = How to consider a similarity relation w.r.t. a distance? 21 / 40

25 Contributions Introducing similarity Introducing a similarity relation Grouping in a same concept objects having similar values? A natural similarity relation on numbers a θ b a b θ e.g Similarity operator in pattern structures [4,6] [4,5] [5,6] θ = 0 How to consider a similarity relation w.r.t. a distance? 21 / 40

26 Contributions Introducing similarity Towards a similarity between values Introduce an element (D, ) denoting dissimilarity Example with θ = 1 c d = iff c θ d c d iff c θ d m 1 m 2 m 3 g g g g g {g 3, g 4 } = [4, 4], [8, 9], [4, 4], [8, 9], = {g 3, g 4 } ({g 3, g 4 }, [4, 4], [8, 9], ) is a concept: g 3 and g 4 have similar values for attributes m 1 and m 2 only 22 / 40

27 Contributions Introducing similarity Towards a similarity between values Introduce an element (D, ) denoting dissimilarity Example with θ = 1 c d = iff c θ d c d iff c θ d m 1 m 2 m 3 g g g g g {g 3, g 4 } = [4, 4], [8, 9], [4, 4], [8, 9], = {g 3, g 4 } ({g 3, g 4 }, [4, 4], [8, 9], ) is a concept: g 3 and g 4 have similar values for attributes m 1 and m 2 only Is {g 3, g 4 } maximal w.r.t. similarity? We can add g / 40

28 Contributions Introducing similarity Classes of tolerance in numerical data Towards maximal sets of similar values θ a tolerance relation : reflexive, symmetric, not transitive Consider an attribute taking values in {6, 8, 11, 16, 17} and θ = , but A class of tolerance as a maximal set of pairwise similar values {6, 8, 11} {11, 16} {16, 17} [6, 11] [11, 16] [16, 17] S. O. Kuznetsov Galois Connections in Data Analysis: Contributions from the Soviet Era and Modern Russian Research. In Formal Concept Analysis, Foundations and Applications, / 40

29 Contributions Introducing similarity Tolerance in pattern structures Projecting the pattern structure Each value is replaced by the interval characterizing its class of tolerance (if unique) Each pattern d is projected with a mapping ψ(d) d (pre-processing) Example with θ = 1 m 1 m 2 m 3 g g g g g {g 3, g 4 } = ψ( [4, 4], [8, 9], ) = [4, 5], [8, 9], [4, 5], [8, 9], = {g 3, g 4, g 5 } 24 / 40

30 Contributions Introducing similarity Biological results An extracted pattern among 2, 150 others Genes present a high expression level in the fruit-body situations Some of these genes encode metabolic enzymes in remobilization of fungal resources towards the new organ in development Other genes are unknown but specific to Laccaria Bicolor: it requires biological experiments 25 / 40

31 Contributions Introducing similarity Relevant publications Interval pattern structures and GED analysis M. Kaytoue, S. Duplessis, S. O. Kuznetsov, and A. Napoli Two FCA-Based Methods for Mining Gene Expression Data. In International Conference on Formal Concept Analysis (ICFCA), M. Kaytoue, S. O. Kuznetsov, A. Napoli and S. Duplessis Mining Gene Expression Data with Pattern Structures in Formal Concept Analysis. In Information Sciences. Spec. Iss.: Lattices (Elsevier), Introducing tolerance relations and information fusion M. Kaytoue, Z. Assaghir, N. Messai and A. Napoli Two Complementary Classification Methods for Designing a Concept Lattice from Interval Data. In Foundations of Information and Knowledge Systems, 6th International Symposium (FoIKS), M. Kaytoue, Z. Assaghir, A. Napoli and S. O. Kuznetsov Embedding Tolerance Relations in Formal Concept Analysis: an Application in Information Fusion. In ACM Conference on Information and Knowledge Management (CIKM), / 40

32 Contributions Other works Pattern structures are useful for several tasks Bi-clustering and tolerance relations M. Kaytoue, S. O. Kuznetsov, and A. Napoli Biclustering Numerical Data in Formal Concept Analysis. In International Conference on Formal Concept Analysis (ICFCA), Information fusion: enhancing decision making Z. Assaghir, M. Kaytoue, A. Napoli and H. Prade Managing Information Fusion with Formal Concept Analysis. In Modeling Decisions for Artificial Intelligence, 6th International Conference (MDAI), KDD: a study of equivalence classes of interval patterns M. Kaytoue, S. O. Kuznetsov, and A. Napoli Revisiting Numerical Pattern Mining with Formal Concept Analysis. In International Joint Conference on Artificial Intelligence (IJCAI), / 40

33 Contributions A KDD-oriented discussion Outline 1 Context 2 Formal Concept Analysis 3 Contributions Interval pattern structures Introducing similarity A KDD-oriented discussion 4 Conclusion and perspectives 28 / 40

34 Contributions A KDD-oriented discussion Interval pattern search space Counting all possible interval patterns [a m1, b m1 ], [a m2, b m2 ],... where a mi, b mi W mi m 1 m 2 m 3 g g g g g i {1,..., M } W mi ( W mi + 1) possible interval patterns in our small example 29 / 40

35 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) 5 δ(g 3 ) δ(g 5 ) 4 δ(g 2 ) m 1 30 / 40

36 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } 5 δ(g 3 ) δ(g 5 ) 4 δ(g 2 ) m 1 30 / 40

37 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } [4, 5], [5, 7] = {g 1, g 3, g 5 } 5 4 δ(g 3 ) δ(g 5 ) δ(g 2 ) m 1 30 / 40

38 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } [4, 5], [5, 7] = {g 1, g 3, g 5 } [4, 6], [5, 6] = {g 1, g 3, g 5 } 5 4 δ(g 3 ) δ(g 5 ) δ(g 2 ) m 1 30 / 40

39 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } [4, 5], [5, 7] = {g 1, g 3, g 5 } [4, 6], [5, 6] = {g 1, g 3, g 5 } [4, 5], [4, 6] = {g 1, g 3, g 5 } δ(g 3 ) δ(g 5 ) δ(g 2 ) m 1 30 / 40

40 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } [4, 5], [5, 7] = {g 1, g 3, g 5 } [4, 6], [5, 6] = {g 1, g 3, g 5 } [4, 5], [4, 6] = {g 1, g 3, g 5 } [4, 6], [5, 7] = {g 1, g 3, g 5 } δ(g 3 ) δ(g 5 ) δ(g 2 ) m 1 30 / 40

41 Contributions A KDD-oriented discussion Semantics for interval patterns Interval patterns as (hyper) rectangles m 1 m 3 g g g g g m δ(g 4 ) δ(g 1 ) [4, 5], [5, 6] = {g 1, g 3, g 5 } [4, 5], [5, 7] = {g 1, g 3, g 5 } [4, 6], [5, 6] = {g 1, g 3, g 5 } [4, 5], [4, 6] = {g 1, g 3, g 5 } [4, 6], [5, 7] = {g 1, g 3, g 5 } [4, 5], [4, 7] = {g 1, g 3, g 5 } δ(g 3 ) δ(g 5 ) δ(g 2 ) m 1 30 / 40

42 Contributions A KDD-oriented discussion A condensed representation Equivalence classes of interval patterns Two interval patterns with same image are said to be equivalent c = d c = d Equivalence class of a pattern d [d] = {c c = d} with a unique closed pattern: the smallest rectangle and one or several generators: the largest rectangles Y. Bastide, R. Taouil, N. Pasquier, G. Stumme, and L. Lakhal. Mining frequent patterns with counting inference. SIGKDD Expl., 2(2):66 75, In our example: 360 patterns ; 18 closed ; 44 generators 31 / 40

43 Contributions A KDD-oriented discussion Algorithms & experiments Algorithms: MintIntChange, MinIntChangeG[t h] [4,6] [4,5] [5,6] Experiments Mining several datasets from Bilkent University Repository Compression rate varies between 10 7 and 10 9 Interordinal scaling: encodes binary patterns not efficient even with best algorithms (e.g. LCMv2) redundancy problem discarding its use for generator extraction 32 / 40

44 Contributions A KDD-oriented discussion Algorithms & experiments Algorithms: MintIntChange, MinIntChangeG[t h] [4,6] [4,5] [5,6] Experiments Mining several datasets from Bilkent University Repository Compression rate varies between 10 7 and 10 9 Interordinal scaling: encodes binary patterns not efficient even with best algorithms (e.g. LCMv2) redundancy problem discarding its use for generator extraction 32 / 40

45 Advantages Problem Contributions A KDD-oriented discussion Discussion Minimum description length principle favours generators Potential applications Data privacy and k-anonymisation k-box problem in computational geometry Quantitative association rule mining Data summarization With very large data set, compression is not enough Numerical data are noisy Need of fault-tolerant condensed representations 33 / 40

46 1 Context 2 Formal Concept Analysis 3 Contributions Interval pattern structures Introducing similarity A KDD-oriented discussion 4 Conclusion and perspectives

47 Conclusion and perspectives Conclusion A new insight for the mining numerical data Our main tools... Formal Concept Analysis and conceptual scaling Pattern structures and projections Tolerance relation... for numerical data mining Conceptual representations of numerical data Bi-clustering Information fusion Applications: GED analysis and agricultural practice assessment 35 / 40

48 Conclusion and perspectives Conclusion An application in GED analysis With FCA and pattern structures Many ways of extracting patterns in GED Biological validation of several patterns We now need a systematic validation step using new knowledge transcription factors biological knowledge base, e.g. Gene Ontology 36 / 40

49 Conclusion and perspectives To be continued... Short- and mid- term Handle other types of biclusters and algorithm comparison S. C. Madeira and A. L. Oliveira Biclustering Algorithms for Biological Data Analysis: a survey. In IEEE/ACM Transactions on Computational Biology and Bioinformatics, Insert domain knowledge for biological data Study threshold θ effect w.r.t. the number of tolerance classes Post-doctoral position Biclustering (multi-dimensional) numerical data Numerical pattern based classifier and association rules Data privacy and pattern projection Wagner Jr. Meira (Universidade Federal de Minas Gerais, Brasil) 37 / 40

50 Conclusion and perspectives Cross-domain fertilization Itemset-mining in KDD Other frameworks for closed patterns H. Arimura and T. Uno Polynomial-Delay and Polynomial-Space Algorithms for Mining Closed Sequences, Graphs, and Pictures in Accessible Set Systems. In SIAM International Conference on Data Mining, G.C. Garriga Formal Methods for Mining Structured Objects. PhD Thesis, Universitat Politècnica de Catalunya, 2006 Condensed representations and fault-tolerant patterns m 1 m 2 m 3 g g g g g R. Pensa and J.-F. Boulicaut Towards Fault-Tolerant Formal Concept Analysis. In Proc. 9th Congress of the Italian Association for Artificial Intelligence (AI*IA), Springer, / 40

51 Conclusion and perspectives Cross-domain fertilization Data-analysis Symbolic data analysis and distances P. Agarwal, M. Kaytoue, S. O. Kuznetsov, A. Napoli and G. Polaillon Symbolic Galois Lattices with Pattern Structures. In International Conference on Rough Sets, Fuzzy Sets, Data-mining and Granularity Computing (RSFDGrC), Information fusion and fuzzy concept analysis Fuzzy settings and possibility theory Z. Assaghir, M. Kaytoue, and H. Prade A Possibility Theory Oriented Discussion of Conceptual Pattern Ptructures. In Scalable Uncertainty Management, 4th International Conference (SUM), / 40

52 Merci Danke schön Spasibo 40 / 40

Biclustering Numerical Data in Formal Concept Analysis

Biclustering Numerical Data in Formal Concept Analysis Biclustering Numerical Data in Formal Concept Analysis Mehdi Kaytoue 1,SergeiO.Kuznetsov 2, and Amedeo Napoli 1 1 Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA) Campus Scientifique,

More information

Σοφια: how to make FCA polynomial?

Σοφια: how to make FCA polynomial? Σοφια: how to make FCA polynomial? Aleksey Buzmakov 1,2, Sergei Kuznetsov 2, and Amedeo Napoli 1 1 LORIA (CNRS Inria NGE Université de Lorraine), Vandœuvre-lès-Nancy, France 2 National Research University

More information

An Introduction to Formal Concept Analysis

An Introduction to Formal Concept Analysis An Introduction to Formal Concept Analysis Mehdi Kaytoue Mehdi Kaytoue mehdi.kaytoue@insa-lyon.fr http://liris.cnrs.fr/mehdi.kaytoue October 29 th 2013 The Knowledge Discovery Process Identified domain(s)

More information

Mining Biclusters of Similar Values with Triadic Concept Analysis

Mining Biclusters of Similar Values with Triadic Concept Analysis Mining Biclusters of Similar Values with Triadic Concept Analysis Mehdi Kaytoue, Sergei O. Kuznetsov, Juraj Macko, Wagner Meira, Amedeo Napoli To cite this version: Mehdi Kaytoue, Sergei O. Kuznetsov,

More information

Biclustering meets Triadic Concept Analysis

Biclustering meets Triadic Concept Analysis Noname manuscript No. (will be inserted by the editor) Biclustering meets Triadic Concept Analysis Mehdi Kaytoue Sergei O. Kuznetsov Juraj Macko Amedeo Napoli Received: date / Accepted: date Abstract Biclustering

More information

Numerical information fusion: Lattice of answers with supporting arguments

Numerical information fusion: Lattice of answers with supporting arguments Numerical information fusion: Lattice of answers with supporting arguments Zainab Assaghir, Amedeo Napoli LORIA Campus Scientifique, B.P. 70239 54506 Vandoeuvre-les-Nancy, France {assaghiz,napoli}@loria.fr

More information

Computing Functional Dependencies with Pattern Structures

Computing Functional Dependencies with Pattern Structures Computing Functional Dependencies with Pattern Structures Jaume Baixeries 1, Mehdi Kaytoue 2, and Amedeo Napoli 3 1 Departament de Llenguatges i Sistemes Informàtics. Universitat Politècnica de Catalunya.

More information

Encyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen

Encyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen Book Title Encyclopedia of Machine Learning Chapter Number 00403 Book CopyRight - Year 2010 Title Frequent Pattern Author Particle Given Name Hannu Family Name Toivonen Suffix Email hannu.toivonen@cs.helsinki.fi

More information

Mining alpha/beta concepts as relevant bi-sets from transactional data

Mining alpha/beta concepts as relevant bi-sets from transactional data Mining alpha/beta concepts as relevant bi-sets from transactional data Jérémy Besson 1,2, Céline Robardet 3, and Jean-François Boulicaut 1 1 INSA Lyon, LIRIS CNRS FRE 2672, F-69621 Villeurbanne cedex,

More information

Pattern Structures for Risk Group Identification

Pattern Structures for Risk Group Identification Pattern Structures for Risk Group Identification Natalia V. Korepanova and Sergei O. Kuznetsov National Research University Higher School of Economics, Moscow, Russia, nkorepanova@hse.ru, skuznetsov@hse.ru

More information

Characterization of Database Dependencies with FCA and Pattern Structures

Characterization of Database Dependencies with FCA and Pattern Structures Characterization of Database Dependencies with FCA and Pattern Structures Jaume Baixeries, Mehdi Kaytoue, Amedeo Napoli To cite this version: Jaume Baixeries, Mehdi Kaytoue, Amedeo Napoli. Characterization

More information

Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results

Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Claudio Lucchese Ca Foscari University of Venice clucches@dsi.unive.it Raffaele Perego ISTI-CNR of Pisa perego@isti.cnr.it Salvatore

More information

Analysing Online Social Network Data with Biclustering and Triclustering

Analysing Online Social Network Data with Biclustering and Triclustering Analysing Online Social Network Data with Biclustering and Triclustering Alexander Semenov 1 Dmitry Gnatyshak 1 Dmitry Ignatov 1 Jonas Poelmans 2,1 1 NRU Higher School of Economics, Russia 2 KU Leuven,

More information

Characterization of Order-like Dependencies with Formal Concept Analysis

Characterization of Order-like Dependencies with Formal Concept Analysis Characterization of Order-like Dependencies with Formal Concept Analysis Victor Codocedo, Jaume Baixeries, Mehdi Kaytoue, Amedeo Napoli To cite this version: Victor Codocedo, Jaume Baixeries, Mehdi Kaytoue,

More information

A Bi-clustering Framework for Categorical Data

A Bi-clustering Framework for Categorical Data A Bi-clustering Framework for Categorical Data Ruggero G. Pensa 1,Céline Robardet 2, and Jean-François Boulicaut 1 1 INSA Lyon, LIRIS CNRS UMR 5205, F-69621 Villeurbanne cedex, France 2 INSA Lyon, PRISMa

More information

Classification Based on Logical Concept Analysis

Classification Based on Logical Concept Analysis Classification Based on Logical Concept Analysis Yan Zhao and Yiyu Yao Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yanzhao, yyao}@cs.uregina.ca Abstract.

More information

On the Intractability of Computing the Duquenne-Guigues Base

On the Intractability of Computing the Duquenne-Guigues Base Journal of Universal Computer Science, vol 10, no 8 (2004), 927-933 submitted: 22/3/04, accepted: 28/6/04, appeared: 28/8/04 JUCS On the Intractability of Computing the Duquenne-Guigues Base Sergei O Kuznetsov

More information

Mining a New Fault-Tolerant Pattern Type as an Alternative to Formal Concept Discovery

Mining a New Fault-Tolerant Pattern Type as an Alternative to Formal Concept Discovery Mining a New Fault-Tolerant Pattern Type as an Alternative to Formal Concept Discovery Jérémy Besson 1,2 and Céline Robardet 3, and Jean-François Boulicaut 1 1 INSA Lyon, LIRIS CNRS UMR 5205, F-69621 Villeurbanne

More information

Discovering Non-Redundant Association Rules using MinMax Approximation Rules

Discovering Non-Redundant Association Rules using MinMax Approximation Rules Discovering Non-Redundant Association Rules using MinMax Approximation Rules R. Vijaya Prakash Department Of Informatics Kakatiya University, Warangal, India vijprak@hotmail.com Dr.A. Govardhan Department.

More information

Pattern-Based Decision Tree Construction

Pattern-Based Decision Tree Construction Pattern-Based Decision Tree Construction Dominique Gay, Nazha Selmaoui ERIM - University of New Caledonia BP R4 F-98851 Nouméa cedex, France {dominique.gay, nazha.selmaoui}@univ-nc.nc Jean-François Boulicaut

More information

Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise

Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Guimei Liu 1,2 Jinyan Li 1 Limsoon Wong 2 Wynne Hsu 2 1 Institute for Infocomm Research, Singapore 2 School

More information

Chapter 6. Frequent Pattern Mining: Concepts and Apriori. Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining

Chapter 6. Frequent Pattern Mining: Concepts and Apriori. Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining Chapter 6. Frequent Pattern Mining: Concepts and Apriori Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining Pattern Discovery: Definition What are patterns? Patterns: A set of

More information

Constraint-based Subspace Clustering

Constraint-based Subspace Clustering Constraint-based Subspace Clustering Elisa Fromont 1, Adriana Prado 2 and Céline Robardet 1 1 Université de Lyon, France 2 Universiteit Antwerpen, Belgium Thursday, April 30 Traditional Clustering Partitions

More information

Conceptual analysis of complex system simulation data for decision support: Application to aircraft cabin design

Conceptual analysis of complex system simulation data for decision support: Application to aircraft cabin design Conceptual analysis of complex system simulation data for decision support: Application to aircraft cabin design Nizar Messai 1, Cassio Melo 2, Mohamed Hamdaoui 2, Dung Bui 2, and Marie-Aude Aufaure 2

More information

Handling Noise in Boolean Matrix Factorization

Handling Noise in Boolean Matrix Factorization Handling Noise in Boolean Matrix Factorization Radim Belohlavek, Martin Trnecka DEPARTMENT OF COMPUTER SCIENCE PALACKÝ UNIVERSITY OLOMOUC 26th International Joint Conference on Artificial Intelligence

More information

Editorial Manager(tm) for Data Mining and Knowledge Discovery Manuscript Draft

Editorial Manager(tm) for Data Mining and Knowledge Discovery Manuscript Draft Editorial Manager(tm) for Data Mining and Knowledge Discovery Manuscript Draft Manuscript Number: Title: Summarizing transactional databases with overlapped hyperrectangles, theories and algorithms Article

More information

Un nouvel algorithme de génération des itemsets fermés fréquents

Un nouvel algorithme de génération des itemsets fermés fréquents Un nouvel algorithme de génération des itemsets fermés fréquents Huaiguo Fu CRIL-CNRS FRE2499, Université d Artois - IUT de Lens Rue de l université SP 16, 62307 Lens cedex. France. E-mail: fu@cril.univ-artois.fr

More information

George J. Klir Radim Belohlavek, Martin Trnecka. State University of New York (SUNY) Binghamton, New York 13902, USA

George J. Klir Radim Belohlavek, Martin Trnecka. State University of New York (SUNY) Binghamton, New York 13902, USA POSSIBILISTIC INFORMATION: A Tutorial Basic Level in Formal Concept Analysis: Interesting Concepts and Psychological Ramifications George J. Klir Radim Belohlavek, Martin Trnecka State University of New

More information

From Local Pattern Mining to Relevant Bi-cluster Characterization

From Local Pattern Mining to Relevant Bi-cluster Characterization From Local Pattern Mining to Relevant Bi-cluster Characterization Ruggero G. Pensa and Jean-François Boulicaut INSA Lyon, LIRIS CNRS, UMR 5205 F-69621, Villeurbanne cedex, France {Ruggero.Pensa, Jean-Francois.Boulicaut}@insa-lyon.fr

More information

A First Study on What MDL Can Do for FCA

A First Study on What MDL Can Do for FCA A First Study on What MDL Can Do for FCA Tatiana Makhalova 1,2, Sergei O. Kuznetsov 1, and Amedeo Napoli 2 1 National Research University Higher School of Economics, 3 Kochnovsky Proezd, Moscow, Russia

More information

Helsinki University of Technology Publications in Computer and Information Science December 2007 AN APPROXIMATION RATIO FOR BICLUSTERING

Helsinki University of Technology Publications in Computer and Information Science December 2007 AN APPROXIMATION RATIO FOR BICLUSTERING Helsinki University of Technology Publications in Computer and Information Science Report E13 December 2007 AN APPROXIMATION RATIO FOR BICLUSTERING Kai Puolamäki Sami Hanhijärvi Gemma C. Garriga ABTEKNILLINEN

More information

Bioinformatics Chapter 1. Introduction

Bioinformatics Chapter 1. Introduction Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!

More information

Concept Lattices in Rough Set Theory

Concept Lattices in Rough Set Theory Concept Lattices in Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca URL: http://www.cs.uregina/ yyao Abstract

More information

A Generalized Decision Logic in Interval-set-valued Information Tables

A Generalized Decision Logic in Interval-set-valued Information Tables A Generalized Decision Logic in Interval-set-valued Information Tables Y.Y. Yao 1 and Qing Liu 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca

More information

Towards the Complexity of Recognizing Pseudo-intents

Towards the Complexity of Recognizing Pseudo-intents Towards the Complexity of Recognizing Pseudo-intents Barış Sertkaya Theoretical Computer Science TU Dresden, Germany July 30, 2009 Supported by the German Research Foundation (DFG) under grant BA 1122/12-1

More information

Finding Errors in New Object in Formal Contexts

Finding Errors in New Object in Formal Contexts Finding Errors in New Object in Formal Contexts Artem Revenko 12, Sergei O. Kuznetsov 2, and Bernhard Ganter 1 1 Technische Universität Dresden Zellescher Weg 12-14, 01069 Dresden, Germany 2 National Research

More information

FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH

FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH M. De Cock C. Cornelis E. E. Kerre Dept. of Applied Mathematics and Computer Science Ghent University, Krijgslaan 281 (S9), B-9000 Gent, Belgium phone: +32

More information

Data Mining and Analysis: Fundamental Concepts and Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 2 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA

More information

Calcul de motifs sous contraintes pour la classification supervisée

Calcul de motifs sous contraintes pour la classification supervisée Calcul de motifs sous contraintes pour la classification supervisée Constraint-based pattern mining for supervised classification Dominique Joël Gay Soutenance de thèse pour l obtention du grade de docteur

More information

Concept-based Data Mining with Scaled Labeled Graphs

Concept-based Data Mining with Scaled Labeled Graphs Concept-based Data Mining with Scaled Labeled Graphs Bernhard Ganter, Peter A. Grigoriev, Sergei O. Kuznetsov, Mikhail V. Samokhin Technische Universität Dresden All-Russian Institute for Scientific and

More information

Exploring Spatial Relationships for Knowledge Discovery in Spatial Data

Exploring Spatial Relationships for Knowledge Discovery in Spatial Data 2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Exploring Spatial Relationships for Knowledge Discovery in Spatial Norazwin Buang

More information

From Meaningful Orderings in the Web of Data to Multi-level Pattern Structures

From Meaningful Orderings in the Web of Data to Multi-level Pattern Structures From Meaningful Orderings in the Web of Data to Multi-level Pattern Structures Quentin Brabant, Miguel Couceiro, Amedeo Napoli, Justine Reynaud To cite this version: Quentin Brabant, Miguel Couceiro, Amedeo

More information

On Proofs and Rule of Multiplication in Fuzzy Attribute Logic

On Proofs and Rule of Multiplication in Fuzzy Attribute Logic On Proofs and Rule of Multiplication in Fuzzy Attribute Logic Radim Belohlavek 1,2 and Vilem Vychodil 2 1 Dept. Systems Science and Industrial Engineering, Binghamton University SUNY Binghamton, NY 13902,

More information

Summarizing Transactional Databases with Overlapped Hyperrectangles

Summarizing Transactional Databases with Overlapped Hyperrectangles Noname manuscript No. (will be inserted by the editor) Summarizing Transactional Databases with Overlapped Hyperrectangles Yang Xiang Ruoming Jin David Fuhry Feodor F. Dragan Abstract Transactional data

More information

The dependence graph of a lattice

The dependence graph of a lattice The depence graph of a lattice Karell Bertet L3I - Université de La Rochelle - av Michel Crépeau - 17042 La Rochelle kbertet@univ-lr.fr Abstract: In this paper, we introduce the depence graph of a lattice

More information

What can FCA do for database linkkey extraction? (problem paper)

What can FCA do for database linkkey extraction? (problem paper) What can FCA do for database linkkey extraction? (problem paper) Manuel Atencia 1,2, Jérôme David 1,2, and Jérôme Euzenat 2,1 1 Univ. Grenoble Alpes & 2 INRIA, Grenoble, France http://exmo.inria.fr Abstract.

More information

Foundations of Classification

Foundations of Classification Foundations of Classification J. T. Yao Y. Y. Yao and Y. Zhao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {jtyao, yyao, yanzhao}@cs.uregina.ca Summary. Classification

More information

2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms

2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms Bioinformatics Advance Access published March 7, 2007 The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

More information

1 Algebraic Methods. 1.1 Gröbner Bases Applied to SAT

1 Algebraic Methods. 1.1 Gröbner Bases Applied to SAT 1 Algebraic Methods In an algebraic system Boolean constraints are expressed as a system of algebraic equations or inequalities which has a solution if and only if the constraints are satisfiable. Equations

More information

Interpreting Low and High Order Rules: A Granular Computing Approach

Interpreting Low and High Order Rules: A Granular Computing Approach Interpreting Low and High Order Rules: A Granular Computing Approach Yiyu Yao, Bing Zhou and Yaohua Chen Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail:

More information

Mining Molecular Fragments: Finding Relevant Substructures of Molecules

Mining Molecular Fragments: Finding Relevant Substructures of Molecules Mining Molecular Fragments: Finding Relevant Substructures of Molecules Christian Borgelt, Michael R. Berthold Proc. IEEE International Conference on Data Mining, 2002. ICDM 2002. Lecturers: Carlo Cagli

More information

CSE 5243 INTRO. TO DATA MINING

CSE 5243 INTRO. TO DATA MINING CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University Slides adapted from Prof. Jiawei Han @UIUC, Prof. Srinivasan

More information

Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries

Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries To appear in Data Mining and Knowledge Discovery, an International Journal c Kluwer Academic Publishers

More information

Characterizing Approximate-Matching Dependencies in Formal Concept Analysis with Pattern Structures

Characterizing Approximate-Matching Dependencies in Formal Concept Analysis with Pattern Structures Characterizing Approximate-Matching Dependencies in Formal Concept Analysis with Pattern Structures Jaume Baixeries a, Victor Codocedo c, Mehdi Kaytoue c,d and Amedeo Napoli b a Departament de Ciències

More information

Using Image Moment Invariants to Distinguish Classes of Geographical Shapes

Using Image Moment Invariants to Distinguish Classes of Geographical Shapes Using Image Moment Invariants to Distinguish Classes of Geographical Shapes J. F. Conley, I. J. Turton, M. N. Gahegan Pennsylvania State University Department of Geography 30 Walker Building University

More information

From a Possibility Theory View of Formal Concept Analysis to the Possibilistic Handling of Incomplete and Uncertain Contexts

From a Possibility Theory View of Formal Concept Analysis to the Possibilistic Handling of Incomplete and Uncertain Contexts From a Possibility Theory View of Formal Concept Analysis to the Possibilistic Handling of Incomplete and Uncertain Contexts Zina Ait-Yakoub 1, Yassine Djouadi 2, Didier Dubois 2, and Henri Prade 2 1 Department

More information

Towards a Navigation Paradigm for Triadic Concepts

Towards a Navigation Paradigm for Triadic Concepts Towards a Navigation Paradigm for Triadic Concepts Sebastian Rudolph 1, Christian Săcărea 2, and Diana Troancă 2 1 Technische Universität Dresden 2 Babeş-Bolyai Cluj Napoca Abstract. The simple formalization

More information

Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries

Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries Data Mining and Knowledge Discovery, 7, 5 22, 2003 c 2003 Kluwer Academic Publishers. Manufactured in The Netherlands. Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency

More information

hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference

hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference CS 229 Project Report (TR# MSB2010) Submitted 12/10/2010 hsnim: Hyper Scalable Network Inference Machine for Scale-Free Protein-Protein Interaction Networks Inference Muhammad Shoaib Sehgal Computer Science

More information

Fuzzy Answer Set semantics for Residuated Logic programs

Fuzzy Answer Set semantics for Residuated Logic programs semantics for Logic Nicolás Madrid & Universidad de Málaga September 23, 2009 Aims of this paper We are studying the introduction of two kinds of negations into residuated : Default negation: This negation

More information

SOME OPEN PROBLEMS IN FORMAL CONCEPT ANALYSIS. Problems presented at ICFCA 2006 in Dresden. Problem 1 (Minimal generators for Boolean layer cakes).

SOME OPEN PROBLEMS IN FORMAL CONCEPT ANALYSIS. Problems presented at ICFCA 2006 in Dresden. Problem 1 (Minimal generators for Boolean layer cakes). SOME OPEN PROBLEMS IN FORMAL CONCEPT ANALYSIS Abstract. This note intends to collect some open problems in Formal Concept Analysis. The ones below have been presented at the International Conference on

More information

Concept interestingness measures: a comparative study

Concept interestingness measures: a comparative study Concept interestingness measures: a comparative study Sergei O. Kuznetsov 1 and Tatiana P. Makhalova 1,2 1 National Research University Higher School of Economics, Kochnovsky pr. 3, Moscow 125319, Russia

More information

Reduction in Triadic Data Sets

Reduction in Triadic Data Sets Reduction in Triadic Data Sets Sebastian Rudolph 1, Christian Săcărea 2, and Diana Troancă 2 1 Technische Universität Dresden 2 Babeş-Bolyai University Cluj Napoca sebastian.rudolph@tu.dresden.de, csacarea@cs.ubbcluj.ro,

More information

A General Testability Theory: Classes, properties, complexity, and testing reductions

A General Testability Theory: Classes, properties, complexity, and testing reductions A General Testability Theory: Classes, properties, complexity, and testing reductions presenting joint work with Luis Llana and Pablo Rabanal Universidad Complutense de Madrid PROMETIDOS-CM WINTER SCHOOL

More information

Formal Concept Analysis: Foundations and Applications

Formal Concept Analysis: Foundations and Applications Formal Concept Analysis: Foundations and Applications Philippe Balbiani Institut de recherche en informatique de Toulouse Outline Introduction (page 3) Concept lattices of contexts (page 10) Many-valued

More information

OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTRIBUTES

OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTRIBUTES OPTIMIZATION IN ENGINE DESIGN VIA FORMAL CONCEPT ANALYSIS USING NEGATIVE ATTRIBUTES Rodríguez-Jiménez, J. M. 1, Cordero, P. 1, Enciso, M. 1 and Mora, A 1. 1 Universidad de Málaga, Andalucía Tech, Boulevar

More information

Approximate Boolean Reasoning Approach to Rough Sets and Data Mining

Approximate Boolean Reasoning Approach to Rough Sets and Data Mining Approximate Boolean Reasoning Approach to Rough Sets and Data Mining Hung Son Nguyen Institute of Mathematics, Warsaw University son@mimuw.edu.pl RSFDGrC, September 3, 2005 Hung Son Nguyen (UW) ABR approach

More information

Using transposition for pattern discovery from microarray data

Using transposition for pattern discovery from microarray data Using transposition for pattern discovery from microarray data François Rioult GREYC CNRS UMR 6072 Université de Caen F-14032 Caen, France frioult@info.unicaen.fr Jean-François Boulicaut LIRIS CNRS FRE

More information

Home Page. Title Page. Page 1 of 35. Go Back. Full Screen. Close. Quit

Home Page. Title Page. Page 1 of 35. Go Back. Full Screen. Close. Quit JJ II J I Page 1 of 35 General Attribute Reduction of Formal Contexts Tong-Jun Li Zhejiang Ocean University, China litj@zjou.edu.cn September, 2011,University of Milano-Bicocca Page 2 of 35 Objective of

More information

An embedding of ChuCors in L-ChuCors

An embedding of ChuCors in L-ChuCors Proceedings of the 10th International Conference on Computational and Mathematical Methods in Science and Engineering, CMMSE 2010 27 30 June 2010. An embedding of ChuCors in L-ChuCors Ondrej Krídlo 1,

More information

Galois Connections for Dependencies in Databases

Galois Connections for Dependencies in Databases Galois onnections for Dependencies in Databases Sergei O. Kuznetsov National Research University Higher School of Economics Moscow, Russia November 17, 2017 Gestion de Données Principes, Technologies et

More information

Relation between Pareto-Optimal Fuzzy Rules and Pareto-Optimal Fuzzy Rule Sets

Relation between Pareto-Optimal Fuzzy Rules and Pareto-Optimal Fuzzy Rule Sets Relation between Pareto-Optimal Fuzzy Rules and Pareto-Optimal Fuzzy Rule Sets Hisao Ishibuchi, Isao Kuwajima, and Yusuke Nojima Department of Computer Science and Intelligent Systems, Osaka Prefecture

More information

Incremental Learning of TBoxes from Interpretation Sequences with Methods of Formal Concept Analysis

Incremental Learning of TBoxes from Interpretation Sequences with Methods of Formal Concept Analysis Incremental Learning of TBoxes from Interpretation Sequences with Methods of Formal Concept Analysis Francesco Kriegel Institute for Theoretical Computer Science, TU Dresden, Germany francesco.kriegel@tu-dresden.de

More information

Differential Modeling for Cancer Microarray Data

Differential Modeling for Cancer Microarray Data Differential Modeling for Cancer Microarray Data Omar Odibat Department of Computer Science Feb, 01, 2011 1 Outline Introduction Cancer Microarray data Problem Definition Differential analysis Existing

More information

Data Mining and Analysis: Fundamental Concepts and Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms : Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 2 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA 2 Department of Computer

More information

Akademie věd. Concept lattices and attribute implications from data with fuzzy attributes

Akademie věd. Concept lattices and attribute implications from data with fuzzy attributes G) Akademie věd České republiky Teze doktorské disertační práce k získání vědeckého titulu doktor věd ve skupine věd fyzikálně-matematické vědy Concept lattices and attribute implications from data with

More information

Data Mining and Matrices

Data Mining and Matrices Data Mining and Matrices 08 Boolean Matrix Factorization Rainer Gemulla, Pauli Miettinen June 13, 2013 Outline 1 Warm-Up 2 What is BMF 3 BMF vs. other three-letter abbreviations 4 Binary matrices, tiles,

More information

Formal Concept Analysis as a Framework for Business Intelligence Technologies II

Formal Concept Analysis as a Framework for Business Intelligence Technologies II Formal Concept Analysis as a Framework for Business Intelligence Technologies II Juraj Macko Division of Applied computer science Dept. Computer Science Palacky University, Olomouc 17. listopadu 12, CZ-77146

More information

Implications from data with fuzzy attributes

Implications from data with fuzzy attributes Implications from data with fuzzy attributes Radim Bělohlávek, Martina Chlupová, and Vilém Vychodil Dept. Computer Science, Palacký University, Tomkova 40, CZ-779 00, Olomouc, Czech Republic Email: {radim.belohlavek,

More information

TRIAS An Algorithm for Mining Iceberg Tri-Lattices

TRIAS An Algorithm for Mining Iceberg Tri-Lattices TRIAS An Algorithm for Mining Iceberg Tri-Lattices Robert Jschke 1,2, Andreas Hotho 1, Christoph Schmitz 1, Bernhard Ganter 3, Gerd Stumme 1,2 1 Knowledge & Data Engineering Group, University of Kassel,

More information

Introduction to Kleene Algebras

Introduction to Kleene Algebras Introduction to Kleene Algebras Riccardo Pucella Basic Notions Seminar December 1, 2005 Introduction to Kleene Algebras p.1 Idempotent Semirings An idempotent semiring is a structure S = (S, +,, 1, 0)

More information

Concepts of a Discrete Random Variable

Concepts of a Discrete Random Variable Concepts of a Discrete Random Variable Richard Emilion Laboratoire MAPMO, Université d Orléans, B.P. 6759 45067 Orléans Cedex 2, France, richard.emilion@univ-orleans.fr Abstract. A formal concept is defined

More information

Mining Free Itemsets under Constraints

Mining Free Itemsets under Constraints Mining Free Itemsets under Constraints Jean-François Boulicaut Baptiste Jeudy Institut National des Sciences Appliquées de Lyon Laboratoire d Ingénierie des Systèmes d Information Bâtiment 501 F-69621

More information

Some Measures of Picture Fuzzy Sets and Their Application in Multi-attribute Decision Making

Some Measures of Picture Fuzzy Sets and Their Application in Multi-attribute Decision Making I.J. Mathematical Sciences and Computing, 2018, 3, 23-41 Published Online July 2018 in MECS (http://www.mecs-press.net) DOI: 10.5815/ijmsc.2018.03.03 Available online at http://www.mecs-press.net/ijmsc

More information

An Extended Branch and Bound Search Algorithm for Finding Top-N Formal Concepts of Documents

An Extended Branch and Bound Search Algorithm for Finding Top-N Formal Concepts of Documents An Extended Branch and Bound Search Algorithm for Finding Top-N Formal Concepts of Documents Makoto HARAGUCHI and Yoshiaki OKUBO Division of Computer Science Graduate School of Information Science and

More information

Solving Fuzzy PERT Using Gradual Real Numbers

Solving Fuzzy PERT Using Gradual Real Numbers Solving Fuzzy PERT Using Gradual Real Numbers Jérôme FORTIN a, Didier DUBOIS a, a IRIT/UPS 8 route de Narbonne, 3062, Toulouse, cedex 4, France, e-mail: {fortin, dubois}@irit.fr Abstract. From a set of

More information

Distributed Data Mining for Pervasive and Privacy-Sensitive Applications. Hillol Kargupta

Distributed Data Mining for Pervasive and Privacy-Sensitive Applications. Hillol Kargupta Distributed Data Mining for Pervasive and Privacy-Sensitive Applications Hillol Kargupta Dept. of Computer Science and Electrical Engg, University of Maryland Baltimore County http://www.cs.umbc.edu/~hillol

More information

Matrix Factorizations over Non-Conventional Algebras for Data Mining. Pauli Miettinen 28 April 2015

Matrix Factorizations over Non-Conventional Algebras for Data Mining. Pauli Miettinen 28 April 2015 Matrix Factorizations over Non-Conventional Algebras for Data Mining Pauli Miettinen 28 April 2015 Chapter 1. A Bit of Background Data long-haired well-known male Data long-haired well-known male ( ) 1

More information

A Proposition for Sequence Mining Using Pattern Structures

A Proposition for Sequence Mining Using Pattern Structures A Proposition for Sequence Mining Using Pattern Structures Victor Codocedo, Guillaume Bosc, Mehdi Kaytoue, Jean-François Boulicaut, Amedeo Napoli To cite this version: Victor Codocedo, Guillaume Bosc,

More information

An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases

An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases Takeaki Uno, Tatsuya Asai 2 3, Yuzo Uchida 2, and Hiroki Arimura 2 National Institute of Informatics, 2--2, Hitotsubashi,

More information

TRIAS An Algorithm for Mining Iceberg Tri-Lattices

TRIAS An Algorithm for Mining Iceberg Tri-Lattices TRIAS An Algorithm for Mining Iceberg Tri-Lattices Robert Jäschke 1,2, Andreas Hotho 1, Christoph Schmitz 1, Bernhard Ganter 3, Gerd Stumme 1,2 1 Knowledge & Data Engineering Group, University of Kassel,

More information

Data-Driven Logical Reasoning

Data-Driven Logical Reasoning Data-Driven Logical Reasoning Claudia d Amato Volha Bryl, Luciano Serafini November 11, 2012 8 th International Workshop on Uncertainty Reasoning for the Semantic Web 11 th ISWC, Boston (MA), USA. Heterogeneous

More information

Effects of Gap Open and Gap Extension Penalties

Effects of Gap Open and Gap Extension Penalties Brigham Young University BYU ScholarsArchive All Faculty Publications 200-10-01 Effects of Gap Open and Gap Extension Penalties Hyrum Carroll hyrumcarroll@gmail.com Mark J. Clement clement@cs.byu.edu See

More information

Mining bi-sets in numerical data

Mining bi-sets in numerical data Mining bi-sets in numerical data Jérémy Besson, Céline Robardet, Luc De Raedt and Jean-François Boulicaut Institut National des Sciences Appliquées de Lyon - France Albert-Ludwigs-Universitat Freiburg

More information

Expressiveness, decidability, and undecidability of Interval Temporal Logic

Expressiveness, decidability, and undecidability of Interval Temporal Logic University of Udine Department of Mathematics and Computer Science Expressiveness, decidability, and undecidability of Interval Temporal Logic ITL - Beyond the end of the light Ph.D. Defence Dario Della

More information

Key Words: geospatial ontologies, formal concept analysis, semantic integration, multi-scale, multi-context.

Key Words: geospatial ontologies, formal concept analysis, semantic integration, multi-scale, multi-context. Marinos Kavouras & Margarita Kokla Department of Rural and Surveying Engineering National Technical University of Athens 9, H. Polytechniou Str., 157 80 Zografos Campus, Athens - Greece Tel: 30+1+772-2731/2637,

More information

Principles of AI Planning

Principles of AI Planning Principles of 5. Planning as search: progression and regression Malte Helmert and Bernhard Nebel Albert-Ludwigs-Universität Freiburg May 4th, 2010 Planning as (classical) search Introduction Classification

More information

Similarity-based Classification with Dominance-based Decision Rules

Similarity-based Classification with Dominance-based Decision Rules Similarity-based Classification with Dominance-based Decision Rules Marcin Szeląg, Salvatore Greco 2,3, Roman Słowiński,4 Institute of Computing Science, Poznań University of Technology, 60-965 Poznań,

More information

Computing minimal generators from implications: a logic-guided approach

Computing minimal generators from implications: a logic-guided approach Computing minimal generators from implications: a logic-guided approach P. Cordero, M. Enciso, A. Mora, M Ojeda-Aciego Universidad de Málaga. Spain. pcordero@uma.es, enciso@lcc.uma.es, amora@ctima.uma.es,

More information

Fuzzy attribute logic over complete residuated lattices

Fuzzy attribute logic over complete residuated lattices Journal of Experimental & Theoretical Artificial Intelligence Vol. 00, No. 00, Month-Month 200x, 1 8 Fuzzy attribute logic over complete residuated lattices RADIM BĚLOHLÁVEK, VILÉM VYCHODIL Department

More information