Concepts of a Discrete Random Variable
|
|
- Benedict Doyle
- 5 years ago
- Views:
Transcription
1 Concepts of a Discrete Random Variable Richard Emilion Laboratoire MAPMO, Université d Orléans, B.P Orléans Cedex 2, France, richard.emilion@univ-orleans.fr Abstract. A formal concept is defined in the literature as a pair (extent, intent) with respect to a context which is usually empirical, as for example a sample of transactions. This is somewhat unsatisfying since concepts, though born from experiences, should not depend on them. In this paper we consider the above concepts as empirical concepts and we define the notion of concept, in a context-free framework, as a limit intent, by proving, applying the large number law, that : Given a random variable X taking its value in a countable σ-semilattice, the random intents of empirical concepts, with respect to a sample of X, converge almost everywhere to a fixed deterministic limit, called a concept, whose identification shows that it only depends on the distribution P X of X. Moreover, the set of such concepts is the σ-semilattice generated by the support of X and has even a structure of σ-lattice: the lattice concept of a random variable. We also compute the mean number of concepts and frequent itemsets for a hierarchical Bernoulli mixtures model. Last, we propose an algorithm to find out maximal frequent itemsets by using minimal winning coalitions of P X. 1 Introduction An important component of data mining is rule induction, that is extraction of useful if-then rules from data, and a key step in this induction consists in mining what is usually called frequent itemsets (FI s) as introduced in Agrawal et al. (1993 and 1994). In order to understand the ideas beyond these mining algorithms, it is helpful to use the notions of Galois connections, intent, extent, closed sets and so on. A pair (extent, intent) was called concept by Wille (1980) but this notion of concept, widely used in various domains (artificial intelligence, robotics, psychology, software engineering, text mining and so on), depends on the extent which is, roughly speaking, a random sample. In the present paper we call it (random) empirical concept and we define concepts as limit of empirical intents, showing that these limits are no more random and do not depend on the sample. In other words concepts are defined with respect to a random variable rather than to a sample of this random variable. The paper is organized as follows: Random variables taking their values in a σ-semilattice are introduced in Section 2. Random empirical Galois lattices are defined in Section 3 where is also proved the convergence of random intents and is defined the concept lattice of a random variable. In Section 4, the average number of empirical concepts and of frequent itemsets is computed
2 2 Emilion for a hierarchical Bernoulli mixtures model. Frequent itemsets and winning coalitions are studied in Section 5, providing an algorithm for mining maximal frequent itemsets. The present paper answers a question of Edwin Diday who has also a definition of concept as an intent (Bock et al. (2000)). It is dedicated to Edwin Diday who introduced us to several interesting problems. 2 Notations and terminology 2.1 σ-semilattice Our set of observations, say L, is taken very general in order to cover a wide area of applications. It can be a subset of real numbers, real vectors, real functions, fuzzy sets, power set, words of a language, real cumulative distribution functions, real stochastic processes, and so on. Let (L,, ) be a countable semilattice, that is L is a countable set is a partial order relation on the set L is an infimum operator. We will asume in addition that L is a σ-semilattice : for any (countable) subset A L, there exists a largest element in L, denoted by L A L, which is lower than any L A. Without loss of generality, it can also be assumed that there exists a largest element in L, denoted by 1, and by convention L = 1 if A =. L A If (L n ) n 1 is a decreasing sequence in L, then we will say that this sequence is convergent and that its limit is 2.2 L-valued random variable L n. n=1 Let (Ω, B, P) be a probability space. Let X : Ω L be a (discrete) L-valued random variable (r.v.) whose distribution probability P X is a probability measure on L defined as usual as follows: L L, P X (L) = P(X = L) = P(ω Ω : X(ω) = L).
3 Concepts of a Discrete Random Variable 3 The support of P X will play a key role: it is defined as the set S X = {L L : P X (L) > 0}. For any n {1, 2, 3...} a n sample of X is a sequence X 1,...,X n where the X i s are independent and identically distributed (iid) r.v. s distributed as X. 2.3 Data mining context To join the terminology of data mining (which comes from marketing) with the preceding setting, it suffices to take L = (P(J),, ), the power set of a (large) finite set J of items. Any any L L is then a subset of J and is usually called an itemset. The random variable X modellizes random transactions made by customers, the itemset X(ω) L representing the random set of items bought by a customer. The following simple example illustrates what is usually called a binary context. Take J = {a, b, c, d, e} and n = 10 transactions (1 means that the item was bought and 0 not). The last column in Table 1 below contains the random set that will be considered in our approach. a b c d e random set X 1(ω) = {b, e} X 2(ω) = {a, b, e} X 3(ω) = {b, c, d} X 4(ω) = {a, d} X 5(ω) = {b, c, e} X 6(ω) = {a, b, c, d} X 7(ω) = {c, d, e} X 8(ω) = {a, b, d} X 9(ω) = {b, c, d, e} X 10(ω) = {c, d} Table 1. Binary context and random set. 3 Galois lattice for semilattices The notion of Galois connection (GC) was early introduced in Ore (1944), it is also mentionned in the book by Birkhoff (1967), chapter 5. We first note
4 4 Emilion that Barbut and Monjardet elegant and general definition of a Galois lattice (GL), stated for a GC between lattices (Barbut et al. (1970), pages 13 and 25), can be extended to a GC between semilattices: Let < E,, > and < F,, > be two semilattices, a GC between E and F is a pair of mappings (f, g) verifying f : E F and g : F E are decreasing, (1) h = g f : E E and k = f g : F F are extensive, (2) These definitions imply that i.e. x E, x h(x) and y F, y k(y). f h = f, h h = h, g k = g, k k = k. (3) Let I h = {x E : h(x) = x} (resp. I k = {y F : k(y) = y}) be the set of closed (or invariant) elements of E (resp. of F ). It can be seen that the restriction of f to I h is a one-to-one mapping into I k, its inverse being the restriction of g to I k. The Galois lattice (GL) G induced by the GC (f, g) is defined as the set of nodes G = {(x, f(x)), x I h }, which has a lattice structure if, and are defined as follows: It is easily seen that (x, f(x)) (x, f(x )) iff x x and f(x ) f(x), (x, f(x)) (x, f(x )) = (g(f(x) f(x )), f(x) f(x )), (x, f(x)) (x, f(x )) = (x x, f(x x )). G = {(x, f(x)), x I h } = {(g(y), y), y I k }. The mapping f (resp. g) is called an intent (resp. an extent). As any pair (x, y) of the GL satifies y = f(x) and x = g(y), Wille (1980) then proposed to call such a pair a concept. It is worthwhile to mention that the name of Galois appears here because of the analogy with a fundamental result in the celebrated Galois theory on the one-to-one correspondance between the intermediate fields of a field extension and the subgroups of its Galois group (see e.g. Stewart (1975), page 114).
5 Concepts of a Discrete Random Variable Binary GL Let I be a set of objects and J a set of properties. Let R be a binary relation on I J : irj iff object i has property j. For any non-emptyset A E = P(I) let f(a) = {j J : irj for all i A} and f( ) = J (4) be the the intent or the description of A, that is the set of properties satisfied by all objects of A. For any non-empty set B F = P(J ) let g(b) = {i I : irj for all j B} and g( ) = I (5) be the extent of B, that is the set of objects satisfying all the properties given by B. The pair (f, g) is a popular example of GC, it is called a binary GC. 3.2 Explicit formulas for a general GC Let E = P(I), where I) denote a countable set of objects. In most concrete situations, only the descriptions d(i), i I, which belong to a general σ semilattice L, are given. A natural question to ask is the existence of a GC (f, g) such that f({i}) = d(i) with explicit fomulas generalizing formulas (4) (5) of the binary case. The solution exists, and is unique if the GC is supposed maximal (that is not dominated by a GC): Theorem (Diday - Emilion (1997), (2003)) There exists a unique maximal GC (f, g) between E = P(I) and L verifying f({i}) = d(i). It is given by the formulas: f(a) = i A d(i) for any non-empty A E, (6) f( ) = 1, g(l) = {i I : L d(i)} for any L L. (7) Note that (6) and (7) imply h(a) = g(f(a)) = {i I : d(j) d(i)} for any A E, (8) k(l) = f(g(l)) = j A i I:L d(i) d(i) for any L L. (9) In the binary case, L = (P(J ),, ) is isomorphic to ({0, 1} #J,, ), therefore (6) and (7) generalize (4) and (5)
6 6 Emilion 4 Random Galois lattices 4.1 Random empirical Galois lattices As above, let X : Ω L be a (discrete) L-valued random variable (r.v.). Let X 1,..., X n,... be a sequence of iid r.v. s distributed as X. For any n = 1, 2,..., consider the following random Galois connections: < E n,, >=< P{1, 2,..., n},, >, < F,, >= (L,, ), f n (A) = X i, g n (L) = {i {1, 2,..., n} : L X i }, i A h n = g n f n, k n = f n g n for any A E n and L L. Note that h n (A) = {i {1, 2,..., n} : while k n (L) = j A i {1,2,...,n}:L X i X i. X j X i } 4.2 Convergence of random empirical intents We are now in a position to state the announced result on the convergence of random empirical intents with the identification of the deterministic limit. Theorem 1. For any L L the random intents k n (L) = i=1,...,n:l X i X i converge a.e. towards the following deterministic limit: k (L) = lim k n (L) = L. n L S X:L L Proof. For any L L, let 1 (Xi=L)(ω) = 1 if X i (ω) = L and = 0 otherwise. Since the r.v. s 1 (Xi=L) so defined are i.i.d. with expectation P X (L), the large number law provides a nullset N L Ω, N L B, such that P(N L ) = 0, which satisfies ω / N L, 1 n n 1 (Xi=L)(ω) P X (L). i=1 In particular for any L S X, since P X (L) > 0, we have ω / N L, n 1 (Xi=L)(ω) 1 i=1
7 Concepts of a Discrete Random Variable 7 for n large enough. Therefore that is ω / N L, i 1 : X i (ω) = L ω / N L, L {X i (ω), i = 1, 2,...}. As S X is countable, the set N = L S X N L belongs to B, P(N) = 0 and ω / N, S X {X i (ω), i = 1, 2,...}. (10) On the other hand, for any i = 1, 2,..., let Then, we have since L\S X is countable and N i = {ω : X i (ω) / S X }. P(N i ) = 0 P(N i ) = P(X i / S X ) = P(X / S X ) = L/ S X P(X = L) = 0 by definition of S X. Now, by definition of the N i s we have or equivalently ω / ω / N i, X i (ω) S X i = 1, 2,... i=1 N i, {X i (ω), i = 1, 2,...} S X. (11) i=1 So, if we let N 0 = N i=1 N i, then P(N 0 ) = 0 and (10), (11) imply that is, shortly, ω / N 0, {X i (ω), i = 1, 2,...} = S X, {X i, i = 1, 2,...} = S X a.e. (12) Note that (12) holds for any random variable taking its value in a countable set. Observe now that (12) implies that for any L L and thus {X i, i = 1, 2,... : L X i } = {L S X : L L } a.e. i=1,2...,:l X i X i = This completes the proof. L S X:L L L a.e..
8 8 Emilion 4.3 Limit GL Obviously, the above closure operator k can be obtained by the following limit GC < E,, >=< P{1, 2,..., },, >, < F,, >= (L,, ), f (A) = X i, i A g (L) = {i {1, 2,...} : L X i }, h = g f, k = f g for any A E and L L. So, h (A) = {i {1, 2,...} : while k (L) = j A i {1,2,...}:L X i X i. X j X i }, Hence the random limit GL can be defined as the lattice: G = {g (L), k (L)), L L}. Note that the extent g (L) is random and depends on the sample (X i ) i=1,2,... while the intent is deterministic and does not depend on the sequence (X i ) i=1,2, Concepts, concept lattice Definition: A concept of the r.v. X is an element of L such that L = L. L S X:L L The set of concepts will be denoted by C(X, L), shortly, C. The random set of empirical intents w.r.t. a sample X 1,..., X n of X will be denoted by C(X 1,..., X n, L), shortly, C n : The above theorem states that C n = k n (L) = {k n (L), L L}. k (L) = {k (L), L L} = C(X, L) a.e.. Since we have L k (L) k n+1 (L) k n (L) we see that k n (L) = L k (L) = L, in other words k n (L) k n+1 (L) k (L). (13)
9 Concepts of a Discrete Random Variable 9 Proposition 1. C(X, L) is the σ-semilattice generated by S X. In particular, if P(X = L) > 0 then L is a concept. Note however that L such that P(X = L) = 0 can be a concept: let L = {0, c, a, b, 1} where 0 (resp. 1) is the lowest (resp. largest) element of L, a b, b a, c = a b and let X be such that P(X = a) = 1/2, P(X = b) = 1/2. Then c is a concept and P(X = c) = 0. Further, observe that Proposition 2. i) If L < 1 is a concept then P(L X) > 0 ii) {L S X : L L } = {L S X : k (L) L } iii) P(L X) = P(k (L) X) iv) If k (L) < 1 then P(L X) > 0 v) 1 is a concept iff P(X = 1) > 0 vi) If k (L) = 1 then P(L X) > 0 iff P(X = 1) > 0 Note that P(L X) > 0 means that for a.a. sample, L appears infinitely often within an itemset. Also, the converse of i) need not be true (use iv)). Proposition 3. C(X, L) is a σ-lattice. 5 Average number of concepts for hierarchical Bernoulli mixtures Consider the case where L = (P(J),, ) the power set of a (large) finite set J = {1,..., r} of r items, P(J) being identified to {0, 1} r. Suppose that the distribution of the r.v. X = (X (1),...,X (j),..., X (r) ) is a finite mixture of products of Bernoulli s r j=1 B(p U,j), where the r.v. U {1,...,K} is a latent class variable and the weight vector q = (q 1,..., q K ) of the mixture has a Dirichlet distribution D(γ 1,..., γ K ). This precisely means that we have the following hierarchical mixture model (HMM): X U=u,q K q c r j=1 B(p u,j ), (14) u=1 P(U = u q) = q u, (15) q D(γ 1,...,γ K ). (16) The following generalizes some results in Lhote et al. (2005) and Emilion et al. (2005):
10 10 Emilion Proposition 4. For the HMM defined by equations (14), (15), (16) E(#C n ) = and K u=1 γ u γ n i=0 B P(J) ( ) n (1 p u,j ) n i p i u,j i j B j B lim E(#C n) = 2 r = #C. n (1 p u,j ) i, For such a model we can similarly compute the mean number of closed frequent itemsets. j / B 6 Maximal frequent itemsets 6.1 Empirical frequent itemsets We return now to the case of a general σ-semilattice whose elements are still called itemsets. Let α (0, 1) be a fixed treshold. An itemset L is said empirically frequent (w.r.t the empirical context X i, i = 1,...n) iff #g n (L) = #{i {1,..., n} : L X i } nα. As #g n (L) = n 1 L Xi, (17) and the X i s are i.i.d., we see that the r.v. 1 L Xi are Bernoulli i.i.d. and the r.v. #g n (L) has a binomial distribution: where Hence i=1 #g n (L) Binom(n, p L ) p L = P(L X). P(L empirical frequent) = P(#g n (L) nα) = k nα ( ) n p k k L(1 p L ) n k. The average number of empirical frequent itemsets is then equal to Proposition 5. L LP(L (n, α) frequent) = L L k nα ( ) n p k k L(1 p L ) n k.
11 Concepts of a Discrete Random Variable Frequent itemsets By the large number law, (17) implies: so that we are lead to the following: #g n (L) lim = P(L X) a.e. n n Definition: L L is an α-frequent itemset iff P(L X) α. A maximal α-frequent itemset is an α-frequent itemset which is maximal (for the order in L) among the α-frequent itemsets. 6.3 Minimal winning coalitions We now propose an algorithm to find out maximal frequent itemsets by using minimal coalitions of P X. Since X is countable, let S X = {L 1,..., L r,...}, and let p r = P(X = L r ) > 0. An α-winning coalition is a subset of {1,...,r,...}, say A, such that p r α. r A A minimal α-winning coalition is an α-winning coalition which is minimal (for the inclusion order) among the α-winning coalitions. Algorithms for finding minimal coalitions were intensively studied in games theory (see e.g. Matsu et. al (2000)). They can be applied to find out maximal frequent itemsets due to the following: Theorem 2. i) If L is a maximal frequent itemset then L = r A L r where A is an α-minimal coalition. ii) Conversely if A is an α-minimal coalition then L = r A L r is frequent. It is easy to construct an example where A is an α-minimal coalition but L = r A L r is not maximal.
12 12 Emilion 6.4 Algorithm The above theorem can be applied to the empirical measure (which is an estimator of P X ), from a finite table of observed itemsets such as the one in Subsection 2.3: Find the distinct itemsets L 1,..., L k, and their respective frequency p 1,..., p k Find the α-minimal winning coalitions from p 1,...,p k For each of such a coalition, say A, compute L = r A L r The list of such L contains all the maximal frequent itemsets which can be extracted from this list. Such an algorithm will be of interest if the number r of distinct itemsets is much lower than the total number of observed itemsets. Note that the step where are found minimal winning coalitions should be fast since it does not require any access to the dataset. References AGRAWAL, R., IMIELINSKI, T. and SWAMY, A. (1993): Mining Association Rules Between Sets of Items in Large Databases. In: ACM SIGMOD, Int l Conf. on Managment of Data, AGRAWAL, R. and SRIKANT, R. (1994): Fast Algorithm for Mining Association. In: 20th. Intl. Conf. VLDB, BARBUT, M. and MONJARDET, B. (1970): Ordre et classification. Hachette, Paris. BIRKHOFF, G. (1967): Lattice theory. AMS Colloq. Public. Vol. XXV. BOCK, H. H. and DIDAY, E. (2000): Analysis of Symbolic Data. Springer Verlag, Berlin. CASPARD, N. and MONJARDET, B. (2003): The Lattice of Closure Systems. Disc. Appl. Math. J., 127, DIDAY, E. and EMILION, R. (1997): Maximal and Stochastic Galois Lattices. C. R. Acad. Sci. Paris, 325, I (1), DIDAY, E. and EMILION, R. (2003): Maximal and Stochastic Galois Lattices. Disc. Applied Math. J, 27-2, EMILION, R. and LÉVY, G. (2005): Size of Random Galois Lattices and Number of Frequent Itemsets. LHOTE, L., RIOULT, F. and SOULET, A. (2005): Average Number of Frequent (Closed) Patterns in Bernouilli and Markovian Databases. In: Fifth IEEE International Conference on Data Mining (ICDM 05), Houston, Texas, MATSUI, T. and MATSUI, Y. (2000): A Survey of Algorithms for Calculating Power Indices of Weighted Majority Games. J. Oper. Research Soc. Japan, 43. ORE, O. (1944): Galois Connections. Trans. Amer. Math. Soc. 55, STEWART, J. (1975): Galois Theory. Chapman and Hall, New York. WILLE, R. (1980): Restructuring Lattice Theory, Ordered Sets I. Rival ed., Reider.
Encyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen
Book Title Encyclopedia of Machine Learning Chapter Number 00403 Book CopyRight - Year 2010 Title Frequent Pattern Author Particle Given Name Hannu Family Name Toivonen Suffix Email hannu.toivonen@cs.helsinki.fi
More informationMining Positive and Negative Fuzzy Association Rules
Mining Positive and Negative Fuzzy Association Rules Peng Yan 1, Guoqing Chen 1, Chris Cornelis 2, Martine De Cock 2, and Etienne Kerre 2 1 School of Economics and Management, Tsinghua University, Beijing
More informationOn a quasi-ordering on Boolean functions
Theoretical Computer Science 396 (2008) 71 87 www.elsevier.com/locate/tcs On a quasi-ordering on Boolean functions Miguel Couceiro a,b,, Maurice Pouzet c a Department of Mathematics and Statistics, University
More informationInformation Theory and Statistics Lecture 3: Stationary ergodic processes
Information Theory and Statistics Lecture 3: Stationary ergodic processes Łukasz Dębowski ldebowsk@ipipan.waw.pl Ph. D. Programme 2013/2014 Measurable space Definition (measurable space) Measurable space
More informationcse303 ELEMENTS OF THE THEORY OF COMPUTATION Professor Anita Wasilewska
cse303 ELEMENTS OF THE THEORY OF COMPUTATION Professor Anita Wasilewska LECTURE 1 Course Web Page www3.cs.stonybrook.edu/ cse303 The webpage contains: lectures notes slides; very detailed solutions to
More informationTopology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:
Topology Xiaolong Han Department of Mathematics, California State University, Northridge, CA 91330, USA E-mail address: Xiaolong.Han@csun.edu Remark. You are entitled to a reward of 1 point toward a homework
More informationPolicies Generalization in Reinforcement Learning using Galois Partitions Lattices
Policies Generalization in Reinforcement Learning using Galois Partitions Lattices Marc Ricordeau and Michel Liquière mricorde@wanadoo.fr, liquiere@lirmm.fr Laboratoire d Informatique, de Robotique et
More informationThe dependence graph of a lattice
The depence graph of a lattice Karell Bertet L3I - Université de La Rochelle - av Michel Crépeau - 17042 La Rochelle kbertet@univ-lr.fr Abstract: In this paper, we introduce the depence graph of a lattice
More informationOn the Structure of Rough Approximations
On the Structure of Rough Approximations (Extended Abstract) Jouni Järvinen Turku Centre for Computer Science (TUCS) Lemminkäisenkatu 14 A, FIN-20520 Turku, Finland jjarvine@cs.utu.fi Abstract. We study
More informationAn embedding of ChuCors in L-ChuCors
Proceedings of the 10th International Conference on Computational and Mathematical Methods in Science and Engineering, CMMSE 2010 27 30 June 2010. An embedding of ChuCors in L-ChuCors Ondrej Krídlo 1,
More informationAn Introduction to Formal Concept Analysis
An Introduction to Formal Concept Analysis Mehdi Kaytoue Mehdi Kaytoue mehdi.kaytoue@insa-lyon.fr http://liris.cnrs.fr/mehdi.kaytoue October 29 th 2013 The Knowledge Discovery Process Identified domain(s)
More informationOn the consensus of closure systems
On the consensus of closure systems Bruno LECLERC École des Hautes Études en Sciences Sociales Centre d'analyse et de Mathématique Sociales (UMR 8557) 54 bd Raspail, F-75270 Paris cedex 06, France leclerc@ehess.fr
More informationA Logical Formulation of the Granular Data Model
2008 IEEE International Conference on Data Mining Workshops A Logical Formulation of the Granular Data Model Tuan-Fang Fan Department of Computer Science and Information Engineering National Penghu University
More informationFUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH
FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH M. De Cock C. Cornelis E. E. Kerre Dept. of Applied Mathematics and Computer Science Ghent University, Krijgslaan 281 (S9), B-9000 Gent, Belgium phone: +32
More informationLearning Spaces, and How to Build them
1 Learning Spaces, and How to Build them JEAN-PAUL DOIGNON Université Libre de Bruxelles Talk at ICFCA 2014, Cluj-Napoca Assessment in Computer-aided Education A project initiated in 1984 with JEAN-CLAUDE
More informationOn matrix variate Dirichlet vectors
On matrix variate Dirichlet vectors Konstencia Bobecka Polytechnica, Warsaw, Poland Richard Emilion MAPMO, Université d Orléans, 45100 Orléans, France Jacek Wesolowski Polytechnica, Warsaw, Poland Abstract
More informationUn nouvel algorithme de génération des itemsets fermés fréquents
Un nouvel algorithme de génération des itemsets fermés fréquents Huaiguo Fu CRIL-CNRS FRE2499, Université d Artois - IUT de Lens Rue de l université SP 16, 62307 Lens cedex. France. E-mail: fu@cril.univ-artois.fr
More informationAssociation Rule. Lecturer: Dr. Bo Yuan. LOGO
Association Rule Lecturer: Dr. Bo Yuan LOGO E-mail: yuanb@sz.tsinghua.edu.cn Overview Frequent Itemsets Association Rules Sequential Patterns 2 A Real Example 3 Market-Based Problems Finding associations
More informationUniversal Algebra for Logics
Universal Algebra for Logics Joanna GRYGIEL University of Czestochowa Poland j.grygiel@ajd.czest.pl 2005 These notes form Lecture Notes of a short course which I will give at 1st School on Universal Logic
More informationAutomata on linear orderings
Automata on linear orderings Véronique Bruyère Institut d Informatique Université de Mons-Hainaut Olivier Carton LIAFA Université Paris 7 September 25, 2006 Abstract We consider words indexed by linear
More informationDistributed Mining of Frequent Closed Itemsets: Some Preliminary Results
Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Claudio Lucchese Ca Foscari University of Venice clucches@dsi.unive.it Raffaele Perego ISTI-CNR of Pisa perego@isti.cnr.it Salvatore
More informationHomotopy and homology groups of the n-dimensional Hawaiian earring
F U N D A M E N T A MATHEMATICAE 165 (2000) Homotopy and homology groups of the n-dimensional Hawaiian earring by Katsuya E d a (Tokyo) and Kazuhiro K a w a m u r a (Tsukuba) Abstract. For the n-dimensional
More informationTopology Proceedings. COPYRIGHT c by Topology Proceedings. All rights reserved.
Topology Proceedings Web: http://topology.auburn.edu/tp/ Mail: Topology Proceedings Department of Mathematics & Statistics Auburn University, Alabama 36849, USA E-mail: topolog@auburn.edu ISSN: 0146-4124
More informationANNIHILATOR IDEALS IN ALMOST SEMILATTICE
BULLETIN OF THE INTERNATIONAL MATHEMATICAL VIRTUAL INSTITUTE ISSN (p) 2303-4874, ISSN (o) 2303-4955 www.imvibl.org /JOURNALS / BULLETIN Vol. 7(2017), 339-352 DOI: 10.7251/BIMVI1702339R Former BULLETIN
More informationNotes on Ordered Sets
Notes on Ordered Sets Mariusz Wodzicki September 10, 2013 1 Vocabulary 1.1 Definitions Definition 1.1 A binary relation on a set S is said to be a partial order if it is reflexive, x x, weakly antisymmetric,
More informationLars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany
Syllabus Fri. 21.10. (1) 0. Introduction A. Supervised Learning: Linear Models & Fundamentals Fri. 27.10. (2) A.1 Linear Regression Fri. 3.11. (3) A.2 Linear Classification Fri. 10.11. (4) A.3 Regularization
More informationOn minimal models of the Region Connection Calculus
Fundamenta Informaticae 69 (2006) 1 20 1 IOS Press On minimal models of the Region Connection Calculus Lirong Xia State Key Laboratory of Intelligent Technology and Systems Department of Computer Science
More informationErgodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.
Ergodic Theorems Samy Tindel Purdue University Probability Theory 2 - MA 539 Taken from Probability: Theory and examples by R. Durrett Samy T. Ergodic theorems Probability Theory 1 / 92 Outline 1 Definitions
More informationA spectral gap property for subgroups of finite covolume in Lie groups
A spectral gap property for subgroups of finite covolume in Lie groups Bachir Bekka and Yves Cornulier Dedicated to the memory of Andrzej Hulanicki Abstract Let G be a real Lie group and H a lattice or,
More informationA NOTE ON EXTENSIONS OF PRINCIPALLY QUASI-BAER RINGS. Yuwen Cheng and Feng-Kuo Huang 1. INTRODUCTION
TAIWANESE JOURNAL OF MATHEMATICS Vol. 12, No. 7, pp. 1721-1731, October 2008 This paper is available online at http://www.tjm.nsysu.edu.tw/ A NOTE ON EXTENSIONS OF PRINCIPALLY QUASI-BAER RINGS Yuwen Cheng
More informationIncompleteness Theorems, Large Cardinals, and Automata ov
Incompleteness Theorems, Large Cardinals, and Automata over Finite Words Equipe de Logique Mathématique Institut de Mathématiques de Jussieu - Paris Rive Gauche CNRS and Université Paris 7 TAMC 2017, Berne
More informationUNIVERSALITY OF THE LATTICE OF TRANSFORMATION MONOIDS
UNIVERSALITY OF THE LATTICE OF TRANSFORMATION MONOIDS MICHAEL PINSKER AND SAHARON SHELAH Abstract. The set of all transformation monoids on a fixed set of infinite cardinality λ, equipped with the order
More informationKnowledge spaces from a topological point of view
Knowledge spaces from a topological point of view V.I.Danilov Central Economics and Mathematics Institute of RAS Abstract In this paper we consider the operations of restriction, extension and gluing of
More informationMetamorphosis of Fuzzy Regular Expressions to Fuzzy Automata using the Follow Automata
Metamorphosis of Fuzzy Regular Expressions to Fuzzy Automata using the Follow Automata Rahul Kumar Singh, Ajay Kumar Thapar University Patiala Email: ajayloura@gmail.com Abstract To deal with system uncertainty,
More informationAssignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran
Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran 1. Let X, Y be two itemsets, and let denote the support of itemset X. Then the confidence of the rule X Y,
More informationThe Hopf argument. Yves Coudene. IRMAR, Université Rennes 1, campus beaulieu, bat Rennes cedex, France
The Hopf argument Yves Coudene IRMAR, Université Rennes, campus beaulieu, bat.23 35042 Rennes cedex, France yves.coudene@univ-rennes.fr slightly updated from the published version in Journal of Modern
More informationCUTS OF LINEAR ORDERS
CUTS OF LINEAR ORDERS ASHER M. KACH AND ANTONIO MONTALBÁN Abstract. We study the connection between the number of ascending and descending cuts of a linear order, its classical size, and its effective
More informationClosure operators on sets and algebraic lattices
Closure operators on sets and algebraic lattices Sergiu Rudeanu University of Bucharest Romania Closure operators are abundant in mathematics; here are a few examples. Given an algebraic structure, such
More information2.1 Elementary probability; random sampling
Chapter 2 Probability Theory Chapter 2 outlines the probability theory necessary to understand this text. It is meant as a refresher for students who need review and as a reference for concepts and theorems
More informationFuzzy histograms and density estimation
Fuzzy histograms and density estimation Kevin LOQUIN 1 and Olivier STRAUSS LIRMM - 161 rue Ada - 3439 Montpellier cedex 5 - France 1 Kevin.Loquin@lirmm.fr Olivier.Strauss@lirmm.fr The probability density
More informationData Compression. Limit of Information Compression. October, Examples of codes 1
Data Compression Limit of Information Compression Radu Trîmbiţaş October, 202 Outline Contents Eamples of codes 2 Kraft Inequality 4 2. Kraft Inequality............................ 4 2.2 Kraft inequality
More informationIDEMPOTENT n-permutable VARIETIES
IDEMPOTENT n-permutable VARIETIES M. VALERIOTE AND R. WILLARD Abstract. One of the important classes of varieties identified in tame congruence theory is the class of varieties which are n-permutable for
More information1. Propositional Calculus
1. Propositional Calculus Some notes for Math 601, Fall 2010 based on Elliott Mendelson, Introduction to Mathematical Logic, Fifth edition, 2010, Chapman & Hall. 2. Syntax ( grammar ). 1.1, p. 1. Given:
More informationFields and Galois Theory. Below are some results dealing with fields, up to and including the fundamental theorem of Galois theory.
Fields and Galois Theory Below are some results dealing with fields, up to and including the fundamental theorem of Galois theory. This should be a reasonably logical ordering, so that a result here should
More informationSchool of Mathematics and Statistics. MT5836 Galois Theory. Handout 0: Course Information
MRQ 2017 School of Mathematics and Statistics MT5836 Galois Theory Handout 0: Course Information Lecturer: Martyn Quick, Room 326. Prerequisite: MT3505 (or MT4517) Rings & Fields Lectures: Tutorials: Mon
More informationHierarchy among Automata on Linear Orderings
Hierarchy among Automata on Linear Orderings Véronique Bruyère Institut d Informatique Université de Mons-Hainaut Olivier Carton LIAFA Université Paris 7 Abstract In a preceding paper, automata and rational
More informationTHE LATTICE OF SUBVARIETIES OF SEMILATTICE ORDERED ALGEBRAS
THE LATTICE OF SUBVARIETIES OF SEMILATTICE ORDERED ALGEBRAS A. PILITOWSKA 1 AND A. ZAMOJSKA-DZIENIO 2 Abstract. This paper is devoted to the semilattice ordered V-algebras of the form (A, Ω, +), where
More informationInduction, sequences, limits and continuity
Induction, sequences, limits and continuity Material covered: eclass notes on induction, Chapter 11, Section 1 and Chapter 2, Sections 2.2-2.5 Induction Principle of mathematical induction: Let P(n) be
More informationClC (X ) : X ω X } C. (11)
With each closed-set system we associate a closure operation. Definition 1.20. Let A, C be a closed-set system. Define Cl C : : P(A) P(A) as follows. For every X A, Cl C (X) = { C C : X C }. Cl C (X) is
More informationAbstract. We characterize Ramsey theoretically two classes of spaces which are related to γ-sets.
2 Supported by MNZŽS RS 125 MATEMATIQKI VESNIK 58 (2006), 125 129 UDK 515.122 originalni nauqni rad research paper SPACES RELATED TO γ-sets Filippo Cammaroto 1 and Ljubiša D.R. Kočinac 2 Abstract. We characterize
More informationEquational Logic. Chapter Syntax Terms and Term Algebras
Chapter 2 Equational Logic 2.1 Syntax 2.1.1 Terms and Term Algebras The natural logic of algebra is equational logic, whose propositions are universally quantified identities between terms built up from
More informationLöwenheim-Skolem Theorems, Countable Approximations, and L ω. David W. Kueker (Lecture Notes, Fall 2007)
Löwenheim-Skolem Theorems, Countable Approximations, and L ω 0. Introduction David W. Kueker (Lecture Notes, Fall 2007) In its simplest form the Löwenheim-Skolem Theorem for L ω1 ω states that if σ L ω1
More informationDefinition: Let S and T be sets. A binary relation on SxT is any subset of SxT. A binary relation on S is any subset of SxS.
4 Functions Before studying functions we will first quickly define a more general idea, namely the notion of a relation. A function turns out to be a special type of relation. Definition: Let S and T be
More informationDUAL BCK-ALGEBRA AND MV-ALGEBRA. Kyung Ho Kim and Yong Ho Yon. Received March 23, 2007
Scientiae Mathematicae Japonicae Online, e-2007, 393 399 393 DUAL BCK-ALGEBRA AND MV-ALGEBRA Kyung Ho Kim and Yong Ho Yon Received March 23, 2007 Abstract. The aim of this paper is to study the properties
More informationFuzzy M-solid subvarieties
International Journal of Algebra, Vol. 5, 2011, no. 24, 1195-1205 Fuzzy M-Solid Subvarieties Bundit Pibaljommee Department of Mathematics, Faculty of Science Khon kaen University, Khon kaen 40002, Thailand
More informationLifting Smooth Homotopies of Orbit Spaces of Proper Lie Group Actions
Journal of Lie Theory Volume 15 (2005) 447 456 c 2005 Heldermann Verlag Lifting Smooth Homotopies of Orbit Spaces of Proper Lie Group Actions Marja Kankaanrinta Communicated by J. D. Lawson Abstract. By
More informationMachine Learning: Pattern Mining
Machine Learning: Pattern Mining Information Systems and Machine Learning Lab (ISMLL) University of Hildesheim Wintersemester 2007 / 2008 Pattern Mining Overview Itemsets Task Naive Algorithm Apriori Algorithm
More informationMath 203, Solution Set 4.
Math 203, Solution Set 4. Problem 1. Let V be a finite dimensional vector space and let ω Λ 2 V be such that ω ω = 0. Show that ω = v w for some vectors v, w V. Answer: It is clear that if ω = v w then
More informationDYNAMICAL PROPERTIES OF MONOTONE DENDRITE MAPS
DYNAMICAL PROPERTIES OF MONOTONE DENDRITE MAPS Issam Naghmouchi To cite this version: Issam Naghmouchi. DYNAMICAL PROPERTIES OF MONOTONE DENDRITE MAPS. 2010. HAL Id: hal-00593321 https://hal.archives-ouvertes.fr/hal-00593321v2
More informationRecursive definitions on surreal numbers
Recursive definitions on surreal numbers Antongiulio Fornasiero 19th July 2005 Abstract Let No be Conway s class of surreal numbers. I will make explicit the notion of a function f on No recursively defined
More informationThe Game of Normal Numbers
The Game of Normal Numbers Ehud Lehrer September 4, 2003 Abstract We introduce a two-player game where at each period one player, say, Player 2, chooses a distribution and the other player, Player 1, a
More informationRotation set for maps of degree 1 on sun graphs. Sylvie Ruette. January 6, 2019
Rotation set for maps of degree 1 on sun graphs Sylvie Ruette January 6, 2019 Abstract For a continuous map on a topological graph containing a unique loop S, it is possible to define the degree and, for
More informationProbability Theory Review
Cogsci 118A: Natural Computation I Lecture 2 (01/07/10) Lecturer: Angela Yu Probability Theory Review Scribe: Joseph Schilz Lecture Summary 1. Set theory: terms and operators In this section, we provide
More informationFrequent Itemset Mining
ì 1 Frequent Itemset Mining Nadjib LAZAAR LIRMM- UM COCONUT Team (PART I) IMAGINA 17/18 Webpage: http://www.lirmm.fr/~lazaar/teaching.html Email: lazaar@lirmm.fr 2 Data Mining ì Data Mining (DM) or Knowledge
More informationMEASURE-THEORETIC ENTROPY
MEASURE-THEORETIC ENTROPY Abstract. We introduce measure-theoretic entropy 1. Some motivation for the formula and the logs We want to define a function I : [0, 1] R which measures how suprised we are or
More informationON THE PRODUCT OF SEPARABLE METRIC SPACES
Georgian Mathematical Journal Volume 8 (2001), Number 4, 785 790 ON THE PRODUCT OF SEPARABLE METRIC SPACES D. KIGHURADZE Abstract. Some properties of the dimension function dim on the class of separable
More information1. Propositional Calculus
1. Propositional Calculus Some notes for Math 601, Fall 2010 based on Elliott Mendelson, Introduction to Mathematical Logic, Fifth edition, 2010, Chapman & Hall. 2. Syntax ( grammar ). 1.1, p. 1. Given:
More informationLattices, closure operators, and Galois connections.
125 Chapter 5. Lattices, closure operators, and Galois connections. 5.1. Semilattices and lattices. Many of the partially ordered sets P we have seen have a further valuable property: that for any two
More informationRandom sets. Distributions, capacities and their applications. Ilya Molchanov. University of Bern, Switzerland
Random sets Distributions, capacities and their applications Ilya Molchanov University of Bern, Switzerland Molchanov Random sets - Lecture 1. Winter School Sandbjerg, Jan 2007 1 E = R d ) Definitions
More informationMINIMAL GENERATING SETS OF GROUPS, RINGS, AND FIELDS
MINIMAL GENERATING SETS OF GROUPS, RINGS, AND FIELDS LORENZ HALBEISEN, MARTIN HAMILTON, AND PAVEL RŮŽIČKA Abstract. A subset X of a group (or a ring, or a field) is called generating, if the smallest subgroup
More informationLecture 4: September Reminder: convergence of sequences
36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 4: September 6 In this lecture we discuss the convergence of random variables. At a high-level, our first few lectures focused
More informationAn Approach to Classification Based on Fuzzy Association Rules
An Approach to Classification Based on Fuzzy Association Rules Zuoliang Chen, Guoqing Chen School of Economics and Management, Tsinghua University, Beijing 100084, P. R. China Abstract Classification based
More informationFuzzy Limits of Functions
Fuzzy Limits of Functions Mark Burgin Department of Mathematics University of California, Los Angeles 405 Hilgard Ave. Los Angeles, CA 90095 Abstract The goal of this work is to introduce and study fuzzy
More informationAbout partial probabilistic information
About partial probabilistic information Alain Chateauneuf, Caroline Ventura To cite this version: Alain Chateauneuf, Caroline Ventura. About partial probabilistic information. Documents de travail du Centre
More informationAPPROXIMATIONS IN H v -MODULES. B. Davvaz 1. INTRODUCTION
TAIWANESE JOURNAL OF MATHEMATICS Vol. 6, No. 4, pp. 499-505, December 2002 This paper is available online at http://www.math.nthu.edu.tw/tjm/ APPROXIMATIONS IN H v -MODULES B. Davvaz Abstract. In this
More informationThe interaction transform for functions on lattices
The interaction transform for functions on lattices Fabien Lange Keleti Faculty of Economics, Budapest Tech, Tavaszmező 5-7, 084 Budapest, Hungary Michel Grabisch Centre d Économie de la Sorbonne, 06-2
More informationAnalysis I. Classroom Notes. H.-D. Alber
Analysis I Classroom Notes H-D Alber Contents 1 Fundamental notions 1 11 Sets 1 12 Product sets, relations 5 13 Composition of statements 7 14 Quantifiers, negation of statements 9 2 Real numbers 11 21
More informationFree products of topological groups
BULL. AUSTRAL. MATH. SOC. MOS 22A05, 20E30, 20EI0 VOL. 4 (1971), 17-29. Free products of topological groups Sidney A. Morris In this note the notion of a free topological product G of a set {G } of topological
More informationReview of Probability. CS1538: Introduction to Simulations
Review of Probability CS1538: Introduction to Simulations Probability and Statistics in Simulation Why do we need probability and statistics in simulation? Needed to validate the simulation model Needed
More informationDATA MINING LECTURE 3. Frequent Itemsets Association Rules
DATA MINING LECTURE 3 Frequent Itemsets Association Rules This is how it all started Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami: Mining Association Rules between Sets of Items in Large Databases.
More informationA GUIDE FOR MORTALS TO TAME CONGRUENCE THEORY
A GUIDE FOR MORTALS TO TAME CONGRUENCE THEORY Tame congruence theory is not an easy subject and it takes a considerable amount of effort to understand it. When I started this project, I believed that this
More informationSlow P -point Ultrafilters
Slow P -point Ultrafilters Renling Jin College of Charleston jinr@cofc.edu Abstract We answer a question of Blass, Di Nasso, and Forti [2, 7] by proving, assuming Continuum Hypothesis or Martin s Axiom,
More informationData Mining: Concepts and Techniques. (3 rd ed.) Chapter 6
Data Mining: Concepts and Techniques (3 rd ed.) Chapter 6 Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University 2013 Han, Kamber & Pei. All rights
More informationA BOREL SOLUTION TO THE HORN-TARSKI PROBLEM. MSC 2000: 03E05, 03E20, 06A10 Keywords: Chain Conditions, Boolean Algebras.
A BOREL SOLUTION TO THE HORN-TARSKI PROBLEM STEVO TODORCEVIC Abstract. We describe a Borel poset satisfying the σ-finite chain condition but failing to satisfy the σ-bounded chain condition. MSC 2000:
More informationDynamic Programming Approach for Construction of Association Rule Systems
Dynamic Programming Approach for Construction of Association Rule Systems Fawaz Alsolami 1, Talha Amin 1, Igor Chikalov 1, Mikhail Moshkov 1, and Beata Zielosko 2 1 Computer, Electrical and Mathematical
More informationFree-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries
Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries To appear in Data Mining and Knowledge Discovery, an International Journal c Kluwer Academic Publishers
More informationThe category of linear modular lattices
Bull. Math. Soc. Sci. Math. Roumanie Tome 56(104) No. 1, 2013, 33 46 The category of linear modular lattices by Toma Albu and Mihai Iosif Dedicated to the memory of Nicolae Popescu (1937-2010) on the occasion
More informationCharacterization of Semantics for Argument Systems
Characterization of Semantics for Argument Systems Philippe Besnard and Sylvie Doutre IRIT Université Paul Sabatier 118, route de Narbonne 31062 Toulouse Cedex 4 France besnard, doutre}@irit.fr Abstract
More informationFree-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries
Data Mining and Knowledge Discovery, 7, 5 22, 2003 c 2003 Kluwer Academic Publishers. Manufactured in The Netherlands. Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency
More informationWeek 12-13: Discrete Probability
Week 12-13: Discrete Probability November 21, 2018 1 Probability Space There are many problems about chances or possibilities, called probability in mathematics. When we roll two dice there are possible
More informationSOME MEASURABILITY AND CONTINUITY PROPERTIES OF ARBITRARY REAL FUNCTIONS
LE MATEMATICHE Vol. LVII (2002) Fasc. I, pp. 6382 SOME MEASURABILITY AND CONTINUITY PROPERTIES OF ARBITRARY REAL FUNCTIONS VITTORINO PATA - ALFONSO VILLANI Given an arbitrary real function f, the set D
More informationContinuity of partially ordered soft sets via soft Scott topology and soft sobrification A. F. Sayed
Bulletin of Mathematical Sciences and Applications Online: 2014-08-04 ISSN: 2278-9634, Vol. 9, pp 79-88 doi:10.18052/www.scipress.com/bmsa.9.79 2014 SciPress Ltd., Switzerland Continuity of partially ordered
More informationFrom Meaningful Orderings in the Web of Data to Multi-level Pattern Structures
From Meaningful Orderings in the Web of Data to Multi-level Pattern Structures Quentin Brabant, Miguel Couceiro, Amedeo Napoli, Justine Reynaud To cite this version: Quentin Brabant, Miguel Couceiro, Amedeo
More informationCOUNTABLY COMPLEMENTABLE LINEAR ORDERINGS
COUNTABLY COMPLEMENTABLE LINEAR ORDERINGS July 4, 2006 ANTONIO MONTALBÁN Abstract. We say that a countable linear ordering L is countably complementable if there exists a linear ordering L, possibly uncountable,
More informationPositive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise
Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Guimei Liu 1,2 Jinyan Li 1 Limsoon Wong 2 Wynne Hsu 2 1 Institute for Infocomm Research, Singapore 2 School
More informationFinite pseudocomplemented lattices: The spectra and the Glivenko congruence
Finite pseudocomplemented lattices: The spectra and the Glivenko congruence T. Katriňák and J. Guričan Abstract. Recently, Grätzer, Gunderson and Quackenbush have characterized the spectra of finite pseudocomplemented
More informationMaximal and stochastic Galois lattices
Discrete Applied Mathematics 127 (2003) 271 284 www.elsevier.com/locate/dam Maximal and stochastic Galois lattices EdwinDiday a;, Richard Emilion b a CEREMADE, Universite Paris-Dauphine, 75016 Paris, France
More informationDisjointness conditions in free products of. distributive lattices: An application of Ramsay's theorem. Harry Lakser< 1)
Proc. Univ. of Houston Lattice Theory Conf..Houston 1973 Disjointness conditions in free products of distributive lattices: An application of Ramsay's theorem. Harry Lakser< 1) 1. Introduction. Let L be
More informationGeneral Principles in Random Variates Generation
General Principles in Random Variates Generation E. Moulines and G. Fort Telecom ParisTech June 2015 Bibliography : Luc Devroye, Non-Uniform Random Variate Generator, Springer-Verlag (1986) available on
More informationGuaranteeing the Accuracy of Association Rules by Statistical Significance
Guaranteeing the Accuracy of Association Rules by Statistical Significance W. Hämäläinen Department of Computer Science, University of Helsinki, Finland Abstract. Association rules are a popular knowledge
More information