Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules. M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D.
|
|
- Roy Allen
- 5 years ago
- Views:
Transcription
1 Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D. Sánchez 18th September 2013
2 Detecting Anom and Exc Behaviour on Credit Data by means of AR 2/20 Motivation Association Rules allow to identify novel, useful and comprehensive knowledge from databases. They symbolize the presence of a set of items together in most of the transactions. Previous approaches using data mining techniques for fraud detection try to discover the usual profiles of legitimate customer behaviour and then search the anomalies using different methodologies such us clustering. New approaches have been developed based on obtaining different kinds of knowledge: peculiar rules, infrequent rules, exception rules, anomalous rules... These rules have several advantages: They provide a comprehensive understanding of a type of information. In general, they are less numerous.
3 Detecting Anom and Exc Behaviour on Credit Data by means of AR 3/20 Objective To automatically detect exceptional or anomalous behaviour that could help for fraud detection, obtaining the common customer behaviour as well as some indicators (exceptions) that happen when the behaviour deviates from an usual one and the anomalous deviations (anomalies).
4 Detecting Anom and Exc Behaviour on Credit Data by means of AR 4/20 Overview 1. Brief Introduction to Association Rules 2. Exception Rules Our Proposal for Mining Exception Rules 3. Anomalous Rules Our Proposal for Mining Anomalous Rules 4. Algorithm and Implementation Issues 5. Experimental Evaluation 6. Conclusions and Future Research 7. References
5 Detecting Anom and Exc Behaviour on Credit Data by means of AR 5/20 Brief Introduction to Association Rules Data is usually stored in datasets D composed by transactions t i (rows) and attributes (columns). We call item to a pair attribute, value or attribute, interval. D i 1 i 2... i j i j+1... i m t t t n Association Rules are expressions of the form A B where A, B are non-empty set of items with no intersection. An association rule represents a relation between the conjoint occurrence of A and B.
6 Detecting Anom and Exc Behaviour on Credit Data by means of AR 6/20 Brief Introduction to Association Rules The support of an itemset A is defined as probability that a transaction contains the item supp(a) = t D : A t D For assessing the ARs validity, the most common measures are support (joint probability P (A B)) and conf idence (conditional probability P (B A) Supp(A B) = supp(a B) supp(a B) ; Conf(A B) = D supp (A) that must be minsupp and minconf resp. (thresholds imposed by the user), that is, the rule is frequent and confident.
7 Detecting Anom and Exc Behaviour on Credit Data by means of AR 7/20 Brief Introduction to Association Rules An alternative framework is to measure the accuracy by means of the certainty factor, CF (A B) Conf(A B) supp(b) if Conf(A B) > supp(b) 1 supp(b) Conf(A B) supp(b) if Conf(A B) < supp(b) supp(b) 0 otherwise. CF measures how our belief that B is in a transaction changes when we are told that A is in that transaction. Certainty factor has better properties than confidence and other quality measures, in particular, it helps to reduce the number of rules obtained by filtering those rules corresponding to statistical independence or negative dependence. When CF (A B) mincf the rule is called certain.
8 Detecting Anom and Exc Behaviour on Credit Data by means of AR 8/20 Exception Rules Idea: An attribute interacting with another may change the consequent of an association rule [Suzuki et al., 1996]. Interpretation: Example: X strongly implies the fulfilment of Y, but, there exists E such that X E implies Y. IF the patient takes antibiotics, THEN it }{{}} tends {{ to recover }, X Y UNLESS staphylococcus appears, }{{} E This example shows how the presence of E changes the usual behaviour of rule X Y.
9 Detecting Anom and Exc Behaviour on Credit Data by means of AR 9/20 Our Proposal for Mining Exception Rules Formally, let D X = {t D : X t}. An exception rule is a pair (csr, exc) satisfying: X Y is frequent and certain in D. E Y is certain in D X. where the certainty factor is used instead of the confidence. Advantages: The quantity of rule pairs (csr, exc) is reduced. Using CF instead of Conf more reliable rules are obtained.
10 Detecting Anom and Exc Behaviour on Credit Data by means of AR 10/20 Anomalous Rules Idea: Anomalous Rules are rules that come to the surface when the dominant effect produced by a strong rule (csr) is removed [Berzal et al., 2004] Interpretation: When X, then we have either Y (usually) or A (unusually) This is captured by the set of rules: X strongly implies Y, but in those cases where X implies Y, then X confidently implies A Example: IF a patient have symptoms X, THEN he has the disease Y, IF NOT, he has the disease A, This example shows that the anomalous rule try to capture what is the deviation (A) from the usual behaviour(x Y ).
11 Detecting Anom and Exc Behaviour on Credit Data by means of AR 11/20 Our Proposal for Mining Anomalous Rules Formally, let D X = {t D : X t}. An anomalous rule is a triple (csr, anom, ref) satisfying: X Y (csr) is frequent and certain in D. Y A (anom) is certain in D X. A Y (ref) is certain in D X. Advantages: The quantity of rule triples (csr, anom, ref) is reduced. (More restrictive approach than that of Berzal et al.) Using CF instead of Conf more reliable rules are obtained.
12 Detecting Anom and Exc Behaviour on Credit Data by means of AR 12/20 Algorithm and Implementation Issues ERSA and ARSA (Exception/Anomalous Rule Search Algorithm) are able to mine together the set of common sense rules their associated exceptions They are based on the Apriori Algorithm using a bit-string representation of items which speeds up the logical operations (, ) Complexity: It depends on D = n, the number of items (i), and the number of csr obtained in the first part of the algorithm (r) O(nri2 i ).
13 Detecting Anom and Exc Behaviour on Credit Data by means of AR 13/20 ERSA algorithm Input: Transactional database, minsupp, minconf or mincf Output: Set of association rules with their associated exception rules. 1. Database Preprocessing 1.1 Transformation of the trans. database into a boolean one. 1.2 Database storage into a vector of BitSets. 2. Mining Process 2.1 Mining Common Sense Rules Searching the set of candidates (frequent itemsets) for the csr. Storing the indexes of BitSet vectors and support of candidates. csr extraction exceeding minsupp and minconf/mincf Mining Exception Rules For every csr X Y we compute the possible exceptions: For each item E I (except those in the csr) Compute X E Y and its support. Compute X E and its support. Compute supp X ( Y ). If CF X (E Y ) mincf then this is an exc.
14 Detecting Anom and Exc Behaviour on Credit Data by means of AR 14/20 Experimental Evaluation Database: German-statLog about credit bank data from the UCI Machine Learning repository transactions 21 attributes: 18 categorical or numerical, 3 continuous (categorized into meaningful intervals) 1.73GHz Intel Core 2Duo notebook with 1024MB of main memory running Windows 7 using Java. The maximum number of items in the antecedent or the consequent of the csr is limited to 3 in order to obtain more manageable rules.
15 Detecting Anom and Exc Behaviour on Credit Data by means of AR 15/20 Experimental Evaluation Number of csr, exc and anom rules found for different thresholds in German-statlog database. minsupp mincf = 0.8 mincf = 0.9 mincf = 0.95 csr exc anom csr exc anom csr exc anom
16 Detecting Anom and Exc Behaviour on Credit Data by means of AR 16/20 Experimental Evaluation Time in seconds for mining exception and anomalous rules for different thresholds in German-statlog database. minsupp mincf = 0.8 mincf = 0.9 mincf = 0.95 ERSA ARSA ERSA ARSA ERSA ARSA
17 Detecting Anom and Exc Behaviour on Credit Data by means of AR 17/20 Experimental Evaluation Some of the obtained rules are: IF present employment since 7 years AN D status & sex = single male T HEN people being liable to provide maintenance for = 1 (Supp = & CF = 0.879) EXCEP T when Purpose = business (CF = 1). IF property = real estate AN D number of existing credits on this bank = 1 T HEN age is in between 18 and 25 (Supp = & CF = 0.972) OR property = car (unusually with CF 1 = 1, CF 2 = 1).
18 Detecting Anom and Exc Behaviour on Credit Data by means of AR 18/20 Conclusions and Future Research We have given new proposals for mining exception and anomalous rules. We provide efficient algorithms for mining these kinds of rules. The implementations have been run in a credit bank database obtaining a manageable set of interesting rules that should be analysed by an expert. Future: Development of new approaches for exception and anomalous rules with uncertain data.
19 References [Suzuki et al., 1996] E. Suzuki and M. Shimura. Exceptional knowledge discovery in databases based on information theory. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pages 275?278. AAAI Press, [Berzal et al., 2004] F. Berzal, J.C. Cubero, N. Marín, and M. Gámez. Anomalous association rules. In IEEE ICDM Workshop Alternative Techniques for Data Mining and Knowledge Discovery, [Delgado et al., 2011] M. Delgado, M.D. Ruiz, and D. Sánchez. New Approaches for Discovering Exception and Anomalous Rules. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, Vol. 19, No. 2 pp , Detecting Anom and Exc Behaviour on Credit Data by means of AR 19/20
20 Thank you. Any questions? Detecting Anom and Exc Behaviour on Credit Data by means of AR 20/20
Anomalous Association Rules
Anomalous Association Rules Fernando Berzal, Juan-Carlos Cubero, Nicolás Marín Department of Computer Science and AI University of Granada Granada 18071 Spain {fberzal jc.cubero nicm}@decsai.ugr.es Matías
More informationMining Exceptional Relationships with Grammar-Guided Genetic Programming
Knowledge and Information Systems manuscript No. (will be inserted by the editor) Mining Exceptional Relationships with Grammar-Guided Genetic Programming J. M. Luna M. Pechenizkiy S. Ventura Received:
More informationMining Positive and Negative Fuzzy Association Rules
Mining Positive and Negative Fuzzy Association Rules Peng Yan 1, Guoqing Chen 1, Chris Cornelis 2, Martine De Cock 2, and Etienne Kerre 2 1 School of Economics and Management, Tsinghua University, Beijing
More informationFUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH
FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH M. De Cock C. Cornelis E. E. Kerre Dept. of Applied Mathematics and Computer Science Ghent University, Krijgslaan 281 (S9), B-9000 Gent, Belgium phone: +32
More informationAlternative Approach to Mining Association Rules
Alternative Approach to Mining Association Rules Jan Rauch 1, Milan Šimůnek 1 2 1 Faculty of Informatics and Statistics, University of Economics Prague, Czech Republic 2 Institute of Computer Sciences,
More informationRemoving trivial associations in association rule discovery
Removing trivial associations in association rule discovery Geoffrey I. Webb and Songmao Zhang School of Computing and Mathematics, Deakin University Geelong, Victoria 3217, Australia Abstract Association
More informationCS4445 Data Mining and Knowledge Discovery in Databases. B Term 2014 Solutions Exam 2 - December 15, 2014
CS4445 Data Mining and Knowledge Discovery in Databases. B Term 2014 Solutions Exam 2 - December 15, 2014 Prof. Carolina Ruiz Department of Computer Science Worcester Polytechnic Institute NAME: Prof.
More informationFrequent Itemsets and Association Rule Mining. Vinay Setty Slides credit:
Frequent Itemsets and Association Rule Mining Vinay Setty vinay.j.setty@uis.no Slides credit: http://www.mmds.org/ Association Rule Discovery Supermarket shelf management Market-basket model: Goal: Identify
More informationA Methodology for Direct and Indirect Discrimination Prevention in Data Mining
A Methodology for Direct and Indirect Discrimination Prevention in Data Mining Sara Hajian and Josep Domingo-Ferrer IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013 Presented by Polina Rozenshtein
More informationCS5112: Algorithms and Data Structures for Applications
CS5112: Algorithms and Data Structures for Applications Lecture 19: Association rules Ramin Zabih Some content from: Wikipedia/Google image search; Harrington; J. Leskovec, A. Rajaraman, J. Ullman: Mining
More informationMining Molecular Fragments: Finding Relevant Substructures of Molecules
Mining Molecular Fragments: Finding Relevant Substructures of Molecules Christian Borgelt, Michael R. Berthold Proc. IEEE International Conference on Data Mining, 2002. ICDM 2002. Lecturers: Carlo Cagli
More informationApriori algorithm. Seminar of Popular Algorithms in Data Mining and Machine Learning, TKK. Presentation Lauri Lahti
Apriori algorithm Seminar of Popular Algorithms in Data Mining and Machine Learning, TKK Presentation 12.3.2008 Lauri Lahti Association rules Techniques for data mining and knowledge discovery in databases
More informationOutline. Fast Algorithms for Mining Association Rules. Applications of Data Mining. Data Mining. Association Rule. Discussion
Outline Fast Algorithms for Mining Association Rules Rakesh Agrawal Ramakrishnan Srikant Introduction Algorithm Apriori Algorithm AprioriTid Comparison of Algorithms Conclusion Presenter: Dan Li Discussion:
More informationAn Overview of Alternative Rule Evaluation Criteria and Their Use in Separate-and-Conquer Classifiers
An Overview of Alternative Rule Evaluation Criteria and Their Use in Separate-and-Conquer Classifiers Fernando Berzal, Juan-Carlos Cubero, Nicolás Marín, and José-Luis Polo Department of Computer Science
More informationHandling a Concept Hierarchy
Food Electronics Handling a Concept Hierarchy Bread Milk Computers Home Wheat White Skim 2% Desktop Laptop Accessory TV DVD Foremost Kemps Printer Scanner Data Mining: Association Rules 5 Why should we
More informationData Analytics Beyond OLAP. Prof. Yanlei Diao
Data Analytics Beyond OLAP Prof. Yanlei Diao OPERATIONAL DBs DB 1 DB 2 DB 3 EXTRACT TRANSFORM LOAD (ETL) METADATA STORE DATA WAREHOUSE SUPPORTS OLAP DATA MINING INTERACTIVE DATA EXPLORATION Overview of
More informationMachine Learning: Pattern Mining
Machine Learning: Pattern Mining Information Systems and Machine Learning Lab (ISMLL) University of Hildesheim Wintersemester 2007 / 2008 Pattern Mining Overview Itemsets Task Naive Algorithm Apriori Algorithm
More informationCorrelation Preserving Unsupervised Discretization. Outline
Correlation Preserving Unsupervised Discretization Jee Vang Outline Paper References What is discretization? Motivation Principal Component Analysis (PCA) Association Mining Correlation Preserving Discretization
More informationData-Driven Logical Reasoning
Data-Driven Logical Reasoning Claudia d Amato Volha Bryl, Luciano Serafini November 11, 2012 8 th International Workshop on Uncertainty Reasoning for the Semantic Web 11 th ISWC, Boston (MA), USA. Heterogeneous
More informationAssociation Rule Mining on Web
Association Rule Mining on Web What Is Association Rule Mining? Association rule mining: Finding interesting relationships among items (or objects, events) in a given data set. Example: Basket data analysis
More information732A61/TDDD41 Data Mining - Clustering and Association Analysis
732A61/TDDD41 Data Mining - Clustering and Association Analysis Lecture 6: Association Analysis I Jose M. Peña IDA, Linköping University, Sweden 1/14 Outline Content Association Rules Frequent Itemsets
More informationAssociation Analysis. Part 1
Association Analysis Part 1 1 Market-basket analysis DATA: A large set of items: e.g., products sold in a supermarket A large set of baskets: e.g., each basket represents what a customer bought in one
More informationMining Rank Data. Sascha Henzgen and Eyke Hüllermeier. Department of Computer Science University of Paderborn, Germany
Mining Rank Data Sascha Henzgen and Eyke Hüllermeier Department of Computer Science University of Paderborn, Germany {sascha.henzgen,eyke}@upb.de Abstract. This paper addresses the problem of mining rank
More informationC Ahmed Samet 1,2, Eric Lefèvre 2, and Sadok Ben Yahia 1 1 Laboratory of research in Programming, Algorithmic and Heuristic Faculty of Science of Tunis, Tunisia {ahmed.samet, sadok.benyahia}@fst.rnu.tn
More informationDistributed Mining of Frequent Closed Itemsets: Some Preliminary Results
Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Claudio Lucchese Ca Foscari University of Venice clucches@dsi.unive.it Raffaele Perego ISTI-CNR of Pisa perego@isti.cnr.it Salvatore
More informationFormal Concept Analysis
Formal Concept Analysis 2 Closure Systems and Implications 4 Closure Systems Concept intents as closed sets c e b 2 a 1 3 1 2 3 a b c e 20.06.2005 2 Next-Closure was developed by B. Ganter (1984). Itcanbeused
More informationDiscovering Non-Redundant Association Rules using MinMax Approximation Rules
Discovering Non-Redundant Association Rules using MinMax Approximation Rules R. Vijaya Prakash Department Of Informatics Kakatiya University, Warangal, India vijprak@hotmail.com Dr.A. Govardhan Department.
More informationAssignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran
Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran 1. Let X, Y be two itemsets, and let denote the support of itemset X. Then the confidence of the rule X Y,
More informationAn Approach to Classification Based on Fuzzy Association Rules
An Approach to Classification Based on Fuzzy Association Rules Zuoliang Chen, Guoqing Chen School of Economics and Management, Tsinghua University, Beijing 100084, P. R. China Abstract Classification based
More informationCHAPTER 2: DATA MINING - A MODERN TOOL FOR ANALYSIS. Due to elements of uncertainty many problems in this world appear to be
11 CHAPTER 2: DATA MINING - A MODERN TOOL FOR ANALYSIS Due to elements of uncertainty many problems in this world appear to be complex. The uncertainty may be either in parameters defining the problem
More information.. Cal Poly CSC 466: Knowledge Discovery from Data Alexander Dekhtyar..
.. Cal Poly CSC 4: Knowledge Discovery from Data Alexander Dekhtyar.. Data Mining: Mining Association Rules Examples Course Enrollments Itemset. I = { CSC3, CSC3, CSC40, CSC40, CSC4, CSC44, CSC4, CSC44,
More informationPositive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise
Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Guimei Liu 1,2 Jinyan Li 1 Limsoon Wong 2 Wynne Hsu 2 1 Institute for Infocomm Research, Singapore 2 School
More informationData Mining. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of Computer Science
Data Mining Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Computer Science 2016 2017 Road map The Apriori algorithm Step 1: Mining all frequent
More informationMining Strong Positive and Negative Sequential Patterns
Mining Strong Positive and Negative Sequential Patter NANCY P. LIN, HUNG-JEN CHEN, WEI-HUA HAO, HAO-EN CHUEH, CHUNG-I CHANG Department of Computer Science and Information Engineering Tamang University,
More informationData Mining: Concepts and Techniques. (3 rd ed.) Chapter 6
Data Mining: Concepts and Techniques (3 rd ed.) Chapter 6 Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University 2013 Han, Kamber & Pei. All rights
More informationReductionist View: A Priori Algorithm and Vector-Space Text Retrieval. Sargur Srihari University at Buffalo The State University of New York
Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval Sargur Srihari University at Buffalo The State University of New York 1 A Priori Algorithm for Association Rule Learning Association
More informationCSE 5243 INTRO. TO DATA MINING
CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University Slides adapted from Prof. Jiawei Han @UIUC, Prof. Srinivasan
More informationDATA MINING LECTURE 3. Frequent Itemsets Association Rules
DATA MINING LECTURE 3 Frequent Itemsets Association Rules This is how it all started Rakesh Agrawal, Tomasz Imielinski, Arun N. Swami: Mining Association Rules between Sets of Items in Large Databases.
More informationFrequent Pattern Mining: Exercises
Frequent Pattern Mining: Exercises Christian Borgelt School of Computer Science tto-von-guericke-university of Magdeburg Universitätsplatz 2, 39106 Magdeburg, Germany christian@borgelt.net http://www.borgelt.net/
More informationMining Class-Dependent Rules Using the Concept of Generalization/Specialization Hierarchies
Mining Class-Dependent Rules Using the Concept of Generalization/Specialization Hierarchies Juliano Brito da Justa Neves 1 Marina Teresa Pires Vieira {juliano,marina}@dc.ufscar.br Computer Science Department
More informationStatistical Privacy For Privacy Preserving Information Sharing
Statistical Privacy For Privacy Preserving Information Sharing Johannes Gehrke Cornell University http://www.cs.cornell.edu/johannes Joint work with: Alexandre Evfimievski, Ramakrishnan Srikant, Rakesh
More informationMeelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 05
Meelis Kull meelis.kull@ut.ee Autumn 2017 1 Sample vs population Example task with red and black cards Statistical terminology Permutation test and hypergeometric test Histogram on a sample vs population
More informationCOMP 5331: Knowledge Discovery and Data Mining
COMP 5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified by Dr. Lei Chen based on the slides provided by Tan, Steinbach, Kumar And Jiawei Han, Micheline Kamber, and Jian Pei 1 10
More informationThe Market-Basket Model. Association Rules. Example. Support. Applications --- (1) Applications --- (2)
The Market-Basket Model Association Rules Market Baskets Frequent sets A-priori Algorithm A large set of items, e.g., things sold in a supermarket. A large set of baskets, each of which is a small set
More informationMining Infrequent Patter ns
Mining Infrequent Patter ns JOHAN BJARNLE (JOHBJ551) PETER ZHU (PETZH912) LINKÖPING UNIVERSITY, 2009 TNM033 DATA MINING Contents 1 Introduction... 2 2 Techniques... 3 2.1 Negative Patterns... 3 2.2 Negative
More informationCSE 5243 INTRO. TO DATA MINING
CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University 10/17/2017 Slides adapted from Prof. Jiawei Han @UIUC, Prof.
More informationFUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH
FUZZY ASSOCIATION RULES: A TWO-SIDED APPROACH M. De Cock C. Cornelis E. E. Kerre Dept. of Applied Mathematics and Computer Science Ghent University, Krijgslaan 281 (S9), B-9000 Gent, Belgium phone: +32
More informationProcessing Count Queries over Event Streams at Multiple Time Granularities
Processing Count Queries over Event Streams at Multiple Time Granularities Aykut Ünal, Yücel Saygın, Özgür Ulusoy Department of Computer Engineering, Bilkent University, Ankara, Turkey. Faculty of Engineering
More informationIntroduction to Data Mining
Introduction to Data Mining Lecture #12: Frequent Itemsets Seoul National University 1 In This Lecture Motivation of association rule mining Important concepts of association rules Naïve approaches for
More informationRegression and Correlation Analysis of Different Interestingness Measures for Mining Association Rules
International Journal of Innovative Research in Computer Scien & Technology (IJIRCST) Regression and Correlation Analysis of Different Interestingness Measures for Mining Association Rules Mir Md Jahangir
More informationChapter 6. Frequent Pattern Mining: Concepts and Apriori. Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining
Chapter 6. Frequent Pattern Mining: Concepts and Apriori Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining Pattern Discovery: Definition What are patterns? Patterns: A set of
More informationQuantitative Association Rule Mining on Weighted Transactional Data
Quantitative Association Rule Mining on Weighted Transactional Data D. Sujatha and Naveen C. H. Abstract In this paper we have proposed an approach for mining quantitative association rules. The aim of
More informationData Mining and Analysis: Fundamental Concepts and Algorithms
Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 2 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA
More informationOn Minimal Infrequent Itemset Mining
On Minimal Infrequent Itemset Mining David J. Haglin and Anna M. Manning Abstract A new algorithm for minimal infrequent itemset mining is presented. Potential applications of finding infrequent itemsets
More informationGuaranteeing the Accuracy of Association Rules by Statistical Significance
Guaranteeing the Accuracy of Association Rules by Statistical Significance W. Hämäläinen Department of Computer Science, University of Helsinki, Finland Abstract. Association rules are a popular knowledge
More information15 Introduction to Data Mining
15 Introduction to Data Mining 15.1 Introduction to principle methods 15.2 Mining association rule see also: A. Kemper, Chap. 17.4, Kifer et al.: chap 17.7 ff 15.1 Introduction "Discovery of useful, possibly
More informationAlgorithmic Methods of Data Mining, Fall 2005, Course overview 1. Course overview
Algorithmic Methods of Data Mining, Fall 2005, Course overview 1 Course overview lgorithmic Methods of Data Mining, Fall 2005, Course overview 1 T-61.5060 Algorithmic methods of data mining (3 cp) P T-61.5060
More informationAnomaly Detection for the CERN Large Hadron Collider injection magnets
Anomaly Detection for the CERN Large Hadron Collider injection magnets Armin Halilovic KU Leuven - Department of Computer Science In cooperation with CERN 2018-07-27 0 Outline 1 Context 2 Data 3 Preprocessing
More informationAssociation Rule. Lecturer: Dr. Bo Yuan. LOGO
Association Rule Lecturer: Dr. Bo Yuan LOGO E-mail: yuanb@sz.tsinghua.edu.cn Overview Frequent Itemsets Association Rules Sequential Patterns 2 A Real Example 3 Market-Based Problems Finding associations
More informationCPDA Based Fuzzy Association Rules for Learning Achievement Mining
2009 International Conference on Machine Learning and Computing IPCSIT vol.3 (2011) (2011) IACSIT Press, Singapore CPDA Based Fuzzy Association Rules for Learning Achievement Mining Jr-Shian Chen 1, Hung-Lieh
More informationAccelerating Effect of Attribute Variations: Accelerated Gradual Itemsets Extraction
Accelerating Effect of Attribute Variations: Accelerated Gradual Itemsets Extraction Amal Oudni, Marie-Jeanne Lesot, Maria Rifqi To cite this version: Amal Oudni, Marie-Jeanne Lesot, Maria Rifqi. Accelerating
More informationA Posteriori Corrections to Classification Methods.
A Posteriori Corrections to Classification Methods. Włodzisław Duch and Łukasz Itert Department of Informatics, Nicholas Copernicus University, Grudziądzka 5, 87-100 Toruń, Poland; http://www.phys.uni.torun.pl/kmk
More informationD B M G Data Base and Data Mining Group of Politecnico di Torino
Data Base and Data Mining Group of Politecnico di Torino Politecnico di Torino Association rules Objective extraction of frequent correlations or pattern from a transactional database Tickets at a supermarket
More informationUn nouvel algorithme de génération des itemsets fermés fréquents
Un nouvel algorithme de génération des itemsets fermés fréquents Huaiguo Fu CRIL-CNRS FRE2499, Université d Artois - IUT de Lens Rue de l université SP 16, 62307 Lens cedex. France. E-mail: fu@cril.univ-artois.fr
More informationAssociation Rules. Fundamentals
Politecnico di Torino Politecnico di Torino 1 Association rules Objective extraction of frequent correlations or pattern from a transactional database Tickets at a supermarket counter Association rule
More informationA Novel Approach of Multilevel Positive and Negative Association Rule Mining for Spatial Databases
A Novel Approach of Multilevel Positive and Negative Association Rule Mining for Spatial Databases L.K. Sharma 1, O. P. Vyas 1, U. S. Tiwary 2, R. Vyas 1 1 School of Studies in Computer Science Pt. Ravishankar
More informationEncyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen
Book Title Encyclopedia of Machine Learning Chapter Number 00403 Book CopyRight - Year 2010 Title Frequent Pattern Author Particle Given Name Hannu Family Name Toivonen Suffix Email hannu.toivonen@cs.helsinki.fi
More informationAssociation Rules. Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION. Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION
CHAPTER2 Association Rules 2.1 Introduction Many large retail organizations are interested in instituting information-driven marketing processes, managed by database technology, that enable them to Jones
More informationData mining, 4 cu Lecture 7:
582364 Data mining, 4 cu Lecture 7: Sequential Patterns Spring 2010 Lecturer: Juho Rousu Teaching assistant: Taru Itäpelto Sequential Patterns In many data mining tasks the order and timing of events contains
More informationD B M G. Association Rules. Fundamentals. Fundamentals. Elena Baralis, Silvia Chiusano. Politecnico di Torino 1. Definitions.
Definitions Data Base and Data Mining Group of Politecnico di Torino Politecnico di Torino Itemset is a set including one or more items Example: {Beer, Diapers} k-itemset is an itemset that contains k
More informationD B M G. Association Rules. Fundamentals. Fundamentals. Association rules. Association rule mining. Definitions. Rule quality metrics: example
Association rules Data Base and Data Mining Group of Politecnico di Torino Politecnico di Torino Objective extraction of frequent correlations or pattern from a transactional database Tickets at a supermarket
More informationLars Schmidt-Thieme, Information Systems and Machine Learning Lab (ISMLL), University of Hildesheim, Germany
Syllabus Fri. 21.10. (1) 0. Introduction A. Supervised Learning: Linear Models & Fundamentals Fri. 27.10. (2) A.1 Linear Regression Fri. 3.11. (3) A.2 Linear Classification Fri. 10.11. (4) A.3 Regularization
More informationNetBox: A Probabilistic Method for Analyzing Market Basket Data
NetBox: A Probabilistic Method for Analyzing Market Basket Data José Miguel Hernández-Lobato joint work with Zoubin Gharhamani Department of Engineering, Cambridge University October 22, 2012 J. M. Hernández-Lobato
More informationData Warehousing & Data Mining
Data Warehousing & Data Mining Wolf-Tilo Balke Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 9. Business Intelligence 9. Business Intelligence
More informationCS 484 Data Mining. Association Rule Mining 2
CS 484 Data Mining Association Rule Mining 2 Review: Reducing Number of Candidates Apriori principle: If an itemset is frequent, then all of its subsets must also be frequent Apriori principle holds due
More informationBasic Data Structures and Algorithms for Data Profiling Felix Naumann
Basic Data Structures and Algorithms for 8.5.2017 Overview 1. The lattice 2. Apriori lattice traversal 3. Position List Indices 4. Bloom filters Slides with Thorsten Papenbrock 2 Definitions Lattice Partially
More informationA Concise Representation of Association Rules using Minimal Predictive Rules
A Concise Representation of Association Rules using Minimal Predictive Rules Iyad Batal and Milos Hauskrecht Department of Computer Science University of Pittsburgh {iyad,milos}@cs.pitt.edu Abstract. Association
More information1 Frequent Pattern Mining
Decision Support Systems MEIC - Alameda 2010/2011 Homework #5 Due date: 31.Oct.2011 1 Frequent Pattern Mining 1. The Apriori algorithm uses prior knowledge about subset support properties. In particular,
More informationAssociation Rules Discovery in Multivariate Time Series
ssociation Rules Discovery in Multivariate Time Series Elena Lutsiv University of St.-Petersburg, Faculty of Mathematics and Mechanics eluciv@math.spbu.ru bstract problem of association rules discovery
More informationClassification Based on Logical Concept Analysis
Classification Based on Logical Concept Analysis Yan Zhao and Yiyu Yao Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yanzhao, yyao}@cs.uregina.ca Abstract.
More informationDynamic Programming Approach for Construction of Association Rule Systems
Dynamic Programming Approach for Construction of Association Rule Systems Fawaz Alsolami 1, Talha Amin 1, Igor Chikalov 1, Mikhail Moshkov 1, and Beata Zielosko 2 1 Computer, Electrical and Mathematical
More informationExplaining Results of Neural Networks by Contextual Importance and Utility
Explaining Results of Neural Networks by Contextual Importance and Utility Kary FRÄMLING Dep. SIMADE, Ecole des Mines, 158 cours Fauriel, 42023 Saint-Etienne Cedex 2, FRANCE framling@emse.fr, tel.: +33-77.42.66.09
More informationAssociation Rules Information Retrieval and Data Mining. Prof. Matteo Matteucci
Association Rules Information Retrieval and Data Mining Prof. Matteo Matteucci Learning Unsupervised Rules!?! 2 Market-Basket Transactions 3 Bread Peanuts Milk Fruit Jam Bread Jam Soda Chips Milk Fruit
More informationCS4445 B10 Homework 4 Part I Solution
CS4445 B10 Homework 4 Part I Solution Yutao Wang Consider the zoo.arff dataset converted to arff from the Zoo Data Set available at Univ. of California Irvine KDD Data Repository. 1. Load this dataset
More informationEFFICIENT MINING OF WEIGHTED QUANTITATIVE ASSOCIATION RULES AND CHARACTERIZATION OF FREQUENT ITEMSETS
EFFICIENT MINING OF WEIGHTED QUANTITATIVE ASSOCIATION RULES AND CHARACTERIZATION OF FREQUENT ITEMSETS Arumugam G Senior Professor and Head, Department of Computer Science Madurai Kamaraj University Madurai,
More informationCS 584 Data Mining. Association Rule Mining 2
CS 584 Data Mining Association Rule Mining 2 Recall from last time: Frequent Itemset Generation Strategies Reduce the number of candidates (M) Complete search: M=2 d Use pruning techniques to reduce M
More informationDATA MINING - 1DL360
DATA MINING - 1DL36 Fall 212" An introductory class in data mining http://www.it.uu.se/edu/course/homepage/infoutv/ht12 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology, Uppsala
More information10/19/2017 MIST.6060 Business Intelligence and Data Mining 1. Association Rules
10/19/2017 MIST6060 Business Intelligence and Data Mining 1 Examples of Association Rules Association Rules Sixty percent of customers who buy sheets and pillowcases order a comforter next, followed by
More informationApproximate counting: count-min data structure. Problem definition
Approximate counting: count-min data structure G. Cormode and S. Muthukrishhan: An improved data stream summary: the count-min sketch and its applications. Journal of Algorithms 55 (2005) 58-75. Problem
More informationDATA MINING - 1DL105, 1DL111
1 DATA MINING - 1DL105, 1DL111 Fall 2007 An introductory class in data mining http://user.it.uu.se/~udbl/dut-ht2007/ alt. http://www.it.uu.se/edu/course/homepage/infoutv/ht07 Kjell Orsborn Uppsala Database
More informationMining chains of relations
Mining chains of relations Foto Aftrati 1, Gautam Das 2, Aristides Gionis 3, Heikki Mannila 4, Taneli Mielikäinen 5, and Panayiotis Tsaparas 6 1 National Technical University of Athens, afrati@softlab.ece.ntua.gr
More informationIntroduction to Spatial Data Mining
Introduction to Spatial Data Mining 7.1 Pattern Discovery 7.2 Motivation 7.3 Classification Techniques 7.4 Association Rule Discovery Techniques 7.5 Clustering 7.6 Outlier Detection Introduction: a classic
More informationA Clear View on Quality Measures for Fuzzy Association Rules
A Clear View on Quality Measures for Fuzzy Association Rules Martine De Cock, Chris Cornelis, and Etienne E. Kerre Fuzziness and Uncertainty Modelling Research Unit Department of Applied Mathematics and
More informationData Warehousing & Data Mining
9. Business Intelligence Data Warehousing & Data Mining Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 9. Business Intelligence
More informationMining State Dependencies Between Multiple Sensor Data Sources
Mining State Dependencies Between Multiple Sensor Data Sources C. Robardet Co-Authored with Marc Plantevit and Vasile-Marian Scuturici April 2013 1 / 27 Mining Sensor data A timely challenge? Why is it
More informationRare Event Discovery And Event Change Point In Biological Data Stream
Rare Event Discovery And Event Change Point In Biological Data Stream T. Jagadeeswari 1 M.Tech(CSE) MISTE, B. Mahalakshmi 2 M.Tech(CSE)MISTE, N. Anusha 3 M.Tech(CSE) Department of Computer Science and
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationPattern Structures 1
Pattern Structures 1 Pattern Structures Models describe whole or a large part of the data Pattern characterizes some local aspect of the data Pattern is a predicate that returns true for those objects
More informationAssociation Analysis Part 2. FP Growth (Pei et al 2000)
Association Analysis art 2 Sanjay Ranka rofessor Computer and Information Science and Engineering University of Florida F Growth ei et al 2 Use a compressed representation of the database using an F-tree
More informationGeovisualization for Association Rule Mining in CHOPS Well Data
UNIVERSITY OF CALGARY Geovisualization for Association Rule Mining in CHOPS Well Data by Xiaodong Sun A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR
More information