2002 Journal of Software

Size: px
Start display at page:

Download "2002 Journal of Software"

Transcription

1 1-9825/22/13(3) Journal of Software Vol13, No3,,,, (, 2326) http//wwwustceducn,,,,, Agrawal,,, ; ; ; TP18 A,,,,, ( ),,, ; Agrawal [1], [2],, 1 Agrawal [1], [1],Agrawal,, Heikki Mannila Agrawal, ; (G ); ( ) (197 ),,,,, ; (197 ),,,, ; (197 ),,,, ; (1968 ),,,, ; (1938 ),,,,,

2 411 [3], Tim Oates (MSDD) [4],, Oates, Oates,, 2,,,,,,, [5] ( 1 ), ;,,, ( ), ( 2, 3 ),,,, a c c d e f e c g d d a c f a c c d e f e c g d d a c f Fig1 Transform time series into discrete symol sequenc Fig2 Linear segment representation of multiple time series 1 2, [4],,, 1,,,,,

3 412 Journal of Software 22,13(3) DJI3 Nikkei225 Fig3 Time series of two stock indexes (left) and their linear segment representations (right) 3 ( ) 3 1 S={S 1,S 2,,S m } C={C 1,C 2,,C n } PM=(S i,c j,t s,t e ), S i S,C j C,T s T e 2 P=(V,,T s,t e V PM V,, x, y V,x y, x y, P ;, x, y V, x y y x, P Ts,T e P V V, P 3 P s (V,,T s,t e ) P (V,,T s,t e ), P s P, V V, ( = ),T s =<T s,t e =<T e 4 P (V,,T s,t e ) [t,t ], [t,t ] V =V,, t=<t s,t >=T e 5 P [t,t ], [u,u ] [t,t ], P Si(P) P Si(P)={[t,t ] [t,t ] P } P S, Si(P) Si(P), f min (t t) T 4 C k k,l k k, 1 Find_Freq_Pattern(S,T, f min ) S, T, f min T L k {Si(P) P L k } (1) C 1 = 1 = ; //,C (2) For all P C 1, Si(P); //,Si(P) (3) L 1 ={P C 1 length(si(p))>= f min }; (4) k =1; (5) while L k >=2 (a) k =k +1;

4 413 () C k =C_generator(L k 1 ); // L k 1 k C k, L k 1 (c) For all P C k do // Si(P), 2 ; (d) P 1 P 2, Si(P 1 ) Si(P 2 ) Si(P); (e) L k ={P C k length(si(p))>= f min }; (6) L k (k=1,2,) Si(P) C k ( C_generator ) [1], ( ),, (k 2), Agrawal, k (k 2) (k 1),, k 1 Si(P), k c k, Si(p k 1 ) 2 join(l k 1,Si,C k ) k 1 L k 1 {Si(P) P L k 1 }, C k {Si(P) P C k } (1) for all P C k do (a) find P 1 L k 1,where P 1 (1)=P(1), P 1 (2)=P(2),,and P 1 (k 1)=P(k 1); //P(i) P i () find P 2 L k 1,where P 2 (1)=P(2),P 2 (2)=P(3),,and P 2 (k 1)=P(k); (c) Si(P)={[t,t ] [t,t1] Si(P 1 ) [t2,t ] Si(P 2 ), t<t2,t1<t,t t=<t, [t,t ] }; (2) end, P 1 L k 1 P 2 L k 1, P C k, Si(P) }; Si(P)={[t,t ] 1 1 ] Si(P 1 ) [t 2,t 2 ] Si(P 2 ), t=min{t 1,t 2 },t =max{t 1,t 2 },t t=<t, Si(P 1 ) + Si(P 2 ) Si(P 1 ) + Si(P 2 ), Si(P 1 ) + Si(P 2 ), Si(P 1 ) + Si(P 2 ) P 1 P 2 P C k [t,t ], P 1, P 1 [t 1,t 1 ](t 1 >=t t 1 =<t ), [t 1,t 1 ] Si(P 1 ), [t 11,t 11 ] Si(P 1 ), t 11 >=t 1 t 11 =<t 1, P 2 [t 2,t 2 ] Si(P 2 )(t 2 >=t t 2 =<t ), Si(P 1 ) + Si(P 2 ), 1 f min T L k ( f min,t),k=1,2,,m, Si(P),P L k 1 f min =f min 1 T=T 1 F 1, f min 2 =<f min 1 T 2 >=T 1 F 2, T 2 =T 1, ; T 2 >=T 1,, F A (f min A,T A ) F B (f min B,T B ), f min A <f min B T A <T B F A F B =L A B (f min B, T A )={P P F A &P F B }, P L A B (f min B,T A ),Si(P)=Si A (P),,Si A (P) F A P

5 414 Journal of Software 22,13(3),,, 5 DJI3 Nikkei ~ ( ) ( ),, f min T, 1 f min =3 T 2 T=4 f min,,, Tale 1 Numer of candidate patterns and frequent patterns of different lengths (K) discovered when different spans of time (T ) are adopted 1 (T ) (K) f min =3 K=1 K=2 K=3 K=4 K=5 K=6 T=1 Candidate pattern set Frequent pattern set T=2 Candidate pattern set Frequent pattern set T=3 Candidate pattern set Frequent pattern set T=4 Candidate pattern set Frequent pattern set , Tale 2 Numer of candidate patterns and frequent patterns of different lengths (K) discovered when different frequency thresholds (f min ) are adopted 2 ( f min ) (K) T=4 K=1 K=2 K=3 K=4 K=5 K=6 f min =3 Candidate pattern set Frequent pattern set f min =5 Candidate pattern set Frequent pattern set f min =7 Candidate pattern set Frequent pattern set f min =9 Candidate pattern set Frequent pattern set , 4 f min T, T, f min ns Num er of patter Candidat set Frequent set Numer of patterns Candidate set Frequent set Span of time Frequency threshold,,,, Fig4 Relationship of the numer of patterns with the span of time (left) and frequency threshold (right) 4 T( ) f min ( )

6 415 5 ( ) DJI3 1 5 Nikkei DJI3 Nikkei Fig5 Some discovered frequent patterns of different lengths represented in linear segments 5, (1),,, (2),, References [1] Agrawal, R, Imielinski, T, Swami, A Mining association rules etween sets of items in large dataases In Buneman, P, Jajodia, S, eds Proceedings of the ACM SIGMOD Conference on Management of Data (SIGMOD 93) Washington, DC ACM, ~216 [2] Mannila, H, Toivonen, H Discovering generalized episodes using minimal occurrences In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD 96) Portland, Oregon AAAI Press, ~151 http//wwwcs helsinkifi/~mannila/postscripts/genepisodesps [3] Mannila, H, Toivonen, H, Verkamo, AI Discovery of frequent episodes in event sequences Data Mining and Knowledge Discovery, 1997,1(3)259~289 [4] Oates, T, Cohen, PR Searching for structure in multiple streams of data In Proceedings of the 13th International Conference on Machine Learning Morgan Kaufmann Pulishers, Inc, ~354 http//www-ekslcsumassedu/papers/oates96cps [5] Li, Bin, Tan, Li-xiang, Zhang, Jin-song, et al The study of the data mining oriented method for the symolization of time series Journal of Circuits and Systems, 2,5(2)9~14 (in Chinese)

7 416 Journal of Software 22,13(3) [5],,,,,2,5(2)9~14 An Algorithm for Discovering Frequent Patterns in Non-Synchronous Multiple Time Series LI Bin, TAN Li-xiang, XIE Guang-jun, LI Hai-ying, ZHUANG Zhen-quan (Department of Electronic Science and Technology, University of Science and Technology of China, Hefei 2326, China) http//wwwustceducn Astract Discovering frequent patterns in multiple time series is important in practices Methods appeared in literatures assume that the multiple time series are synchronous, ut in the real world, that is not always satisfied, in most cases they are non-synchronous In this paper, an algorithm for discovering frequent patterns in nonsynchronous multiple time series is proposed In this algorithm, first, the time series is segmented and symolized with the linear segment representation and the vector shape clustering method, so that each symol can represent a primitive and independent pattern Then, the minimal occurrence representation of time series and the association rule discovery algorithm proposed y Agrawal is comined to extract frequent patterns of various structures from non-synchronous multiple time series Compared with the previous methods, the algorithm is more simple and flexile, and does not require time series to e synchronous Experimental results show the efficiency of the algorithm Key words data mining; time series; frequent pattern; minimal occurrence; symolization Received June 15, 2; accepted Septemer 26, 2 Supported y the National Grand Fundamental Research 973 Program of China under Grant NoG ; the National Research Foundation for the Doctoral Program of Higher Education of China under Grant No

Chapter 6. Frequent Pattern Mining: Concepts and Apriori. Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining

Chapter 6. Frequent Pattern Mining: Concepts and Apriori. Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining Chapter 6. Frequent Pattern Mining: Concepts and Apriori Meng Jiang CSE 40647/60647 Data Science Fall 2017 Introduction to Data Mining Pattern Discovery: Definition What are patterns? Patterns: A set of

More information

CSE 5243 INTRO. TO DATA MINING

CSE 5243 INTRO. TO DATA MINING CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University Slides adapted from Prof. Jiawei Han @UIUC, Prof. Srinivasan

More information

Encyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen

Encyclopedia of Machine Learning Chapter Number Book CopyRight - Year 2010 Frequent Pattern. Given Name Hannu Family Name Toivonen Book Title Encyclopedia of Machine Learning Chapter Number 00403 Book CopyRight - Year 2010 Title Frequent Pattern Author Particle Given Name Hannu Family Name Toivonen Suffix Email hannu.toivonen@cs.helsinki.fi

More information

Association Rule. Lecturer: Dr. Bo Yuan. LOGO

Association Rule. Lecturer: Dr. Bo Yuan. LOGO Association Rule Lecturer: Dr. Bo Yuan LOGO E-mail: yuanb@sz.tsinghua.edu.cn Overview Frequent Itemsets Association Rules Sequential Patterns 2 A Real Example 3 Market-Based Problems Finding associations

More information

CSE 5243 INTRO. TO DATA MINING

CSE 5243 INTRO. TO DATA MINING CSE 5243 INTRO. TO DATA MINING Mining Frequent Patterns and Associations: Basic Concepts (Chapter 6) Huan Sun, CSE@The Ohio State University 10/17/2017 Slides adapted from Prof. Jiawei Han @UIUC, Prof.

More information

Discovery of Functional and Approximate Functional Dependencies in Relational Databases

Discovery of Functional and Approximate Functional Dependencies in Relational Databases JOURNAL OF APPLIED MATHEMATICS AND DECISION SCIENCES, 7(1), 49 59 Copyright c 2003, Lawrence Erlbaum Associates, Inc. Discovery of Functional and Approximate Functional Dependencies in Relational Databases

More information

DATA MINING FOR TELECONNECTIONS IN GLOBAL CLIMATE

DATA MINING FOR TELECONNECTIONS IN GLOBAL CLIMATE DATA MINING FOR TELECONNECTIONS IN GLOBAL CLIMATE DATASETS * Xiaoping Gao*, Kunqing Xie, Xingxing Jin, Xiaofeng Lei Center of Information Science, Peking University, Beijing, 100871, China *Email: xiaopinggao@gmail.com

More information

On Minimal Infrequent Itemset Mining

On Minimal Infrequent Itemset Mining On Minimal Infrequent Itemset Mining David J. Haglin and Anna M. Manning Abstract A new algorithm for minimal infrequent itemset mining is presented. Potential applications of finding infrequent itemsets

More information

Mining Interesting Infrequent and Frequent Itemsets Based on Minimum Correlation Strength

Mining Interesting Infrequent and Frequent Itemsets Based on Minimum Correlation Strength Mining Interesting Infrequent and Frequent Itemsets Based on Minimum Correlation Strength Xiangjun Dong School of Information, Shandong Polytechnic University Jinan 250353, China dongxiangjun@gmail.com

More information

Discovery of Frequent Word Sequences in Text. Helena Ahonen-Myka. University of Helsinki

Discovery of Frequent Word Sequences in Text. Helena Ahonen-Myka. University of Helsinki Discovery of Frequent Word Sequences in Text Helena Ahonen-Myka University of Helsinki Department of Computer Science P.O. Box 26 (Teollisuuskatu 23) FIN{00014 University of Helsinki, Finland, helena.ahonen-myka@cs.helsinki.fi

More information

Data mining, Hypergraph Transversals, and Machine Learning. Extended abstract

Data mining, Hypergraph Transversals, and Machine Learning. Extended abstract Data mining, Hypergraph Transversals, and Machine Learning Extended astract Dimitrios Gunopulos IBM Almaden RC K55/B1 650 Harry Rd. San Jose CA 95120 USA gunopulo@almaden.im.com Roni Khardon y Aiken Computation

More information

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 6

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 6 Data Mining: Concepts and Techniques (3 rd ed.) Chapter 6 Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University 2013 Han, Kamber & Pei. All rights

More information

Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries

Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries Free-sets : a Condensed Representation of Boolean Data for the Approximation of Frequency Queries To appear in Data Mining and Knowledge Discovery, an International Journal c Kluwer Academic Publishers

More information

Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries

Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency Queries Data Mining and Knowledge Discovery, 7, 5 22, 2003 c 2003 Kluwer Academic Publishers. Manufactured in The Netherlands. Free-Sets: A Condensed Representation of Boolean Data for the Approximation of Frequency

More information

CPDA Based Fuzzy Association Rules for Learning Achievement Mining

CPDA Based Fuzzy Association Rules for Learning Achievement Mining 2009 International Conference on Machine Learning and Computing IPCSIT vol.3 (2011) (2011) IACSIT Press, Singapore CPDA Based Fuzzy Association Rules for Learning Achievement Mining Jr-Shian Chen 1, Hung-Lieh

More information

Application of Apriori Algorithm in Open Experiment

Application of Apriori Algorithm in Open Experiment 2011 International Conference on Computer Science and Information Technology (ICCSIT 2011) IPCSIT vol. 51 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V51.130 Application of Apriori Algorithm

More information

Similarity of Attributes by External Probes. P.O. Box 26, FIN Helsinki, Finland. with larger domains.)

Similarity of Attributes by External Probes. P.O. Box 26, FIN Helsinki, Finland. with larger domains.) Similarity of Attributes by External Probes Gautam Das University of Memphis Department of Mathematical Sciences Memphis TN 8, USA dasg@msci.memphis.edu Heikki Mannila and Pirjo Ronkainen University of

More information

Mining Positive and Negative Fuzzy Association Rules

Mining Positive and Negative Fuzzy Association Rules Mining Positive and Negative Fuzzy Association Rules Peng Yan 1, Guoqing Chen 1, Chris Cornelis 2, Martine De Cock 2, and Etienne Kerre 2 1 School of Economics and Management, Tsinghua University, Beijing

More information

Levelwise Search and Borders of Theories in Knowledge Discovery

Levelwise Search and Borders of Theories in Knowledge Discovery Data Mining and Knowledge Discovery 1, 241 258 (1997) c 1997 Kluwer Academic Publishers. Manufactured in The Netherlands. Levelwise Search and Borders of Theories in Knowledge Discovery HEIKKI MANNILA

More information

Theory of Dependence Values

Theory of Dependence Values Theory of Dependence Values ROSA MEO Università degli Studi di Torino A new model to evaluate dependencies in data mining problems is presented and discussed. The well-known concept of the association

More information

arxiv: v1 [cs.db] 19 May 2018

arxiv: v1 [cs.db] 19 May 2018 Free-rider Episode Screening via Dual Partition Model Xiang Ao 1,2, Yang Liu 1,2, Zhen Huang 3, Luo Zuo 1,2, and Qing He 1,2 1 Key Lab of Intelligent Information Processing of Chinese Academy of Sciences

More information

ASSOCIATION RULE MINING AND CLASSIFIER APPROACH FOR QUANTITATIVE SPOT RAINFALL PREDICTION

ASSOCIATION RULE MINING AND CLASSIFIER APPROACH FOR QUANTITATIVE SPOT RAINFALL PREDICTION ASSOCIATION RULE MINING AND CLASSIFIER APPROACH FOR QUANTITATIVE SPOT RAINFALL PREDICTION T.R. SIVARAMAKRISHNAN 1, S. MEGANATHAN 2 1 School of EEE, SASTRA University, Thanjavur, 613 401 2 Department of

More information

Mining Strong Positive and Negative Sequential Patterns

Mining Strong Positive and Negative Sequential Patterns Mining Strong Positive and Negative Sequential Patter NANCY P. LIN, HUNG-JEN CHEN, WEI-HUA HAO, HAO-EN CHUEH, CHUNG-I CHANG Department of Computer Science and Information Engineering Tamang University,

More information

Algorithmic Methods of Data Mining, Fall 2005, Course overview 1. Course overview

Algorithmic Methods of Data Mining, Fall 2005, Course overview 1. Course overview Algorithmic Methods of Data Mining, Fall 2005, Course overview 1 Course overview lgorithmic Methods of Data Mining, Fall 2005, Course overview 1 T-61.5060 Algorithmic methods of data mining (3 cp) P T-61.5060

More information

Guaranteeing the Accuracy of Association Rules by Statistical Significance

Guaranteeing the Accuracy of Association Rules by Statistical Significance Guaranteeing the Accuracy of Association Rules by Statistical Significance W. Hämäläinen Department of Computer Science, University of Helsinki, Finland Abstract. Association rules are a popular knowledge

More information

Approximating a Collection of Frequent Sets

Approximating a Collection of Frequent Sets Approximating a Collection of Frequent Sets Foto Afrati National Technical University of Athens Greece afrati@softlab.ece.ntua.gr Aristides Gionis HIIT Basic Research Unit Dept. of Computer Science University

More information

Removing trivial associations in association rule discovery

Removing trivial associations in association rule discovery Removing trivial associations in association rule discovery Geoffrey I. Webb and Songmao Zhang School of Computing and Mathematics, Deakin University Geelong, Victoria 3217, Australia Abstract Association

More information

A Framework of Detecting Burst Events from Micro-blogging Streams

A Framework of Detecting Burst Events from Micro-blogging Streams , pp.379-384 http://dx.doi.org/10.14257/astl.2013.29.78 A Framework of Detecting Burst Events from Micro-blogging Streams Kaifang Yang, Yongbo Yu, Lizhou Zheng, Peiquan Jin School of Computer Science and

More information

An Approach to Classification Based on Fuzzy Association Rules

An Approach to Classification Based on Fuzzy Association Rules An Approach to Classification Based on Fuzzy Association Rules Zuoliang Chen, Guoqing Chen School of Economics and Management, Tsinghua University, Beijing 100084, P. R. China Abstract Classification based

More information

Modified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context.

Modified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context. Modified Entropy Measure for Detection of Association Rules Under Simpson's Paradox Context. Murphy Choy Cally Claire Ong Michelle Cheong Abstract The rapid explosion in retail data calls for more effective

More information

Mining Partially Periodic Event Patterns With. Unknown Periods. Sheng Ma and Joseph L. Hellerstein. IBM T.J. Watson Research Center

Mining Partially Periodic Event Patterns With. Unknown Periods. Sheng Ma and Joseph L. Hellerstein. IBM T.J. Watson Research Center Mining Partially Periodic Event Patterns With Unknown Periods Sheng Ma and Joseph L. Hellerstein IBM T.J. Watson Research Center Hawthorne, NY 10532 Abstract Periodic behavior is common in real-world applications.

More information

Warmr: a data mining tool for chemical data

Warmr: a data mining tool for chemical data Journal of Computer-Aided Molecular Design, 15: 173 181, 2001. KLUWER/ESCOM 2001 Kluwer Academic Publishers. Printed in the Netherlands. 173 Warmr: a data mining tool for chemical data Ross D. King a,

More information

Quantitative Association Rule Mining on Weighted Transactional Data

Quantitative Association Rule Mining on Weighted Transactional Data Quantitative Association Rule Mining on Weighted Transactional Data D. Sujatha and Naveen C. H. Abstract In this paper we have proposed an approach for mining quantitative association rules. The aim of

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(5):266-270 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Anomaly detection of cigarette sales using ARIMA

More information

Cube Lattices: a Framework for Multidimensional Data Mining

Cube Lattices: a Framework for Multidimensional Data Mining Cube Lattices: a Framework for Multidimensional Data Mining Alain Casali Rosine Cicchetti Lotfi Lakhal Abstract Constrained multidimensional patterns differ from the well-known frequent patterns from a

More information

Mining Class-Dependent Rules Using the Concept of Generalization/Specialization Hierarchies

Mining Class-Dependent Rules Using the Concept of Generalization/Specialization Hierarchies Mining Class-Dependent Rules Using the Concept of Generalization/Specialization Hierarchies Juliano Brito da Justa Neves 1 Marina Teresa Pires Vieira {juliano,marina}@dc.ufscar.br Computer Science Department

More information

PATTERN VISUALIZATION ON METEOROLOGICAL DATA FOR RAINFALL PREDICTION MODEL

PATTERN VISUALIZATION ON METEOROLOGICAL DATA FOR RAINFALL PREDICTION MODEL PATTERN VISUALIZATION ON METEOROLOGICAL DATA FOR RAINFALL PREDICTION MODEL 1 S. MEGANATHAN, 2 T.R. SIVARAMAKRISHNAN 1 Department of CSE, SASTRA University, Kumbakonam, 612 001 2 School of EEE, SASTRA University,

More information

On the robustness of association rules

On the robustness of association rules On the robustness of association rules Philippe Lenca, Benoît Vaillant, Stéphane Lallich GET/ENST Bretagne CNRS UMR 2872 TAMCIC Technopôle de Brest Iroise CS 8388, 29238 Brest Cedex, France Email: philippe.lenca@enst-bretagne.fr

More information

Dynamic Programming Approach for Construction of Association Rule Systems

Dynamic Programming Approach for Construction of Association Rule Systems Dynamic Programming Approach for Construction of Association Rule Systems Fawaz Alsolami 1, Talha Amin 1, Igor Chikalov 1, Mikhail Moshkov 1, and Beata Zielosko 2 1 Computer, Electrical and Mathematical

More information

Mining periodic patterns from nested event logs

Mining periodic patterns from nested event logs University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2014 Mining periodic patterns from nested event

More information

Bottom-Up Propositionalization

Bottom-Up Propositionalization Bottom-Up Propositionalization Stefan Kramer 1 and Eibe Frank 2 1 Institute for Computer Science, Machine Learning Lab University Freiburg, Am Flughafen 17, D-79110 Freiburg/Br. skramer@informatik.uni-freiburg.de

More information

CLUSTERING AND PROFILING TRAFFIC ROADS BY MEANS OF ACCIDENT DATA. Karolien Geurts Geert Wets Tom Brijs Koen Vanhoof Limburg University

CLUSTERING AND PROFILING TRAFFIC ROADS BY MEANS OF ACCIDENT DATA. Karolien Geurts Geert Wets Tom Brijs Koen Vanhoof Limburg University CLUSTERING AND PROFILING TRAFFIC ROADS BY MEANS OF ACCIDENT DATA Karolien Geurts Geert Wets Tom Brijs Koen Vanhoof Limburg University 1. INTRODUCTION In Belgium, every year approximately 50.000 injury

More information

Mining Free Itemsets under Constraints

Mining Free Itemsets under Constraints Mining Free Itemsets under Constraints Jean-François Boulicaut Baptiste Jeudy Institut National des Sciences Appliquées de Lyon Laboratoire d Ingénierie des Systèmes d Information Bâtiment 501 F-69621

More information

Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data

Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional -Series Data Xiaolei Li, Jiawei Han University of Illinois at Urbana-Champaign VLDB 2007 1 Series Data Many applications produce time series

More information

Density-Based Clustering

Density-Based Clustering Density-Based Clustering idea: Clusters are dense regions in feature space F. density: objects volume ε here: volume: ε-neighborhood for object o w.r.t. distance measure dist(x,y) dense region: ε-neighborhood

More information

Machine Learning: Pattern Mining

Machine Learning: Pattern Mining Machine Learning: Pattern Mining Information Systems and Machine Learning Lab (ISMLL) University of Hildesheim Wintersemester 2007 / 2008 Pattern Mining Overview Itemsets Task Naive Algorithm Apriori Algorithm

More information

Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise

Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise Guimei Liu 1,2 Jinyan Li 1 Limsoon Wong 2 Wynne Hsu 2 1 Institute for Infocomm Research, Singapore 2 School

More information

Efficient discovery of statistically significant association rules

Efficient discovery of statistically significant association rules fficient discovery of statistically significant association rules Wilhelmiina Hämäläinen Department of Computer Science University of Helsinki Finland whamalai@cs.helsinki.fi Matti Nykänen Department of

More information

Finding All Minimal Infrequent Multi-dimensional Intervals

Finding All Minimal Infrequent Multi-dimensional Intervals Finding All Minimal nfrequent Multi-dimensional ntervals Khaled M. Elbassioni Max-Planck-nstitut für nformatik, Saarbrücken, Germany; elbassio@mpi-sb.mpg.de Abstract. Let D be a database of transactions

More information

Estimating a Finite Population Mean under Random Non-Response in Two Stage Cluster Sampling with Replacement

Estimating a Finite Population Mean under Random Non-Response in Two Stage Cluster Sampling with Replacement Open Journal of Statistics, 07, 7, 834-848 http://www.scirp.org/journal/ojs ISS Online: 6-798 ISS Print: 6-78X Estimating a Finite Population ean under Random on-response in Two Stage Cluster Sampling

More information

Mining Partially Periodic Event Patterns With Unknown Periods*

Mining Partially Periodic Event Patterns With Unknown Periods* Mining Partially Periodic Event Patterns With Unknown Periods* Sheng Ma and Joseph L. Hellerstein IBM T.J. Watson Research Center Hawthorne, NY 10532 { shengma, jlh} @us.ibm.com Abstract Periodic behavior

More information

Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results

Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Distributed Mining of Frequent Closed Itemsets: Some Preliminary Results Claudio Lucchese Ca Foscari University of Venice clucches@dsi.unive.it Raffaele Perego ISTI-CNR of Pisa perego@isti.cnr.it Salvatore

More information

2008, hm 2. ( Commodity Bundle) [ 6], 25 4 Vol. 25 No JOURNAL OF NATURAL RESOURCES Apr., , 2, 3, 1, 2 3*,

2008, hm 2. ( Commodity Bundle) [ 6], 25 4 Vol. 25 No JOURNAL OF NATURAL RESOURCES Apr., , 2, 3, 1, 2 3*, 25 4 Vol. 25 No. 4 2010 4 JOURNAL OF NATURAL RESOURCES Apr., 2010 1, 2, 3, 3*, 3, 3, 1, 2 ( 1., 100101; 2., 100049; 3., 100193) :,,,,, ;, 2005, 12 7 5, 2005 :,,, : ; ; ; ; : F301. 21 : A : 1000-3037( 2010)

More information

Image information measurement for video retrieval

Image information measurement for video retrieval 37 2 Vol.37 No.2 2016 2 Journal on Communications Feruary 2016 doi:10.11959/j.issn.1000-436x.2016033 1,2 3,4 3,4 3,4 1,2 (1100193 2 100029 3 100190 4 100190 SEII map 4.4% 1.5 TP37 A Image information measurement

More information

SPATIO-TEMPORAL PATTERN MINING ON TRAJECTORY DATA USING ARM

SPATIO-TEMPORAL PATTERN MINING ON TRAJECTORY DATA USING ARM SPATIO-TEMPORAL PATTERN MINING ON TRAJECTORY DATA USING ARM S. Khoshahval a *, M. Farnaghi a, M. Taleai a a Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, Tehran, Iran

More information

Course overview. Heikki Mannila Laboratory for Computer and Information Science Helsinki University of Technology

Course overview. Heikki Mannila Laboratory for Computer and Information Science Helsinki University of Technology Course overview Heikki Mannila Laboratory for Computer and Information Science Helsinki University of Technology Heikki.Mannila@tkk.fi Algorithmic methods of data mining, Autumn 2007, Course overview 1

More information

Algorithmic methods of data mining, Autumn 2007, Course overview0-0. Course overview

Algorithmic methods of data mining, Autumn 2007, Course overview0-0. Course overview Algorithmic methods of data mining, Autumn 2007, Course overview0-0 Course overview Heikki Mannila Laboratory for Computer and Information Science Helsinki University of Technology Heikki.Mannila@tkk.fi

More information

InfoMiner: Mining Surprising Periodic Patterns

InfoMiner: Mining Surprising Periodic Patterns InfoMiner: Mining Surprising Periodic Patterns Jiong Yang IBM Watson Research Center jiyang@us.ibm.com Wei Wang IBM Watson Research Center ww1@us.ibm.com Philip S. Yu IBM Watson Research Center psyu@us.ibm.com

More information

Connections between mining frequent itemsets and learning generative models

Connections between mining frequent itemsets and learning generative models Connections between mining frequent itemsets and learning generative models Srivatsan Laxman Microsoft Research Labs India slaxman@microsoft.com Prasad Naldurg Microsoft Research Labs India prasadn@microsoft.com

More information

Data Mining in Chemometrics Sub-structures Learning Via Peak Combinations Searching in Mass Spectra

Data Mining in Chemometrics Sub-structures Learning Via Peak Combinations Searching in Mass Spectra Data Mining in Chemometrics Sub-structures Learning Via Peak Combinations Searching in Mass Spectra Yu Tang 1,2, Yizeng Liang 3, Kai-Tai Fang 1 1 Hong Kong Baptist University 2 Suzhou University 3 Central

More information

New Constructions of Sonar Sequences

New Constructions of Sonar Sequences INTERNATIONAL JOURNAL OF BASIC & APPLIED SCIENCES IJBAS-IJENS VOL.:14 NO.:01 12 New Constructions of Sonar Sequences Diego F. Ruiz 1, Carlos A. Trujillo 1, and Yadira Caicedo 2 1 Department of Mathematics,

More information

Analysis and summarization of correlations in data cubes and its application in microarray data analysis

Analysis and summarization of correlations in data cubes and its application in microarray data analysis Intelligent Data Analysis 9 (2005) 43 57 43 IOS Press Analysis and summarization of correlations in data cubes and its application in microarray data analysis Chien-Yu Chen, Shien-Ching Hwang and Yen-Jen

More information

Backward Time Related Association Rule mining with Database Rearrangement in Traffic Volume Prediction

Backward Time Related Association Rule mining with Database Rearrangement in Traffic Volume Prediction Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Backward Related Association Rule mining with Database Rearrangement in Traffic

More information

Transaction Databases, Frequent Itemsets, and Their Condensed Representations

Transaction Databases, Frequent Itemsets, and Their Condensed Representations Transaction Databases, Frequent Itemsets, and Their Condensed Representations Taneli Mielikäinen HIIT Basic Research Unit Department of Computer Science University of Helsinki, Finland Abstract. Mining

More information

2005 Journal of Software (, ) A Spatial Index to Improve the Response Speed of WebGIS Servers

2005 Journal of Software (, ) A Spatial Index to Improve the Response Speed of WebGIS Servers 1000-9825/2005/16(05)0819 2005 Journal of Software Vol.16, No.5 WebGIS +,, (, 410073) A Spatial Index to Improve the Response Speed of WebGIS Servers YE Chang-Chun +, LUO Jin-Ping, ZHOU Xing-Ming (Institute

More information

Finding Association Rules that Trade Support Optimally Against Confidence

Finding Association Rules that Trade Support Optimally Against Confidence Finding Association Rules that Trade Support Optimally Against Confidence Tobias Scheffer Humboldt-Universität zu Berlin, Department of Computer Science Unter den Linden 6, 199 Berlin, Germany scheffer@informatik.hu-berlin.de

More information

Technical Report CSD-TR No Mining Frequent Itemsets in Uncertain Datasets

Technical Report CSD-TR No Mining Frequent Itemsets in Uncertain Datasets Technical Report CSD-TR No. 030042 Mining Frequent Itemsets in Uncertain Datasets Richard Muntz and Yi Xia Computer Science Department University of California Los Angeles, CA 90095 {muntz, xiayi}@cs.ucla.edu

More information

Fuzzy Logic Controller Based on Association Rules

Fuzzy Logic Controller Based on Association Rules Annals of the University of Craiova, Mathematics and Computer Science Series Volume 37(3), 2010, Pages 12 21 ISSN: 1223-6934 Fuzzy Logic Controller Based on Association Rules Ion IANCU and Mihai GABROVEANU

More information

Entropy and Self Organizing in Edge Organizations

Entropy and Self Organizing in Edge Organizations Entropy and Self Organizing in Edge Organizations Yan Jin and Qianyu Liu Dept of Aerospace & Mechanical Engineering University of Southern California 14 th ICCRTS, June 15 17, 2009 Washington D.C. Background

More information

Data mining, Hypergraph Transversals, and Machine Learning*

Data mining, Hypergraph Transversals, and Machine Learning* Data mining, Hypergraph Transversals, and Machine Learning* Extended abstract Dimitrios Gunopulos IBM Ahnaden RC K55/Bl 650 Han+ Rd. San Jose CA 95120 USA gmopuloqalmaden.ibm.com Roni Khardont A&en Computation

More information

Improving time-series rule matching performance for detecting energy consumption patterns

Improving time-series rule matching performance for detecting energy consumption patterns Improving time-series rule matching performance for detecting energy consumption patterns Maël Guilleme, Laurence Rozé, Véronique Masson, Cérès Carton, René Quiniou, Alexandre Termier To cite this version:

More information

Temporality and Context for Detecting Adverse Drug Reactions from Longitudinal Data

Temporality and Context for Detecting Adverse Drug Reactions from Longitudinal Data Noname manuscript No. (will be inserted by the editor) Temporality and Context for Detecting Adverse Drug Reactions from Longitudinal Data Henry Lo Wei Ding Zohreh Nazeri the date of receipt and acceptance

More information

Simultaneous Reliability Evaluation of Generality and Accuracy for Rule Discovery in Databases Einoshin Suzuki

Simultaneous Reliability Evaluation of Generality and Accuracy for Rule Discovery in Databases Einoshin Suzuki Simultaneous Reliability Evaluation of Generality and Accuracy for Rule Discovery in Databases Einoshin Suzuki Electrical and Computer Engineering, Yokohama National University, 79-5, Toldwadai, Hodogaya,

More information

Constructing comprehensive summaries of large event sequences

Constructing comprehensive summaries of large event sequences Constructing comprehensive summaries of large event sequences JERRY KIERNAN IBM Silicon Valley Lab and EVIMARIA TERZI IBM Almaden Research Center Event sequences capture system and user activity over time.

More information

Mining Frequent Partite Episodes with Partwise Constraints

Mining Frequent Partite Episodes with Partwise Constraints Mining Frequent Partite Episodes with Partwise Constraints Takashi Katoh, Shin-ichiro Tago, Tatsuya Asai, Hiroaki Morikawa, Junichi Shigezumi, and Hiroya Inakoshi Fujitsu Laboratories Ltd., Kawasaki, 211-8588,

More information

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann

Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya Yanki, Bojana Dimcheva, Donald Kossmann Towards Indexing Functions: Answering Scalar Product Queries Arijit Khan, Pouya anki, Bojana Dimcheva, Donald Kossmann Systems Group ETH Zurich Moving Objects Intersection Finding Position at a future

More information

Discovering generalized episodes using minimal occurrences. Heikki Mannila and Hannu Toivonen. University of Helsinki. Department of Computer Science

Discovering generalized episodes using minimal occurrences. Heikki Mannila and Hannu Toivonen. University of Helsinki. Department of Computer Science Discovering generalized episodes using minimal occurrences Heikki Mannila and Hannu Toivonen University of Helsinki Department of Computer Science P.O. Box 26, FIN-00014 Helsinki, Finland Heikki.Mannila@cs.Helsinki.FI,

More information

A Brief Introduction to Temporality and Causality

A Brief Introduction to Temporality and Causality A Brief Introduction to Temporality and Causality Kamran Karimi D-Wave Systems Inc. Burnaby, BC, Canada kkarimi@dwavesys.com Abstract Causality is a non-obvious concept that is often considered to be related

More information

[11] [3] [1], [12] HITS [4] [2] k-means[9] 2 [11] [3] [6], [13] 1 ( ) k n O(n k ) 1 2

[11] [3] [1], [12] HITS [4] [2] k-means[9] 2 [11] [3] [6], [13] 1 ( ) k n O(n k ) 1 2 情報処理学会研究報告 1,a) 1 1 2 IT Clustering by Clique Enumeration and Data Cleaning like Method Abstract: Recent development on information technology has made bigdata analysis more familiar in research and industrial

More information

Nonlinear dynamic simulation model of switched reluctance linear machine

Nonlinear dynamic simulation model of switched reluctance linear machine Procedia Earth and Planetary Science 1 (2009) 1320 1324 Procedia Earth and Planetary Science www.elsevier.com/locate/procedia The 6 th International Conference on Mining Science & Technology Nonlinear

More information

An Intersection Inequality for Discrete Distributions and Related Generation Problems

An Intersection Inequality for Discrete Distributions and Related Generation Problems An Intersection Inequality for Discrete Distributions and Related Generation Problems E. Boros 1, K. Elbassioni 1, V. Gurvich 1, L. Khachiyan 2, and K. Makino 3 1 RUTCOR, Rutgers University, 640 Bartholomew

More information

Association Rules Discovery in Multivariate Time Series

Association Rules Discovery in Multivariate Time Series ssociation Rules Discovery in Multivariate Time Series Elena Lutsiv University of St.-Petersburg, Faculty of Mathematics and Mechanics eluciv@math.spbu.ru bstract problem of association rules discovery

More information

Optimizing Abstaining Classifiers using ROC Analysis. Tadek Pietraszek / 'tʌ dek pɪe 'trʌ ʃek / ICML 2005 August 9, 2005

Optimizing Abstaining Classifiers using ROC Analysis. Tadek Pietraszek / 'tʌ dek pɪe 'trʌ ʃek / ICML 2005 August 9, 2005 IBM Zurich Research Laboratory, GSAL Optimizing Abstaining Classifiers using ROC Analysis Tadek Pietraszek / 'tʌ dek pɪe 'trʌ ʃek / pie@zurich.ibm.com ICML 2005 August 9, 2005 To classify, or not to classify:

More information

Discovering Non-Redundant Association Rules using MinMax Approximation Rules

Discovering Non-Redundant Association Rules using MinMax Approximation Rules Discovering Non-Redundant Association Rules using MinMax Approximation Rules R. Vijaya Prakash Department Of Informatics Kakatiya University, Warangal, India vijprak@hotmail.com Dr.A. Govardhan Department.

More information

Layered critical values: a powerful direct-adjustment approach to discovering significant patterns

Layered critical values: a powerful direct-adjustment approach to discovering significant patterns Mach Learn (2008) 71: 307 323 DOI 10.1007/s10994-008-5046-x TECHNICAL NOTE Layered critical values: a powerful direct-adjustment approach to discovering significant patterns Geoffrey I. Webb Received:

More information

On Approximating Minimum Infrequent and Maximum Frequent Sets

On Approximating Minimum Infrequent and Maximum Frequent Sets On Approximating Minimum Infrequent and Maximum Frequent Sets Mario Boley Fraunhofer IAIS, Schloss Birlinghoven, Sankt Augustin, Germany mario.boley@iais.fraunhofer.de Abstract. The maximum cardinality

More information

Selecting the Right Interestingness Measure for Association Patterns

Selecting the Right Interestingness Measure for Association Patterns Selecting the Right ingness Measure for Association Patterns Pang-Ning Tan Department of Computer Science and Engineering University of Minnesota 2 Union Street SE Minneapolis, MN 55455 ptan@csumnedu Vipin

More information

Nantonac Collaborative Filtering Recommendation Based on Order Responces. 1 SD Recommender System [23] [26] Collaborative Filtering; CF

Nantonac Collaborative Filtering Recommendation Based on Order Responces. 1 SD Recommender System [23] [26] Collaborative Filtering; CF Nantonac Collaborative Filtering Recommendation Based on Order Responces (Toshihiro KAMISHIMA) (National Institute of Advanced Industrial Science and Technology (AIST)) Abstract: A recommender system suggests

More information

Mining Posets from Linear Orders

Mining Posets from Linear Orders Mining Posets from Linear Orders Proceso L. Fernandez 1, Lenwood S. Heath 2, Naren Ramakrishnan 2, Michael Tan 1 and John Paul C. Vergara 1 corresponding author (pfernandez@ateneo.edu) 1 Department of

More information

LINGO optimization model-based gymnastics team competition best starting line-up research

LINGO optimization model-based gymnastics team competition best starting line-up research Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 0, 6(7):98-95 Research Article ISSN : 0975-738 CODEN(USA) : JCPRC5 LINGO optimization model-based gymnastics team competition

More information

An Analysis of Quantitative Measures Associated with Rules

An Analysis of Quantitative Measures Associated with Rules An Analysis of Quantitative Measures Associated with Rules Y.Y. Yao 1 and Ning Zhong 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca

More information

Approximate counting: count-min data structure. Problem definition

Approximate counting: count-min data structure. Problem definition Approximate counting: count-min data structure G. Cormode and S. Muthukrishhan: An improved data stream summary: the count-min sketch and its applications. Journal of Algorithms 55 (2005) 58-75. Problem

More information

On the average diameter of directed loop networks

On the average diameter of directed loop networks 34 2 Vol.34 No. 2 203 2 Journal on Communications February 203 doi:0.3969/j.issn.000-436x.203.02.07 2. 243002 2. 24304 L- 4 a b p q 2 L- TP393.0 B 000-436X(203)02-038-09 On the average diameter of directed

More information

arxiv: v1 [cs.cv] 28 Nov 2017

arxiv: v1 [cs.cv] 28 Nov 2017 A fatal point concept and a low-sensitivity quantitative measure for traffic safety analytics arxiv:1711.10131v1 [cs.cv] 28 Nov 2017 Shan Suthaharan Department of Computer Science University of North Carolina

More information

Conditional Sparse Linear Regression

Conditional Sparse Linear Regression Conditional Sparse Linear Regression Brendan Juba Washington University in St. Louis bjuba@wustl.edu August 17, 2016 Abstract Machine learning and statistics typically focus on building models that capture

More information

Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules. M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D.

Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules. M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D. Detecting Anomalous and Exceptional Behaviour on Credit Data by means of Association Rules M. Delgado, M.D. Ruiz, M.J. Martin-Bautista, D. Sánchez 18th September 2013 Detecting Anom and Exc Behaviour on

More information

Maintaining Frequent Itemsets over High-Speed Data Streams

Maintaining Frequent Itemsets over High-Speed Data Streams Maintaining Frequent Itemsets over High-Speed Data Streams James Cheng, Yiping Ke, and Wilfred Ng Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Kowloon,

More information

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data Jinzhong Wang April 13, 2016 The UBD Group Mobile and Social Computing Laboratory School of Software, Dalian University

More information

EFFICIENT MINING OF WEIGHTED QUANTITATIVE ASSOCIATION RULES AND CHARACTERIZATION OF FREQUENT ITEMSETS

EFFICIENT MINING OF WEIGHTED QUANTITATIVE ASSOCIATION RULES AND CHARACTERIZATION OF FREQUENT ITEMSETS EFFICIENT MINING OF WEIGHTED QUANTITATIVE ASSOCIATION RULES AND CHARACTERIZATION OF FREQUENT ITEMSETS Arumugam G Senior Professor and Head, Department of Computer Science Madurai Kamaraj University Madurai,

More information

NEGATED ITEMSETS OBTAINING METHODS FROM TREE- STRUCTURED STREAM DATA

NEGATED ITEMSETS OBTAINING METHODS FROM TREE- STRUCTURED STREAM DATA NEGATED ITEMSETS OBTAINING METHODS FROM TREE- STRUCTURED STREAM DATA JURYON PAIK Pyeongtaek University, Department of Digital Information and Statistics, Gyeonggi-do 17869, South Korea E-mail: jrpaik@ptu.ac.kr

More information