An Efficient Algorithm for Discovering Frequent Subgraphs


To appear in IEEE Transactions on Knowledge and Data Engineering

Michihiro Kuramochi and George Karypis, Member, IEEE
Department of Computer Science, University of Minnesota
EE/CS Building, 200 Union St SE, Minneapolis, MN
{kuram, karypis}@cs.umn.edu

Abstract

Over the years, frequent itemset discovery algorithms have been used to find interesting patterns in various application areas. However, as data mining techniques are being increasingly applied to non-traditional domains, existing frequent pattern discovery approaches cannot be used. This is because the transaction framework that is assumed by these algorithms cannot be used to effectively model the datasets in these domains. An alternate way of modeling the objects in these datasets is to represent them using graphs. Within that model, one way of formulating the frequent pattern discovery problem is as that of discovering subgraphs that occur frequently over the entire set of graphs. In this paper we present a computationally efficient algorithm, called FSG, for finding all frequent subgraphs in large graph datasets. We experimentally evaluate the performance of FSG using a variety of real and synthetic datasets. Our results show that despite the underlying complexity associated with frequent subgraph discovery, FSG is effective in finding all frequently occurring subgraphs in datasets containing over 200,000 graph transactions and scales linearly with respect to the size of the dataset.

Index Terms: Data mining, scientific datasets, frequent pattern discovery, chemical compound datasets.

I. INTRODUCTION

Efficient algorithms for finding frequent patterns, both sequential and non-sequential, in very large datasets have been one of the key success stories of data mining research [1], [2], [20], [36], [41], [49]. Nevertheless, as data mining techniques have been increasingly applied to non-traditional domains, there is a need to develop efficient and general-purpose frequent pattern discovery algorithms that are capable of capturing the strong spatial, topological, geometric, and/or relational nature of the datasets that characterize these domains.
In recent years, labeled topological graphs have emerged as a promising abstraction to capture the characteristics of these datasets. In this approach, each object to be analyzed is represented via a separate graph whose vertices correspond to the entities in the object and the edges correspond to the relations between them. Within that model, one way of formulating the frequent pattern discovery problem is as that of discovering subgraphs that occur frequently over the entire set of graphs.

The power of graphs to model complex datasets has been recognized by various researchers [3], [6], [10], [14], [19], [23], [26], [30], [37], [43], [46] as it allows us to represent arbitrary relations among entities and solve problems that we could not previously solve. For instance, consider the problem of mining chemical compounds to find recurrent substructures. We can achieve that using a graph-based pattern discovery algorithm by creating a graph for each one of the compounds whose vertices correspond to different atoms, and whose edges correspond to bonds between them. We can assign to each vertex a label corresponding to the atom involved (and potentially its charge), and assign to each edge a label corresponding to the type of the bond (and potentially information about their relative 3D orientation). Once these graphs have been created, recurrent substructures across different compounds become frequently occurring subgraphs. In fact, within the context of chemical compound classification, such techniques have been used to mine chemical compounds and identify the substructures that best discriminate between the different classes [5], [11], [27], [42], and were shown to produce superior classifiers than more traditional methods [21].

Developing algorithms that discover all frequently occurring subgraphs in a large graph dataset is particularly challenging and computationally intensive, as graph and subgraph isomorphisms play a key role throughout the computations.

Acknowledgment: This work was supported by NSF CCR, EIA, ACI and ACI, Army Research Office contract DA/DAAG, and Army High Performance Computing Research Center contract number DAAH04-95-C. Access to computing facilities was provided by the Minnesota Supercomputing Institute.
In this paper we present a new algorithm, called FSG, for finding all connected subgraphs that appear frequently in a large graph dataset. Our algorithm finds frequent subgraphs using the level-by-level expansion strategy adopted by Apriori [2]. The key features of FSG are the following: (i) it uses a sparse graph representation that minimizes both storage and computation; (ii) it increases the size of frequent subgraphs by adding one edge at a time, allowing it to generate the candidates efficiently; (iii) it incorporates various optimizations for candidate generation and frequency counting which enable it to scale to large graph datasets; and (iv) it uses sophisticated algorithms for canonical labeling to uniquely identify the various generated subgraphs without having to resort to computationally expensive graph- and subgraph-isomorphism computations.

We experimentally evaluated FSG on three types of datasets. The first two datasets correspond to various chemical compounds containing over 200,000 transactions and frequent patterns whose size is large, and the third type corresponds to various graph datasets that were synthetically generated using a framework similar to that used for market-basket transaction generation [2]. Our results illustrate that FSG can operate on very large graph datasets, find all frequently occurring subgraphs in a reasonable amount of time, and scale linearly with the dataset size. For example, in a dataset containing over 200,000 chemical compounds, FSG can discover all subgraphs that occur in at least 1% of the transactions in approximately one hour. Furthermore, our detailed evaluation using the synthetically generated graphs shows that for datasets that have a moderately large number of different vertex and edge labels, FSG is able to achieve good performance as the transaction size increases.

The rest of the paper is organized as follows. Section II provides some definitions and introduces the notation that is used in the paper. Section III formally defines the problem of frequent subgraph discovery and discusses the modeling strengths of the discovered patterns and the challenges associated with finding them in a computationally efficient manner. Section IV describes in detail the algorithm. Section V describes the various optimizations that we developed for efficiently computing the canonical label of the patterns. Section VI provides a detailed experimental evaluation of FSG on a large number of real and synthetic datasets. Section VII describes the related research in this area, and finally, Section VIII provides some concluding remarks.

II. DEFINITIONS AND NOTATION

A graph $G = (V, E)$ is made of two sets, the set of vertices $V$ and the set of edges $E$. Each edge itself is a pair of vertices, and throughout this paper we assume that the graph is undirected, i.e., each edge is an unordered pair of vertices. Furthermore, we will assume that the graph is labeled.
That is, each vertex and edge has a label associated with it that is drawn from a predefined set of vertex labels ($L_V$) and edge labels ($L_E$). Each vertex (or edge) of the graph is not required to have a unique label and the same label can be assigned to many vertices (or edges) in the same graph.

Given a graph $G = (V, E)$, a graph $G_s = (V_s, E_s)$ will be a subgraph of $G$ if and only if $V_s \subseteq V$ and $E_s \subseteq E$, and it will be an induced subgraph of $G$ if $V_s \subseteq V$ and $E_s$ contains all the edges of $E$ that connect vertices in $V_s$. A graph is connected if there is a path between every pair of vertices in the graph. Two graphs $G_1 = (V_1, E_1)$ and $G_2 = (V_2, E_2)$ are isomorphic if they are topologically identical to each other, that is, there is a mapping from $V_1$ to $V_2$ such that each edge in $E_1$ is mapped to a single edge in $E_2$ and vice versa. In the case of labeled graphs, this mapping must also preserve the labels on the vertices and edges. An automorphism is an isomorphism mapping where $G_1 = G_2$. Given two graphs $G_1 = (V_1, E_1)$ and $G_2 = (V_2, E_2)$, the problem of subgraph isomorphism is to find an isomorphism between $G_2$ and a subgraph of $G_1$, i.e., to determine whether or not $G_2$ is included in $G_1$.

TABLE I
NOTATION USED THROUGHOUT THE PAPER

| Notation | Description |
|---|---|
| $k$-subgraph | A connected subgraph with $k$ edges (also written as a size-$k$ subgraph) |
| $G^k$, $H^k$ | (Sub)graphs of size $k$ |
| $E(G)$ | Edges of a (sub)graph $G$ |
| $V(G)$ | Vertices of a (sub)graph $G$ |
| $cl(g)$ | A canonical label of a graph $G$ |
| $a, b, c, e, f$ | Edges |
| $u, v$ | Vertices |
| $d(v)$ | Degree of a vertex $v$ |
| $l(v)$ | The label of a vertex $v$ |
| $l(e)$ | The label of an edge $e$ |
| $H = G - e$ | $H$ is a graph obtained by the deletion of edge $e \in E(G)$ |
| $D$ | A dataset of graph transactions |
| $\{D_1, D_2, \ldots, D_N\}$ | Disjoint $N$ partitions of $D$ (for any $i$ and $j$, $i \neq j$, $D_i \cap D_j = \emptyset$ and $\bigcup_i D_i = D$) |
| $T$ | A graph transaction |
| $C$ | A candidate subgraph |
| $C^k$ | A set of candidates with $k$ edges |
| $\mathcal{C}$ | A set of all candidates |
| $F$ | A frequent subgraph |
| $F^k$ | A set of frequent $k$-subgraphs |
| $\mathcal{F}$ | A set of all frequent subgraphs |
| $k^*$ | The size of the largest frequent subgraph in $D$ |
| $L_E$ | A set of all edge labels in $D$ |
| $L_V$ | A set of all vertex labels in $D$ |
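The subgraph and induced-subgraph definitions of this section can be checked directly on vertex and edge sets. The following is a minimal sketch, not part of FSG: the function names and the representation of an undirected edge as a `frozenset` of its two endpoints are assumptions made for illustration, and labels are ignored.

```python
def is_subgraph(vs, es, v, e):
    """True if (vs, es) is a subgraph of (v, e): Vs is a subset of V, Es of E."""
    # Also require every edge of es to have both endpoints in vs,
    # so that (vs, es) is a well-formed graph.
    well_formed = all(edge <= vs for edge in es)
    return well_formed and vs <= v and es <= e


def is_induced_subgraph(vs, es, v, e):
    """True if es holds exactly the edges of e that connect vertices in vs."""
    if not vs <= v:
        return False
    induced_edges = {edge for edge in e if edge <= vs}
    return es == induced_edges


# A triangle on vertices 1, 2, 3 (edges are unordered pairs).
V = {1, 2, 3}
E = {frozenset({1, 2}), frozenset({2, 3}), frozenset({1, 3})}

# A path 1-2-3 uses all three vertices but omits edge {1, 3}:
# it is a subgraph of the triangle, but not an induced one.
path_edges = {frozenset({1, 2}), frozenset({2, 3})}
print(is_subgraph(V, path_edges, V, E))
print(is_induced_subgraph(V, path_edges, V, E))
```

The triangle example shows why the two notions differ: keeping all three vertices forces an induced subgraph to keep all three edges.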
The canonical label of a graph $G = (V, E)$, $cl(g)$, is defined to be a unique code (i.e., a sequence of bits, a string, or a sequence of numbers) that is invariant on the ordering of the vertices and edges in the graph [15]. As a result, two graphs will have the same canonical label if they are isomorphic. Examples of different canonical label codes and details on how they are computed are presented in Section V. Both canonical labeling and determining graph isomorphism are not known to be either in P or in NP-complete [15].

The size of a graph $G = (V, E)$ is defined to be equal to $|E|$. Given a size-$k$ connected graph $G = (V, E)$, by adding an edge we will refer to the operation in which an edge $e = (u, v)$ is added to the graph so that the resulting size-$(k+1)$ graph remains connected. Similarly, by deleting an edge we refer to the operation in which $e = (u, v)$ such that $e \in E$ is deleted from the graph and the resulting size-$(k-1)$ graph remains connected. Note that depending on the particular choice of $e$, the deletion of the edge may result in deleting at most one of its incident vertices if that vertex has only $e$ as its incident edge. Finally, the notation that we will be using throughout the paper is shown in Table I.

III. FREQUENT SUBGRAPH DISCOVERY: PROBLEM DEFINITION

The problem of finding frequently occurring connected subgraphs in a set of graphs is defined as follows:

Definition 1 (Subgraph Discovery): Given a set of graphs $D$ each of which is an undirected labeled graph, and a parameter $\sigma$ such that $0 < \sigma \le 1$, find all connected undirected graphs that are subgraphs in at least $\sigma|D|$ of the input graphs.

We will refer to each of the graphs in $D$ as a graph transaction or simply transaction when the context is clear, to $D$ as the graph transaction dataset, and to $\sigma$ as the support threshold.

There are two key aspects in the above problem statement. First, we are only interested in subgraphs that are connected. This is motivated by the fact that the resulting frequent subgraphs will be encapsulating relations (or edges) between some of the entities (or vertices) of various objects. Within this context, connectivity is a natural property of frequent patterns. An additional benefit of this restriction is that it reduces the complexity of the problem, as we do not need to consider disconnected combinations of frequent connected subgraphs. Second, we allow the graphs to be labeled, and as discussed in Section II, input graph transactions and discovered frequent patterns can contain multiple vertices and edges carrying the same label. This greatly increases our modeling ability, as it allows us to find patterns involving multiple occurrences of the same entities and relations, but at the same time makes the problem of finding such frequently occurring subgraphs non-trivial. This is because in such cases, any frequent subgraph discovery algorithm needs to correctly identify how a particular subgraph maps to the vertices and edges of each graph transaction, which can only be done by solving many instances of the subgraph isomorphism problem, which has been shown to be in NP-complete [16].

IV. FSG: FREQUENT SUBGRAPH DISCOVERY ALGORITHM

In developing our frequent subgraph discovery algorithm, we decided to follow the level-by-level structure of the Apriori [2] algorithm used for finding frequent itemsets. The motivation behind this choice is the fact that the level-by-level structure of Apriori requires the smallest number of subgraph isomorphism computations during frequency counting, as it allows it to take full advantage of the downward closed property of the minimum support constraint and achieves the highest amount of pruning when compared with the most recently developed depth-first-based approaches such as dEclat [49], Tree Projection [1], and FP-growth [20].
In fact, despite the extra overhead due to candidate generation that is incurred by the level-by-level approach, recent studies have shown that because of its effective pruning, it achieves comparable performance with that achieved by the various depth-first-based approaches, as long as the data set is not dense or the support value is not extremely small [18], [22].

The overall flow of our algorithm, called FSG, is similar to that of Apriori, and works as follows. FSG starts by enumerating all frequent single- and double-edge subgraphs. Then, it enters its main computational phase, which consists of a main iteration loop. During each iteration, FSG first generates all candidate subgraphs whose size is greater than the previous frequent ones by one edge, and then counts the frequency for each of these candidates and prunes subgraphs that do not satisfy the support constraint. FSG stops when no frequent subgraphs are generated for a particular iteration. Details on how FSG generates the candidate subgraphs, and on how it computes their frequency, are provided in Section IV-A and Section IV-B, respectively.

To ensure that the various graph-related operations are performed efficiently, FSG stores the various input graphs and the various candidate and frequent subgraphs that it generates using an adjacency list representation.

A. Candidate Generation

FSG generates candidate subgraphs of size $k+1$ by joining two frequent size-$k$ subgraphs. In order for two such frequent size-$k$ subgraphs to be eligible for joining they must contain the same size-$(k-1)$ connected subgraph. The simplest way to generate the complete set of candidate subgraphs is to join all pairs of size-$k$ frequent subgraphs that have a common size-$(k-1)$ subgraph. Unfortunately, the problem with this approach is that a particular size-$k$ subgraph can have up to $k$ different size-$(k-1)$ subgraphs. As a result, if we consider all such possible subgraphs and perform the resulting join operations, we will end up generating the same candidate pattern multiple times, and generating a large number of candidate patterns that are not downward closed.
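The remark that a size-$k$ subgraph can have up to $k$ different size-$(k-1)$ connected subgraphs can be made concrete by deleting one edge at a time and keeping the results that stay connected. The sketch below is illustrative only: the function names and the edge-list representation are assumptions, labels are ignored, and the results are not deduplicated up to isomorphism (which FSG does through canonical labels).

```python
from collections import defaultdict


def is_connected(edges):
    """Check connectivity of the graph spanned by a non-empty edge list."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    start = next(iter(adj))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == len(adj)


def connected_edge_deletions(edges):
    """Return every size-(k-1) edge list obtained by deleting one edge
    such that the remainder is still connected."""
    edges = list(edges)
    results = []
    for i in range(len(edges)):
        rest = edges[:i] + edges[i + 1:]
        # Vertices are implied by the remaining edges, so a vertex whose
        # only incident edge was deleted disappears automatically.
        if rest and is_connected(rest):
            results.append(rest)
    return results


triangle = [("a", "b"), ("b", "c"), ("a", "c")]
path = [("a", "b"), ("b", "c"), ("c", "d")]
print(len(connected_edge_deletions(triangle)))  # every deletion keeps it connected
print(len(connected_edge_deletions(path)))      # deleting the middle edge disconnects it
```

For the triangle all $k = 3$ deletions are valid, while for the three-edge path only the two end edges can be deleted.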
The net effect of this is that the resulting algorithm spends a significant amount of time identifying unique candidates and eliminating non-downward closed candidates (both of which are non-trivial operations, as they require determining the canonical label of the generated subgraphs). Note that candidate generation approaches in the context of frequent itemsets (e.g., Apriori [2]) do not suffer from this problem because they use a consistent way to order the items within an itemset (e.g., lexicographically). Using this ordering, they only join two size-$k$ itemsets if they have the same $(k-1)$-prefix. For example, a particular itemset $\{A, B, C, D\}$ will only be generated once (by joining $\{A, B, C\}$ and $\{A, B, D\}$), and if that itemset is not downward closed, it will never be generated if only its $\{A, B, C\}$ and $\{B, C, D\}$ subsets were frequent.

Fortunately, the situation for subgraph candidate generation is not as severe as the above discussion seems to indicate, and FSG addresses both of these problems by only joining two frequent subgraphs if and only if they share a certain, properly selected, size-$(k-1)$ subgraph. Specifically, for each frequent size-$k$ subgraph $F_i$, let $P(F_i) = \{H_{i,1}, H_{i,2}\}$ be the two size-$(k-1)$ connected subgraphs of $F_i$ such that $H_{i,1}$ has the smallest canonical label and $H_{i,2}$ has the second smallest canonical label among the various connected size-$(k-1)$ subgraphs of $F_i$. We will refer to these subgraphs as the primary subgraphs of $F_i$. Note that if every size-$(k-1)$ subgraph of $F_i$ is isomorphic to each other, $H_{i,1} = H_{i,2}$ and $|P(F_i)| = 1$. FSG will only join two frequent subgraphs $F_i$ and $F_j$ if and only if $P(F_i) \cap P(F_j) \neq \emptyset$, and the join operation will be done with respect to the common size-$(k-1)$ subgraph(s). The proof that this approach will correctly generate all valid candidate subgraphs is presented in the Appendix. This candidate generation approach dramatically reduces the number of redundant and non-downward closed patterns that are generated and leads to significant performance improvements over the naive approach (originally implemented in [29]).
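The itemset prefix join that this discussion contrasts with the subgraph case can be sketched as follows. This is a minimal illustration of the Apriori-style join on lexicographically ordered itemsets, not FSG's subgraph join; the function name and the representation of itemsets as sorted tuples are assumptions.

```python
def apriori_join(frequent):
    """Join size-k itemsets (sorted tuples) that share the same (k-1)-prefix.

    Each size-(k+1) candidate is produced at most once, because only the
    pair that differs in its last item is allowed to join.
    """
    items = sorted(set(frequent))
    candidates = []
    for i, a in enumerate(items):
        for b in items[i + 1:]:
            if a[:-1] == b[:-1]:
                # a < b with equal prefixes implies a[-1] < b[-1],
                # so the result is again a sorted tuple.
                candidates.append(a + (b[-1],))
    return candidates


# {A,B,C} and {A,B,D} share the prefix (A,B) and yield {A,B,C,D} once.
print(apriori_join([("A", "B", "C"), ("A", "B", "D")]))

# {A,B,C} and {B,C,D} have different prefixes, so nothing is generated.
print(apriori_join([("A", "B", "C"), ("B", "C", "D")]))
```

Subgraphs have no such natural item order, which is why FSG relies on canonical labels and primary subgraphs to obtain a comparable restriction on which pairs are joined.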
The actual join operation of two frequent size-$k$ subgraphs $F_i$ and $F_j$ that have a common primary subgraph $H$ is performed by generating a candidate size-$(k+1)$ subgraph that contains $H$ plus the two edges that were deleted from $F_i$ and $F_j$ to obtain $H$. However, unlike the joining of itemsets in which two frequent size-$k$ itemsets lead to a unique size-$(k+1)$ itemset, the joining of two size-$k$ subgraphs may produce multiple distinct size-$(k+1)$ candidates. This happens for the following two reasons. First, the difference between the common primary subgraph and the two frequent subgraphs can be a vertex that has the same label. In this case, the joining of such size-$k$ subgraphs will generate two distinct subgraphs of size $k+1$. Fig. 1(a) shows such an example, in which the pair of graphs $G^4_a$ and $G^4_b$ generates two different candidates $G^5_a$ and $G^5_b$. Second, the primary subgraph itself may have multiple automorphisms, and each of them can lead to a different size-$(k+1)$ candidate. In the worst case, when the primary subgraph is an unlabeled clique, the number of automorphisms is $k!$. An example for this case is shown in Fig. 1(b), in which the primary subgraph, a square of four vertices labeled with $a$, has four automorphisms resulting in three different candidates of size six.

Fig. 1. Two cases of joining. (a) By vertex labeling. (b) By multiple automorphisms of a single core.

Finally, in addition to joining two different subgraphs, FSG also needs to perform self join. This happens, for example, when the two graphs $G^k_i$ and $G^k_j$ in Fig. 1 are identical. It is necessary because, for example, consider graph transactions without any labels. Then, there will be only one frequent size-1 subgraph and one frequent size-2 subgraph regardless of the support threshold, because those are the only allowed structures, and edges and vertices do not have labels assigned. In general, whenever $|F^k| = 1$, self join is necessary to obtain a set of valid $(k+1)$-candidates.

B. Frequency Counting

The simplest way to determine the frequency of each candidate subgraph is to scan each one of the dataset transactions and determine if it is contained or not using subgraph isomorphism. Nonetheless, having to compute these isomorphisms is particularly expensive and this approach is not feasible for large datasets.
In the context of frequent itemset discovery by Apriori, the frequency counting is performed substantially faster by building a hash-tree of candidate itemsets and scanning each transaction to determine which of the itemsets in the hash-tree it supports. Developing such an algorithm for frequent subgraphs, however, is challenging as there is no natural way to build the hash-tree for graphs.

For this reason, FSG instead uses transaction identifier (TID) lists, proposed by [13], [40], [47]. In this approach, for each frequent subgraph FSG keeps a list of transaction identifiers that support it. Now when FSG needs to compute the frequency of $G^{k+1}$, it first computes the intersection of the TID lists of its frequent $k$-subgraphs. If the size of the intersection is below the support, $G^{k+1}$ is pruned; otherwise FSG computes the frequency of $G^{k+1}$ using subgraph isomorphism by limiting the search only to the set of transactions in the intersection of the TID lists. The advantages of this approach are two-fold. First, in the cases where the intersection of the TID lists is below the minimum support level, FSG is able to prune the candidate subgraph without performing any subgraph isomorphism computations. Second, when the intersection set is sufficiently large, FSG only needs to compute subgraph isomorphisms for those graphs that can potentially contain the candidate subgraph and not for all the graph transactions.

1) Reducing Memory Requirements of TID lists: The computational advantages of TID lists come at the expense of higher memory requirements for maintaining them. To address this limitation we implemented a database-partitioning-based scheme that was motivated by a similar scheme developed for mining frequent itemsets [39]. In this approach, the database is partitioned into $N$ disjoint parts $D = \{D_1, D_2, \ldots, D_N\}$. Each of these sub-databases $D_i$ is mined to find a set of frequent subgraphs $\mathcal{F}_i$, called local frequent subgraphs. The union of the local frequent subgraphs $\mathcal{C} = \bigcup_i \mathcal{F}_i$, called global candidates, is determined and their frequency in the entire database is computed by reading each graph transaction and finding the set of subgraphs that it supports.
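The TID-list step of frequency counting described in this subsection can be sketched as follows. This is a simplified illustration under stated assumptions, not FSG's implementation: `count_support`, its parameters, and the dictionary of transactions are hypothetical, and `contains` stands for a subgraph isomorphism test that the caller must supply.

```python
def count_support(parent_tid_lists, transactions, candidate, contains, min_count):
    """Return the TID set supporting `candidate`, or None if it is pruned.

    parent_tid_lists: TID collections of the candidate's frequent k-subgraphs
    transactions:     dict mapping TID -> graph transaction
    contains:         caller-supplied test, contains(transaction, candidate)
    min_count:        minimum number of supporting transactions
    """
    # Only transactions supporting every frequent k-subgraph can
    # possibly support the (k+1)-candidate.
    tids = set.intersection(*(set(t) for t in parent_tid_lists))
    if len(tids) < min_count:
        return None  # pruned without any subgraph isomorphism test

    # The expensive containment test runs only on the intersection.
    supporting = {tid for tid in tids if contains(transactions[tid], candidate)}
    return supporting if len(supporting) >= min_count else None


# Stand-in data: transactions and candidates are plain edge-ID sets and
# "containment" is a set-inclusion check, used here only to exercise
# the pruning logic (it is NOT a real subgraph isomorphism test).
transactions = {0: {1, 2, 3}, 1: {1, 2}, 2: {2, 3}, 3: {1, 2, 3}}
parents = [[0, 1, 3], [0, 2, 3]]
result = count_support(parents, transactions, {1, 2, 3},
                       lambda t, c: c <= t, min_count=2)
print(result)
```

Returning the supporting TID set rather than a bare count lets the result serve as the TID list of the new frequent subgraph at the next level.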
The subset of $\mathcal{C}$ that satisfies the minimum support constraint is output as the final set of frequent patterns $\mathcal{F}$. Since the memory required for storing the TID lists depends on the size of the database, their overall memory requirements can be reduced by partitioning the database into a sufficiently large number of partitions.

One of the problems with a naive implementation of the above algorithm is that it can dramatically increase the number of subgraph isomorphism operations that are required to determine the frequency of the global candidate set. In order to address this problem, FSG incorporates three techniques: (i) a priori pruning the number of candidate subgraphs that need to be considered; (ii) using bitmaps to limit the frequency counting of a particular candidate subgraph to only those partitions for which this frequency has not already been determined locally; and (iii) taking advantage of the lattice structure of $\mathcal{C}$ to check each graph transaction only against the subgraphs that are descendants of patterns that are already supported by that transaction. The net effect of these optimizations is that, as shown in Section VI-A.1, FSG's overall run-time increases slowly as the number of partitions increases.

The a priori pruning of the candidate subgraphs is achieved as follows. For each partition $D_i$, FSG finds the set of local frequent subgraphs and the set of local negative border subgraphs (a local negative border subgraph is one that is generated as a local candidate subgraph but does not satisfy the minimum threshold for the partition), and stores them into a file $S_i$ along with their associated frequencies. Then, it organizes the union of the local frequent and local negative border subgraphs across the various partitions into a lattice structure (called pattern lattice), by incrementally incorporating the information from each file $S_i$. Then, for each node $v$ of the pattern lattice it computes an upper bound $f^*(v)$ of its occurrence frequency by adding the corresponding upper bounds for each one of the $N$ partitions, $f^*(v) = f^*_1(v) + \cdots + f^*_N(v)$. For each partition $D_i$, $f^*_i(v)$ is determined using the following equation:

$$
f^*_i(v) =
\begin{cases}
f_i(v), & \text{if } v \in S_i \\
\min_u \left( f^*_i(u) \right), & \text{otherwise}
\end{cases}
$$

where $f_i(v)$ is the actual frequency of the pattern corresponding to node $v$ in $D_i$, and $u$ is a connected subgraph of $v$ that is smaller than it by one edge (i.e., it is its parent in the lattice). Note that the various $f^*_i(v)$ values can be computed in a bottom-up fashion by a single scan of $S_i$, and used directly to update the overall $f^*(v)$ values. Now, given this set of frequency upper bounds, FSG proceeds to prune the nodes of the pattern lattice that are either infrequent or fail the downward closure property.

V. CANONICAL LABELING

FSG relies on canonical labeling to efficiently check if a particular pattern satisfies the downward closure property of the support condition and to eliminate duplicate candidate subgraphs. Developing algorithms that can efficiently compute the canonical label of the various subgraphs is critical to ensure that FSG can scale to very large graph datasets.

Recall from Section II that the canonical label of a graph is nothing more than a code that uniquely identifies the graph such that if two graphs are isomorphic to each other, they will be assigned the same code. A simple way of defining the canonical label of a graph is as the string obtained by concatenating the upper triangular entries of the graph's adjacency matrix when this matrix has been symmetrically permuted so that this string becomes the lexicographically largest (or smallest) over the strings that can be obtained from all such permutations. This is illustrated in Fig. 2, which shows a graph $G^3$ and the permutation of its adjacency matrix that leads to its canonical label. (The symbol $v_i$ in the figure is a vertex ID, not a vertex label, and blank elements in the adjacency matrix mean there is no edge between the corresponding pair of vertices. This notation will be used in the rest of the section.) In this code, the first part is obtained by concatenating the vertex labels in the order in which they appear in the adjacency matrix, and the second part is obtained by concatenating the columns of the upper triangular portion of the matrix. Note that any other permutation of $G^3$'s adjacency matrix will lead to a code that is lexicographically smaller than (or equal to) this one.

Fig. 2. Simple examples of codes and canonical adjacency matrices: (a) the graph $G^3$; (b) and (c) two vertex orderings of its adjacency matrix with the resulting codes.

If a graph has $|V|$ vertices, the complexity of determining its canonical label using this scheme is in $O(|V|!)$, making it impractical even for moderate size graphs. In practice, the complexity of finding the canonical label of a graph can be reduced by using various heuristics to narrow down the search space or by using alternate canonical label definitions that take advantage of special properties that may exist in a particular set of graphs [15], [31], [32]. In particular, the Nauty program [31] developed by Brendan McKay implements a number of such heuristics and has been shown to scale reasonably well to moderate size graphs. Unfortunately, Nauty does not allow graphs to have edge labels and as such it cannot be used directly by FSG. As a result we developed our own canonical labeling algorithm that incorporates some of the existing heuristics extended to vertex- and edge-labeled graphs as well as a number of new heuristics that are well-suited for our particular problem. Details of our canonical labeling algorithm are provided in the rest of this section.

Note that our canonical labeling algorithm operates on the adjacency matrix representation of a graph. For this reason, FSG converts its internal adjacency list representation of each candidate or frequent subgraph into its corresponding adjacency matrix representation, prior to computing its canonical label.
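The simple $O(|V|!)$ definition of the canonical label given at the start of this section can be written down directly. The sketch below is a brute-force illustration, not FSG's optimized algorithm; it assumes single-character vertex and edge labels, uses the character "0" for a missing edge, and returns the code as a pair (vertex-label part, upper-triangular part). The function name and input layout are hypothetical.

```python
from itertools import permutations


def canonical_label(vertex_labels, edge_labels):
    """Lexicographically largest code over all vertex permutations.

    vertex_labels: list of one-character labels, indexed by vertex ID
    edge_labels:   dict {(u, v): one-character label} for undirected edges
    """
    n = len(vertex_labels)

    def entry(u, v):
        # Adjacency-matrix entry; "0" marks a missing edge (an assumption).
        if (u, v) in edge_labels:
            return edge_labels[(u, v)]
        return edge_labels.get((v, u), "0")

    best = None
    for perm in permutations(range(n)):
        vertex_part = "".join(vertex_labels[p] for p in perm)
        # Columns of the upper triangle: for column j, rows 0 .. j-1.
        edge_part = "".join(entry(perm[i], perm[j])
                            for j in range(n) for i in range(j))
        code = (vertex_part, edge_part)
        if best is None or code > best:
            best = code
    return best


# Two numberings of the same labeled triangle get the same label.
g1 = canonical_label(["a", "a", "a"], {(0, 1): "x", (1, 2): "y", (0, 2): "z"})
g2 = canonical_label(["a", "a", "a"], {(2, 0): "x", (0, 1): "y", (2, 1): "z"})
# A triangle with different edge labels gets a different one.
g3 = canonical_label(["a", "a", "a"], {(0, 1): "x", (1, 2): "y", (0, 2): "y"})
print(g1 == g2, g1 == g3)
```

Because every permutation is tried, the running time grows factorially with the number of vertices, which is what the vertex-invariant heuristics of the following subsections are meant to reduce.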
Once the canonical label has been obtained, the adjacency matrix representation is discarded.

A. Vertex Invariants

Vertex invariants [15] are some inherent properties of the vertices that do not change across isomorphism mappings. An example of such an isomorphism-invariant property is the degree or label of a vertex, which remains the same regardless of the mapping (i.e., vertex ordering). Vertex invariants can be used to partition the vertices of the graph into equivalence classes such that all the vertices assigned to the same partition have the same values for the vertex invariants. Using these partitions we can define the canonical label of a graph to be the lexicographically largest code obtained by concatenating the columns of the upper triangular adjacency matrix (as it was done earlier), over all possible permutations of the vertices subject to the constraint that the vertices of each one of the partitions are numbered consecutively. Thus, the only modification over our earlier definition is that instead of maximizing over all permutations of the vertices, we only maximize over those permutations that keep the vertices in each partition together. Note that two graphs that are isomorphic will lead to the same partitioning of the vertices and they will be assigned the same canonical label.

If $m$ is the number of partitions created by using vertex invariants, containing $p_1, p_2, \ldots, p_m$ vertices, respectively, then the number of different permutations that we need to consider is $\prod_{i=1}^{m} (p_i!)$, which can be substantially smaller than the $|V|!$ permutations required by the earlier approach. We have incorporated in FSG three types of vertex invariants that utilize information about the degrees and labels of the vertices, the labels and degrees of their adjacent vertices, and information about their adjacent partitions.

a) Vertex Degrees and Labels: This invariant partitions vertices into disjoint groups such that each partition contains vertices with the same label and the same degree. Fig. 3 illustrates the partitioning induced by this set of invariants for an example graph of size four. Based on their degree and their labels, the vertices are partitioned into three groups $p_0 = \{v_1\}$, $p_1 = \{v_0, v_3\}$ and $p_2 = \{v_2\}$ as shown in Fig. 3(c). Fig. 3 shows the adjacency matrix corresponding to the partition-constrained permutation that leads to the canonical label of the graph. Using the partitioning based on vertex invariants, we try only $1! \times 2! \times 1! = 2$ permutations, although the total number of permutations for four vertices is $4! = 24$.

Fig. 3. A sample graph of size three and its adjacency matrices.

b) Neighbor Lists: Invariants that lead to finer-grain partitioning can be created by incorporating information about the labels of the edges incident on each vertex, the degrees of the adjacent vertices, and their labels. In particular, we describe an adjacent vertex $v$ by a tuple $(l(e), d(v), l(v))$ where $l(e)$ is the label of the incident edge $e$, $d(v)$ is the degree of the adjacent vertex $v$, and $l(v)$ is its vertex label. Now, for each vertex $u$, we construct its neighbor list $nl(u)$ that contains the tuples for each one of its adjacent vertices. Using these neighbor lists, we then partition the vertices into disjoint sets such that two vertices $u$ and $v$ will be in the same partition if and only if $nl(u) = nl(v)$. Note that this partitioning is performed within the partitions already computed by the previous set of invariants. Fig. 4 illustrates the partitioning produced by also incorporating the neighbor list invariant on the graph of Fig. 4(a).
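The two invariants described so far, and the resulting count $\prod_i (p_i!)$ of permutations, can be sketched as follows. The code is illustrative only: the function names, the edge dictionary layout, and both example graphs are assumptions (they are not the graphs of Fig. 3 or Fig. 4), and the order in which partitions are emitted is simply sorted order, not FSG's ordering.

```python
from collections import defaultdict
from math import factorial


def build_adjacency(edges):
    """edges: dict {(u, v): edge_label} -> {u: [(v, edge_label), ...]}."""
    adj = defaultdict(list)
    for (u, v), label in edges.items():
        adj[u].append((v, label))
        adj[v].append((u, label))
    return adj


def partition_by_degree_and_label(vertex_labels, adj):
    """Group vertices that share the same (degree, label) pair."""
    groups = defaultdict(list)
    for v, label in enumerate(vertex_labels):
        groups[(len(adj[v]), label)].append(v)
    return [groups[key] for key in sorted(groups)]


def refine_by_neighbor_lists(vertex_labels, adj, partitions):
    """Split each partition by the neighbor list nl(u), a sorted tuple of
    (edge label, neighbor degree, neighbor label) triples."""
    refined = []
    for part in partitions:
        groups = defaultdict(list)
        for u in part:
            nl = tuple(sorted((el, len(adj[v]), vertex_labels[v])
                              for v, el in adj[u]))
            groups[nl].append(u)
        refined.extend(groups[key] for key in sorted(groups))
    return refined


def permutations_to_try(partitions):
    """Product of p_i! over all partitions."""
    total = 1
    for part in partitions:
        total *= factorial(len(part))
    return total


# Hypothetical star: center v1, leaves v0, v2, v3; v2 carries label "b".
star_labels = ["a", "a", "b", "a"]
star_adj = build_adjacency({(1, 0): "x", (1, 2): "x", (1, 3): "x"})
star_parts = partition_by_degree_and_label(star_labels, star_adj)
print(permutations_to_try(star_parts), "instead of", factorial(4))

# Hypothetical path v0-v1-v2-v3 where only v3 carries label "b":
# v1 and v2 share degree and label but differ in their neighbor lists.
path_labels = ["a", "a", "a", "b"]
path_adj = build_adjacency({(0, 1): "x", (1, 2): "x", (2, 3): "x"})
coarse = partition_by_degree_and_label(path_labels, path_adj)
fine = refine_by_neighbor_lists(path_labels, path_adj, coarse)
print(permutations_to_try(coarse), "->", permutations_to_try(fine))
```

The star reproduces the $1! \times 2! \times 1! = 2$ versus $4! = 24$ arithmetic from the text on a graph with the same partition sizes, and the path shows a case where the neighbor lists split a partition that degree and label alone cannot.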
Specifically, Fig. 4(b) shows the partitioning produced by the vertex degrees and labels, and Fig. 4(c) shows the partitioning that is produced by also incorporating neighbor lists. The neighbor lists are shown in Fig. 4(d). For this example we were able to reduce the number of permutations that need to be considered from $4! \times 2!$ to $2!$.

Fig. 4. Use of neighbor lists.

c) Iterative Partitioning: Iterative partitioning generalizes the idea of the neighbor lists, by incorporating the partition information [15]. This time, instead of a tuple $(l(e), d(v), l(v))$, we use a pair $(p(v), l(e))$ for representing the neighbor lists, where $p(v)$ is the identifier of the partition to which a neighbor vertex $v$ belongs and $l(e)$ is the label of the incident edge to the neighbor vertex $v$.

The effect of iterative partitioning is illustrated in Fig. 5. In this example graph, all edges have the same label and all vertices have the same label. Initially the vertices are partitioned into two groups only by their degrees, and in each partition they are sorted by their neighbor lists (Fig. 5(b)). The ordering of those partitions is based on the degrees and the labels of each vertex and its neighbors. Then, we split the first partition $p_0$ into two, because the neighbor list of $v_1$ is different from those of $v_0$ and $v_2$. By renumbering all the partitions, updating the neighbor lists, and sorting the vertices based on their neighbor lists, we obtain the matrix shown in Fig. 5(c). Now, because the partition $p_2$ becomes non-uniform in terms of the neighbor lists, we again divide $p_2$ to factor out $v_5$, renumber partitions, update and sort the neighbor lists, and sort vertices to obtain the matrix in Fig. 5(d).

Fig. 5. An example of iterative partitioning.

B. Degree-based Partition Ordering

In addition to using the vertex invariants to compute a fine-grain partitioning of the vertices, the overall run-time of the canonical labeling can be further reduced by properly ordering the various partitions. This is because a proper ordering of the partitions may allow us to quickly determine whether a set of permutations can potentially lead to a code that is smaller than the current best code or not, thus allowing us to prune large parts of the search space.

Recall from Section V-A that we obtain the code of a graph by concatenating its adjacency matrix in a column-wise fashion. As a result, when we permute the rows and the columns of a particular partition, the code corresponding to the columns of the preceding partitions is not affected. Now, while we explore a particular set of within-partition permutations, if we obtain a prefix of the final code that is larger than the corresponding prefix of the currently best code, then we know that regardless of the permutations of the subsequent partitions, this code will never be smaller than the currently best code, and the exploration of this set of permutations can be terminated. The critical property that allows us to prune such unpromising permutations is our ability to obtain a bad code prefix.
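Returning briefly to the iterative partitioning of Section V-A, the refinement loop can be sketched as follows before partition ordering is taken further. This is a minimal illustration, not FSG's code: the function name, the adjacency-dictionary layout, and the five-vertex path used as input are assumptions (it is not the graph of Fig. 5), and partitions are emitted in sorted order of their neighbor lists rather than in FSG's ordering.

```python
from collections import defaultdict


def iterative_partitioning(adj, partitions):
    """Refine partitions until no partition can be split further.

    adj:        {u: [(v, edge_label), ...]} with an entry for every vertex
    partitions: initial list of vertex lists (e.g. from degrees and labels)
    """
    partitions = [list(p) for p in partitions]
    while True:
        # p(v): identifier of the partition that vertex v currently belongs to.
        part_id = {v: i for i, part in enumerate(partitions) for v in part}
        refined = []
        for part in partitions:
            groups = defaultdict(list)
            for u in part:
                # Neighbor list built from (p(v), l(e)) pairs.
                nl = tuple(sorted((part_id[v], el) for v, el in adj[u]))
                groups[nl].append(u)
            refined.extend(groups[key] for key in sorted(groups))
        if len(refined) == len(partitions):
            return refined  # stable: every partition is uniform
        partitions = refined


# Hypothetical path v0-v1-v2-v3-v4 with identical vertex and edge labels.
path_adj = {
    0: [(1, "x")],
    1: [(0, "x"), (2, "x")],
    2: [(1, "x"), (3, "x")],
    3: [(2, "x"), (4, "x")],
    4: [(3, "x")],
}
# Initial partitioning by degree only: degree 2 first, then degree 1.
stable = iterative_partitioning(path_adj, [[1, 2, 3], [0, 4]])
print(stable)
```

In this example the degree-2 partition is split because the middle vertex sees only degree-2 neighbors while its two neighbors each also see an end vertex; after that split no further refinement is possible.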
Ideally, we would like to order the partitions in such a way that the permutations of the vertices in the initial partitions lead to dramatically different code prefixes, which in turn will allow us to prune parts of the search space. In general, the likelihood of this happening depends on the density (i.e., the number of edges) of each partition, and for this reason we sort the partitions in decreasing order of the degree of their vertices.

C. Vertex Stabilization

Vertex stabilization is effective for finding isomorphism of graphs with regular or symmetric structures [31]. The key idea is to break the topological symmetry of a graph by forcing a particular vertex into its own partition when the iterative partitioning leaves a large vertex partition which cannot be decomposed into smaller partitions anymore. For example, consider a cycle G = (V, E) of k edges where all the edges and the vertices have the same label. Each vertex is equivalent to any other, since they are identical in terms of their degree, label, neighbors, and resulting partitions. As a result, a vertex cannot be distinguished from the others and there will be only a single partition containing all the k vertices. Obtaining a canonical label under such a partitioning with the iterative partitioning only would require O(k!) operations. Vertex stabilization breaks such a regular structure by assuming that a particular vertex in a large partition with many equivalent vertices is different from the others. The selected vertex forms a new singleton partition for itself, which triggers, for the rest of the vertices, the successive iterative partitioning whose details are described in Section V-A.0.c. Because we have chosen the vertex arbitrarily, we have to repeat the same process for the remaining vertices in the original partition. During the successive iterative partitioning, the vertex stabilization may be applied repeatedly if the iterative partitioning cannot decompose a large partition effectively. For example, in the case of a cycle with k edges, once a particular vertex v is chosen from the initial partition with all the k vertices, it breaks the symmetry and we immediately obtain ⌈(k − 1)/2⌉ + 1 partitions based on the distance from v to each vertex.
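The cycle example can be checked with a small sketch of iterative partitioning followed by one stabilization step. This is an illustration under simplifying assumptions, not the authors' code: neighbor lists are recomputed from scratch in every round, cells are ordered by their sorted neighbor-list signatures, and the function names `refine`, `stabilize`, and `cycle` are hypothetical.

```python
from collections import defaultdict


def refine(adj, cells):
    """Split cells until all vertices in a cell have identical neighbor lists.

    adj maps a vertex to a list of (neighbor, edge_label) pairs; cells is an
    ordered list of vertex lists (for example, grouped by degree and label).
    """
    cells = [list(c) for c in cells]
    while True:
        cell_of = {v: i for i, c in enumerate(cells) for v in c}
        new_cells = []
        for c in cells:
            groups = defaultdict(list)
            for v in c:
                # Neighbor list as pairs (partition id of neighbor, edge label).
                sig = tuple(sorted((cell_of[u], lab) for u, lab in adj[v]))
                groups[sig].append(v)
            for sig in sorted(groups):
                new_cells.append(groups[sig])
        if len(new_cells) == len(cells):  # no cell was split: partition is stable
            return new_cells
        cells = new_cells


def stabilize(adj, cells, v):
    """Force v into its own singleton cell, then refine the rest."""
    split = []
    for c in cells:
        if v in c:
            split.append([v])
            rest = [u for u in c if u != v]
            if rest:
                split.append(rest)
        else:
            split.append(list(c))
    return refine(adj, split)


def cycle(k):
    """Cycle with k vertices, all edges carrying the same label."""
    return {v: [((v - 1) % k, "e"), ((v + 1) % k, "e")] for v in range(k)}
```

On a cycle, `refine` alone leaves a single cell, whereas `stabilize(cycle(k), [list(range(k))], 0)` produces one cell per distance from vertex 0, which matches the ⌈(k − 1)/2⌉ + 1 count given in the text.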
Thus, the necessary number of permutations to compute the canonical label after this partitioning is (⌈(k − 1)/2⌉ + 1)!. Because there are k such choices for the first vertex v, the entire computational complexity for the canonical labeling of G is bounded by O(k(k/2)!), which is significantly smaller than O(k!). Note that vertex stabilization is not limited to cycles and is applicable to any type of graph. Once a partition becomes small enough, straightforward permutation can be simpler and faster than vertex stabilization for obtaining a canonical label. Thus, our canonical labeling algorithm applies vertex stabilization only if the size of a vertex partition is greater than five. For further details on vertex stabilization, readers should refer to a textbook on permutation groups such as [12].

VI. EXPERIMENTAL EVALUATION

We experimentally evaluated the performance of FSG using actual graphs derived from the molecular structure of chemical compounds, and graphs generated synthetically. The first type of datasets allows us to evaluate the effectiveness of FSG for finding rather large patterns and its scalability to large real datasets, whereas the second one, a set of synthetic datasets, allows us to evaluate the performance of FSG on datasets whose characteristics (e.g., number of graph transactions, average graph size, average number of vertex and edge labels, and average length of patterns) differ dramatically, thus providing insights on how well FSG scales with respect to these characteristics. All experiments were done on dual AMD Athlon MP (1.53 GHz) machines with 2 Gbytes of main memory, running the Linux operating system. All the times reported are in seconds.

A. Chemical Compound Datasets

We derived graph datasets from two publicly available datasets of chemical compounds. The first dataset³ contains 340 chemical compounds and was originally provided for the Predictive Toxicology Evaluation (PTE) Challenge [43], and the second dataset⁴ contains 223,644 chemical compounds and is available from the Developmental Therapeutics Program (DTP) at the National Cancer Institute. From the description of chemical compounds in those two datasets, we created a transaction for a compound, a vertex for an atom, and an edge for a bond. Each vertex has a vertex label assigned for its atom type and each edge has an edge label assigned for its bond type. In the PTE dataset there are 66 atom types and 4 bond types, and in the DTP dataset there are 104 atom types and 3 bond types. Each graph transaction obtained from the PTE and the DTP datasets has 27 and 22 edges on average, respectively.

³ ftp://ftp.comlab.ox.ac.uk/pub/packages/ilp/datasets/carcinogenesis/progol/carcinogenesis.tar.Z
⁴ …database/structural_information/structural_data.html

d) Results: Table II shows the results obtained by FSG on four datasets derived from the PTE and DTP datasets. The first dataset was obtained by using all the compounds of the PTE dataset, whereas the remaining three datasets were obtained by randomly selecting 50,000, 100,000, and 200,000 compounds from the DTP dataset. There are three types of results shown in the table: the run-time in seconds (t), the size of the largest discovered frequent subgraph (k*), and the total number of frequent patterns (|F|) that were generated. The minimum support threshold ranged from 10% down to 1.0%. Dashes in the table correspond to experiments that were aborted due to high computational requirements. All the results in this table were obtained using a single partition of the dataset.

FSG is able to operate effectively on datasets containing 200,000 transactions and to discover all frequent connected subgraphs which occur in 1% of the transactions in approximately one hour. With respect to the number of transactions, the run-time scales almost linearly. For instance, with the 2% support, the run-time for 50,000 transactions is 263 seconds, whereas the corresponding run-time for 200,000 transactions is 1,343 seconds, an increase by a factor of 5.1. As the support decreases, the run-time increases, reflecting the increase in the number of frequent subgraphs found in the input dataset.
For example, with 200,000 transactions, the run-time for the 1% support is 4.2 times longer than that for the 3% support, and the number of frequent subgraphs found for the 1% support was 8.2 times larger than that for the 3% support. Comparing the performance on the PTE and DTP-derived datasets, we notice that the run-time for the PTE dataset increases dramatically as the minimum support decreases, and eventually overtakes the run-time for most of the DTP-derived datasets. This behavior is due to the maximum size and the total number of frequent subgraphs that are discovered in this dataset (both of which are shown in Table II). For lower support values the PTE dataset contains both more and longer frequent subgraphs than the DTP-derived datasets do. This is due to the inherent characteristics of the PTE dataset, because it contains larger and more similar compounds. For example, the PTE dataset contains 26 compounds with over 50 edges and the largest compound has 214 edges. Despite that, FSG requires 459 seconds for a support value of 2.0%, and is able to discover patterns containing over 22 edges.

1) Reducing Memory Requirement of TID lists: To evaluate the effectiveness of the database-partitioning-based approach (described in Section IV-B.1) for reducing the amount of memory required by TID lists (TID list memory), we performed a set of experiments in which we used two datasets derived from the DTP dataset containing 100,000 and 200,000 chemical compounds, respectively. For each dataset we used FSG to find all frequent patterns that occur in at least 1% of the transactions by partitioning the dataset into 2, 3, 4, 5, 10, 20, 30, 40, and 50 partitions. These results are shown in Table III. For each experiment, this table shows the total run-time, the maximum amount of TID list memory, and the maximum amount of memory required to store the pattern lattice (pattern lattice memory). From these results we can see that the database-partitioning-based approach is quite effective in reducing the TID list memory, because it decreases almost linearly as the number of partitions increases.
Moreover, the various optimizations described in Section IV-B.1 are quite effective in limiting the degradation in run-time of the resulting algorithm. For example, for the 200,000-compound dataset and 50 partitions, the run-time increases only by a factor of 3.4 over that for a single partition. Also, the pattern lattice memory increases slowly as the number of partitions increases, and unless the number of partitions is quite large, it is still dominated by the memory required to store the TID lists. Note that these results suggest that there is an optimal point for the number of partitions that leads to the least amount of memory, as the pattern lattice memory will eventually exceed the TID list memory as the number of partitions increases.

B. Synthetic Datasets

To evaluate the performance of FSG on datasets with different characteristics we developed a synthetic graph generator which can control the number of transactions |D|, the average number of edges in each transaction |T|, the average number of edges |I| of the potentially frequent subgraphs, the number of potentially frequent subgraphs |S|, the number of distinct edge labels |L_E|, and the number of distinct vertex labels |L_V| of the generated dataset. The design of our generator was inspired by the synthetic transaction generator developed by the Quest group at IBM and used extensively to evaluate algorithms that find frequent itemsets [1], [2], [20].

The actual generator works as follows. First, it generates a set of |S| potentially frequent connected subgraphs called seed patterns, whose size is determined by a Poisson distribution with mean |I|. For each seed pattern, the topology and the labels of the edges and the vertices are chosen randomly. Each seed pattern has a weight assigned, which becomes the probability that the seed pattern is selected to be included in a graph transaction. The weights are calculated by dividing a random variable which obeys an exponential distribution with unit mean by the number of edges in the seed pattern, and the sum of the weights of all the seed patterns is normalized to one. We call this set S of seed patterns a seed pool.
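The weight assignment for the seed pool can be sketched as follows. This is a simplified illustration, not the authors' generator: it produces only the size and weight of each seed pattern (topology and labels are omitted), it clamps every size to at least one edge so that the division is defined (an assumption not stated in the text), and the names `poisson` and `make_seed_pool` are hypothetical.

```python
import math
import random


def poisson(mean, rng):
    """Draw a Poisson variate with Knuth's multiplication method (small means)."""
    limit = math.exp(-mean)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def make_seed_pool(num_seeds, mean_size, rng):
    """Return (sizes, weights) for num_seeds seed patterns.

    Each raw weight is an exponential variate with unit mean divided by the
    number of edges of the pattern; weights are then normalized to sum to one.
    """
    # Assumption: a seed pattern has at least one edge.
    sizes = [max(1, poisson(mean_size, rng)) for _ in range(num_seeds)]
    raw = [rng.expovariate(1.0) / size for size in sizes]
    total = sum(raw)
    weights = [w / total for w in raw]
    return sizes, weights
```

For instance, `make_seed_pool(200, 5, random.Random(0))` mirrors the setting |S| = 200 and |I| = 5; the fixed random seed is only there to make the sketch reproducible.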
The reason that we divide the exponential random variable by the number of edges is to reduce the chance that larger weights are assigned to larger seed patterns. Otherwise, once a large weight was assigned to a large seed pattern, the resulting dataset would contain an exponentially large number of frequent patterns.

[TABLE II. RUN-TIME IN SECONDS FOR THE PTE AND DTP CHEMICAL COMPOUND DATASETS. Columns: support threshold [%]; then, for each of PTE (|D| = 340), DTP (|D| = 50,000), DTP (|D| = 100,000), and DTP (|D| = 200,000): run-time t [sec], size of the largest frequent pattern k*, and number of frequent patterns |F|. Note. Dashes indicate that the computation was aborted because of the excessively long run-time. |D|: number of transactions.]

[TABLE III. RUN-TIME AND TID LIST MEMORY WITH PARTITIONING. Three panels, each indexed by |D| and the number of partitions: run-time [sec]; maximum amount of memory for storing TID lists [Mbytes]; maximum amount of memory for storing the pattern lattice [Mbytes]. Note. The two datasets are generated from the DTP dataset by sampling 100,000 and 200,000 chemical compounds. The minimum support is σ = 1.0%. Pattern lattice memory is left blank for a single partition because the lattice is not built. |D|: number of transactions.]

Next, the generator creates |D| transactions. First, the generator determines the target size of each transaction, which is a Poisson random variable whose mean is equal to |T|. Then, the generator selects a seed pattern from the seed pool by rolling an |S|-sided die. Each face of this die corresponds to the probability assigned to a seed pattern in the seed pool. If the size of the selected seed pattern fits in the target transaction size, the generator adds it to the transaction. If the size of the current intermediate transaction does not reach its target size, we keep selecting and putting another seed pattern into it. When adding the selected seed pattern makes the intermediate transaction size greater than the target transaction size, we add it in half of the cases, and discard it and move on to the next transaction in the other half. The generator adds a seed pattern into a transaction by connecting a randomly selected pair of vertices, one from the transaction and the other from the seed pattern.
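The transaction-filling loop can be sketched as below. This is a size-only model under stated assumptions, not the authors' generator: it tracks just the number of edges contributed by each seed pattern, ignores how patterns are joined to the transaction, and treats a transaction as finished once a pattern overshoots the target, whether that pattern is kept or discarded. The names `poisson` and `generate_transactions` are hypothetical.

```python
import math
import random


def poisson(mean, rng):
    """Draw a Poisson variate with Knuth's multiplication method."""
    limit = math.exp(-mean)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def generate_transactions(num_transactions, mean_target, seed_sizes, weights, rng):
    """Return a list of (target_size, chosen_seed_indices) pairs."""
    indices = list(range(len(seed_sizes)))
    result = []
    for _ in range(num_transactions):
        target = poisson(mean_target, rng)  # target size of this transaction
        chosen, size = [], 0
        while size < target:
            # Roll the weighted die over the seed pool.
            s = rng.choices(indices, weights=weights)[0]
            overshoots = size + seed_sizes[s] > target
            if overshoots and rng.random() < 0.5:
                break  # discard the pattern and move on to the next transaction
            chosen.append(s)
            size += seed_sizes[s]  # if this overshoots, the loop ends here
        result.append((target, chosen))
    return result
```

Only the last pattern added to a transaction can push it past its target size, so the overshoot is always smaller than the largest seed pattern.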
a) Results: Using this generator, we obtained a number of different datasets by varying the number of vertex labels |L_V|, the average size of the potentially frequent subgraphs |I|, and the average size of each transaction |T|, while keeping fixed the total number of transactions |D| = 10,000, seed patterns |S| = 200, and edge labels |L_E| = 1. Despite our best efforts in designing the generator, we observed that as both |T| and |I| increase, different datasets created under the same parameter combination lead to different run-times, because some may contain harder seed patterns (e.g., regular patterns with similar labels) than others do. To reduce this variability, we created ten different datasets for each parameter combination with different seeds for the pseudo-random number generator and ran FSG on all of them. The median of the run-times over the ten datasets for each parameter combination is shown in Fig. 6. Note that these results were obtained using 2% as the minimum support threshold.

[Fig. 6. Median of 10 run-times in seconds for synthetic datasets. |T| is the average size of transactions, |I| is the average size of seed patterns, and |L_V| is the number of distinct vertex labels. Each panel plots the median run-time [s] against |L_V| for several values of |T|, one panel per value of |I|.]

In general, FSG's run-time decreases as the number of vertex labels |L_V| increases, whereas it increases when the average size of the seed patterns |I| or the average transaction size |T| increases. These trends are consistent with the inherent characteristics of the datasets for the following reasons:
(i) As the number of vertex labels increases, the space of possible automorphisms and subgraph isomorphisms decreases, leading to faster candidate generation and frequency counting.
(ii) As the size of the average seed pattern increases, because of the combinatorial nature of the problem, the total number of frequent patterns to be found in the dataset increases exponentially, increasing the overall run-time.
(iii) As the size of the average transaction |T| increases, frequency counting by subgraph isomorphism becomes expensive, regardless of the size of the candidate subgraphs. Moreover, the total number of frequent patterns to be found in the dataset also increases because more seed patterns can be put into each transaction. Both of these factors contribute to increasing the overall run-time.

VII. RELATED WORK

Over the years, a number of different algorithms have been developed to find frequent patterns corresponding to frequent subgraphs in graph datasets. Developing such algorithms is particularly challenging and computationally intensive, as graph and subgraph isomorphisms play a key role throughout the computations. For this reason, a considerable amount of work has focused on approximate algorithms [23], [28], [35], [46] that use various heuristics to prune the search space. However, a number of exact algorithms have been developed [5], [10], [17], [24], [25], [45] that guarantee to find all subgraphs that satisfy certain minimum support or other constraints.

Probably the most well-known heuristic-based approach is the SUBDUE system, which was originally developed in 1994 but has been improved over the years [8], [23]. SUBDUE finds patterns which can effectively compress the original input data based on the minimum description length principle, by substituting those patterns with a single vertex. To narrow the search space and improve its computational efficiency, SUBDUE uses a heuristic beam search approach, which quite often results in failing to find subgraphs that are frequent.
Nevertheless, despite its heuristic nature, its computational performance is considerably worse compared to some of the recent frequent subgraph discovery algorithms. Experiments reported in [17] for the PTE dataset [43] show that SUBDUE spends about 80 seconds on a Pentium III 900 MHz computer to find the five most frequent substructures. In contrast, the FSG algorithm developed by our group [29] takes only 20 seconds on a Pentium III 450 MHz to find all 3,608 frequent subgraphs that occur in at least 5% of the compounds.

A number of approaches for finding commonly occurring subgraphs have been developed in the context of inductive logic programming (ILP) systems [19], [33], [34], [38], [44], as graphs can be easily expressed using first-order logic. Each vertex and edge is represented as a predicate and a subgraph corresponds to a conjunction of such predicates. The goal of ILP-based approaches is to induce a set of rules capable of correctly classifying a set of positive and negative examples. In the case of graphs modeled by ILP systems, these rules usually correspond to subgraphs. Most ILP-based approaches are greedy in nature and use various heuristics to prune the space of possible hypotheses. Thus, they tend to find subgraphs that have high support and can act as good discriminators between classes. However, they are not guaranteed to discover all frequent subgraphs. A notable exception is the ILP system WARMR developed by Dehaspe and De Raedt [9], which is capable of finding all frequently occurring subgraphs. However, WARMR is not specialized for handling graphs and does not employ any graph-specific optimizations; as such, it has high computational requirements.

In the last three years, three different algorithms have been developed that are capable of finding all frequently occurring subgraphs with reasonable computational efficiency. These are AGM by Inokuchi et al. [24], [25], the chemical substructure discovery algorithm developed by Borgelt and Berthold [5], and the gSpan algorithm developed by Yan and Han [45]. Among them, the early version of AGM [24] was developed prior to FSG, whereas the other algorithms were developed after the initial development of FSG [29].
AGM, initially developed to find frequent induced subgraphs [24] and later extended to find arbitrary frequent subgraphs [25], discovers the frequent subgraphs using a breadth-first approach, and grows the frequent subgraphs one vertex at a time. To distinguish a subgraph from another, it uses a canonical labeling scheme based on the adjacency matrix representation. Experiments reported in [24] show that AGM achieves good performance for synthetic dense datasets, and it required 40 minutes to 8 days to find all frequent induced subgraphs in the PTE dataset, as the minimum support threshold varied from 20% to 10%. Their modified algorithm [25] uses previously found embeddings of a frequent pattern in a transaction to save the subgraph isomorphism computation and improves the performance significantly at the expense of increased memory requirements.

The chemical substructure mining algorithm developed by Borgelt and Berthold [5] finds frequent substructures (connected subgraphs) using a depth-first approach similar to that used by dEclat [49] in the context of frequent itemset discovery. In this algorithm, once a frequent subgraph has been identified, it then proceeds to explore the input dataset for frequent subgraphs all of which contain the frequent subgraph. To reduce the number of subgraph isomorphism operations, it keeps the embeddings of previously discovered subgraphs and tries to extend the embeddings by one edge, which is similar to the modified version of AGM [25]. In addition, since all the embeddings of the frequent subgraph are known, they project the original dataset into a smaller one by removing edges and vertices that are not used by any embeddings. Nevertheless, despite these optimizations, the reported speed of the algorithm is slower than that achieved by FSG. This is primarily due to two reasons. First, their candidate subgraph generation scheme does not ensure that the same subgraph is generated only once; as a result, they end up generating and determining the frequency of the same subgraph multiple times. Second, in chemical datasets, the same subgraph tends to have many embeddings (in the range of …); as a result, the cost of keeping track of them outweighs any benefits.

1 PYTHAGORAS THEOREM 1. Given a right angled triangle, the square of the hypotenuse is equal to the sum of the squares of the other two sides.

1 PYTHAGORAS THEOREM 1. Given a right angled triangle, the square of the hypotenuse is equal to the sum of the squares of the other two sides. 1 PYTHAGORAS THEOREM 1 1 Pythgors Theorem In this setion we will present geometri proof of the fmous theorem of Pythgors. Given right ngled tringle, the squre of the hypotenuse is equl to the sum of the

More information

Exercise sheet 6: Solutions

Exercise sheet 6: Solutions Eerise sheet 6: Solutions Cvet emptor: These re merel etended hints, rther thn omplete solutions. 1. If grph G hs hromti numer k > 1, prove tht its verte set n e prtitioned into two nonempt sets V 1 nd

More information

12.4 Similarity in Right Triangles

12.4 Similarity in Right Triangles Nme lss Dte 12.4 Similrit in Right Tringles Essentil Question: How does the ltitude to the hpotenuse of right tringle help ou use similr right tringles to solve prolems? Eplore Identifing Similrit in Right

More information

Fast Frequent Free Tree Mining in Graph Databases

Fast Frequent Free Tree Mining in Graph Databases The Chinese University of Hong Kong Fst Frequent Free Tree Mining in Grph Dtses Peixing Zho Jeffrey Xu Yu The Chinese University of Hong Kong Decemer 18 th, 2006 ICDM Workshop MCD06 Synopsis Introduction

More information

A Lower Bound for the Length of a Partial Transversal in a Latin Square, Revised Version

A Lower Bound for the Length of a Partial Transversal in a Latin Square, Revised Version A Lower Bound for the Length of Prtil Trnsversl in Ltin Squre, Revised Version Pooy Htmi nd Peter W. Shor Deprtment of Mthemtil Sienes, Shrif University of Tehnology, P.O.Bo 11365-9415, Tehrn, Irn Deprtment

More information

Global alignment. Genome Rearrangements Finding preserved genes. Lecture 18

Global alignment. Genome Rearrangements Finding preserved genes. Lecture 18 Computt onl Biology Leture 18 Genome Rerrngements Finding preserved genes We hve seen before how to rerrnge genome to obtin nother one bsed on: Reversls Knowledge of preserved bloks (or genes) Now we re

More information

Data Structures LECTURE 10. Huffman coding. Example. Coding: problem definition

Data Structures LECTURE 10. Huffman coding. Example. Coding: problem definition Dt Strutures, Spring 24 L. Joskowiz Dt Strutures LEURE Humn oing Motivtion Uniquel eipherle oes Prei oes Humn oe onstrution Etensions n pplitions hpter 6.3 pp 385 392 in tetook Motivtion Suppose we wnt

More information

Computational Biology Lecture 18: Genome rearrangements, finding maximal matches Saad Mneimneh

Computational Biology Lecture 18: Genome rearrangements, finding maximal matches Saad Mneimneh Computtionl Biology Leture 8: Genome rerrngements, finding miml mthes Sd Mneimneh We hve seen how to rerrnge genome to otin nother one sed on reversls nd the knowledge of the preserved loks or genes. Now

More information

NON-DETERMINISTIC FSA

NON-DETERMINISTIC FSA Tw o types of non-determinism: NON-DETERMINISTIC FS () Multiple strt-sttes; strt-sttes S Q. The lnguge L(M) ={x:x tkes M from some strt-stte to some finl-stte nd ll of x is proessed}. The string x = is

More information

8 THREE PHASE A.C. CIRCUITS

8 THREE PHASE A.C. CIRCUITS 8 THREE PHSE.. IRUITS The signls in hpter 7 were sinusoidl lternting voltges nd urrents of the so-lled single se type. n emf of suh type n e esily generted y rotting single loop of ondutor (or single winding),

More information

Lecture Notes No. 10

Lecture Notes No. 10 2.6 System Identifition, Estimtion, nd Lerning Leture otes o. Mrh 3, 26 6 Model Struture of Liner ime Invrint Systems 6. Model Struture In representing dynmil system, the first step is to find n pproprite

More information

SECTION A STUDENT MATERIAL. Part 1. What and Why.?

SECTION A STUDENT MATERIAL. Part 1. What and Why.? SECTION A STUDENT MATERIAL Prt Wht nd Wh.? Student Mteril Prt Prolem n > 0 n > 0 Is the onverse true? Prolem If n is even then n is even. If n is even then n is even. Wht nd Wh? Eploring Pure Mths Are

More information

Comparing the Pre-image and Image of a Dilation

Comparing the Pre-image and Image of a Dilation hpter Summry Key Terms Postultes nd Theorems similr tringles (.1) inluded ngle (.2) inluded side (.2) geometri men (.) indiret mesurement (.6) ngle-ngle Similrity Theorem (.2) Side-Side-Side Similrity

More information

CS 573 Automata Theory and Formal Languages

CS 573 Automata Theory and Formal Languages Non-determinism Automt Theory nd Forml Lnguges Professor Leslie Lnder Leture # 3 Septemer 6, 2 To hieve our gol, we need the onept of Non-deterministi Finite Automton with -moves (NFA) An NFA is tuple

More information

Matrices SCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics (c) 1. Definition of a Matrix

Matrices SCHOOL OF ENGINEERING & BUILT ENVIRONMENT. Mathematics (c) 1. Definition of a Matrix tries Definition of tri mtri is regulr rry of numers enlosed inside rkets SCHOOL OF ENGINEERING & UIL ENVIRONEN Emple he following re ll mtries: ), ) 9, themtis ), d) tries Definition of tri Size of tri

More information

Project 6: Minigoals Towards Simplifying and Rewriting Expressions

Project 6: Minigoals Towards Simplifying and Rewriting Expressions MAT 51 Wldis Projet 6: Minigols Towrds Simplifying nd Rewriting Expressions The distriutive property nd like terms You hve proly lerned in previous lsses out dding like terms ut one prolem with the wy

More information

VIBRATION ANALYSIS OF AN ISOLATED MASS WITH SIX DEGREES OF FREEDOM Revision G

VIBRATION ANALYSIS OF AN ISOLATED MASS WITH SIX DEGREES OF FREEDOM Revision G B Tom Irvine Emil: tom@virtiondt.om Jnur 8, 3 VIBRATION ANALYSIS OF AN ISOLATED MASS WITH SIX DEGREES OF FREEDOM Revision G Introdution An vionis omponent m e mounted with isoltor grommets, whih t s soft

More information

Chapter 4 State-Space Planning

Chapter 4 State-Space Planning Leture slides for Automted Plnning: Theory nd Prtie Chpter 4 Stte-Spe Plnning Dn S. Nu CMSC 722, AI Plnning University of Mrylnd, Spring 2008 1 Motivtion Nerly ll plnning proedures re serh proedures Different

More information

AP Calculus BC Chapter 8: Integration Techniques, L Hopital s Rule and Improper Integrals

AP Calculus BC Chapter 8: Integration Techniques, L Hopital s Rule and Improper Integrals AP Clulus BC Chpter 8: Integrtion Tehniques, L Hopitl s Rule nd Improper Integrls 8. Bsi Integrtion Rules In this setion we will review vrious integrtion strtegies. Strtegies: I. Seprte the integrnd into

More information

Technische Universität München Winter term 2009/10 I7 Prof. J. Esparza / J. Křetínský / M. Luttenberger 11. Februar Solution

Technische Universität München Winter term 2009/10 I7 Prof. J. Esparza / J. Křetínský / M. Luttenberger 11. Februar Solution Tehnishe Universität Münhen Winter term 29/ I7 Prof. J. Esprz / J. Křetínský / M. Luttenerger. Ferur 2 Solution Automt nd Forml Lnguges Homework 2 Due 5..29. Exerise 2. Let A e the following finite utomton:

More information

Maintaining Mathematical Proficiency

Maintaining Mathematical Proficiency Nme Dte hpter 9 Mintining Mthemtil Profiieny Simplify the epression. 1. 500. 189 3. 5 4. 4 3 5. 11 5 6. 8 Solve the proportion. 9 3 14 7. = 8. = 9. 1 7 5 4 = 4 10. 0 6 = 11. 7 4 10 = 1. 5 9 15 3 = 5 +

More information

Lesson 2: The Pythagorean Theorem and Similar Triangles. A Brief Review of the Pythagorean Theorem.

Lesson 2: The Pythagorean Theorem and Similar Triangles. A Brief Review of the Pythagorean Theorem. 27 Lesson 2: The Pythgoren Theorem nd Similr Tringles A Brief Review of the Pythgoren Theorem. Rell tht n ngle whih mesures 90º is lled right ngle. If one of the ngles of tringle is right ngle, then we

More information

Reference : Croft & Davison, Chapter 12, Blocks 1,2. A matrix ti is a rectangular array or block of numbers usually enclosed in brackets.

Reference : Croft & Davison, Chapter 12, Blocks 1,2. A matrix ti is a rectangular array or block of numbers usually enclosed in brackets. I MATRIX ALGEBRA INTRODUCTION TO MATRICES Referene : Croft & Dvison, Chpter, Blos, A mtri ti is retngulr rr or lo of numers usull enlosed in rets. A m n mtri hs m rows nd n olumns. Mtri Alger Pge If the

More information

6.5 Improper integrals

6.5 Improper integrals Eerpt from "Clulus" 3 AoPS In. www.rtofprolemsolving.om 6.5. IMPROPER INTEGRALS 6.5 Improper integrls As we ve seen, we use the definite integrl R f to ompute the re of the region under the grph of y =

More information

ANALYSIS AND MODELLING OF RAINFALL EVENTS

ANALYSIS AND MODELLING OF RAINFALL EVENTS Proeedings of the 14 th Interntionl Conferene on Environmentl Siene nd Tehnology Athens, Greee, 3-5 Septemer 215 ANALYSIS AND MODELLING OF RAINFALL EVENTS IOANNIDIS K., KARAGRIGORIOU A. nd LEKKAS D.F.

More information

CS311 Computational Structures Regular Languages and Regular Grammars. Lecture 6

CS311 Computational Structures Regular Languages and Regular Grammars. Lecture 6 CS311 Computtionl Strutures Regulr Lnguges nd Regulr Grmmrs Leture 6 1 Wht we know so fr: RLs re losed under produt, union nd * Every RL n e written s RE, nd every RE represents RL Every RL n e reognized

More information

Linear Algebra Introduction

Linear Algebra Introduction Introdution Wht is Liner Alger out? Liner Alger is rnh of mthemtis whih emerged yers k nd ws one of the pioneer rnhes of mthemtis Though, initilly it strted with solving of the simple liner eqution x +

More information

Algorithms & Data Structures Homework 8 HS 18 Exercise Class (Room & TA): Submitted by: Peer Feedback by: Points:

Algorithms & Data Structures Homework 8 HS 18 Exercise Class (Room & TA): Submitted by: Peer Feedback by: Points: Eidgenössishe Tehnishe Hohshule Zürih Eole polytehnique fédérle de Zurih Politenio federle di Zurigo Federl Institute of Tehnology t Zurih Deprtement of Computer Siene. Novemer 0 Mrkus Püshel, Dvid Steurer

More information

Core 2 Logarithms and exponentials. Section 1: Introduction to logarithms

Core 2 Logarithms and exponentials. Section 1: Introduction to logarithms Core Logrithms nd eponentils Setion : Introdution to logrithms Notes nd Emples These notes ontin subsetions on Indies nd logrithms The lws of logrithms Eponentil funtions This is n emple resoure from MEI

More information

Figure 1. The left-handed and right-handed trefoils

Figure 1. The left-handed and right-handed trefoils The Knot Group A knot is n emedding of the irle into R 3 (or S 3 ), k : S 1 R 3. We shll ssume our knots re tme, mening the emedding n e extended to solid torus, K : S 1 D 2 R 3. The imge is lled tuulr

More information

5. Every rational number have either terminating or repeating (recurring) decimal representation.

5. Every rational number have either terminating or repeating (recurring) decimal representation. CHAPTER NUMBER SYSTEMS Points to Rememer :. Numer used for ounting,,,,... re known s Nturl numers.. All nturl numers together with zero i.e. 0,,,,,... re known s whole numers.. All nturl numers, zero nd

More information

AVL Trees. D Oisín Kidney. August 2, 2018

AVL Trees. D Oisín Kidney. August 2, 2018 AVL Trees D Oisín Kidne August 2, 2018 Astrt This is verified implementtion of AVL trees in Agd, tking ides primril from Conor MBride s pper How to Keep Your Neighours in Order [2] nd the Agd stndrd lirr
