1 APL13: Suffix Arrays: more space reduction
|
|
- Clare Drusilla McCormick
- 6 years ago
- Views:
Transcription
1 1 APL13: Suffix Arrys: more spce reduction In Section??, we sw tht when lphbet size is included in the time nd spce bounds, the suffix tree for string of length m either requires Θ(m Σ ) spce or the minimum of O(m logm) nd O(m log Σ ) time. Similrly, serching for pttern P of length n using suffix tree cn be done in O(n) time only if Θ(m Σ ) spce is used for the tree, or if we ssume tht up to Σ chrcter comprisons cost only constnt time. Otherwise, the serch tkes the minimum of O(n log m) nd O(n log Σ ) comprisons. For these resons, suffix tree my require too much spce to be prcticl in some pplictions. Hence more spce efficient pproch is desired tht still retins most of the dvntges of serching with suffix tree. In the context of the substring problem (see Section??) where fixed string T will be serched mny times, the key issues re the time needed for the serch, nd the spce used by the fixed dt structure representing T. The spce used during the preprocessing of T is of less concern, lthough it should still be resonble. Mnber nd Myers [1] proposed new dt structure, clled suffix rry, tht is very spce efficient nd yet cn be used to solve the exct mtching problem or the substring problem lmost s efficiently s with suffix tree. Suffix rrys re likely to be n importnt contribution to certin string problems in computtionl moleculr biology, where the lphbet cn be lrge (we will discuss some of the resons for lrge lphbets below). Interestingly, lthough the more forml notion of suffix rry nd the bsic lgorithms for building nd using it were developed in [1], mny of the ides were nticipted in the biologicl literture by Mrtinez [2]. After defining suffix rrys we show how to convert suffix tree to suffix rry in liner time. It is importnt to be cler on the setting of the problem. String T will be held fixed for long time, while P will vry. Therefore the gol is to find spce efficient representtion for T ( suffix rry) tht will be held fixed nd tht fcilittes serch problems in T. But the mount of spce used during the construction of tht representtion is not so criticl. In the exercises we consider more spce efficient wy to build the representtion itself. Definition: Given n m chrcter string T,suffix rry for T, clled Pos, is n rry of the integers in the rnge 1 to m, specifying the lexicogrphic order of the m suffixes of string T. Tht is, the suffix strting t position Pos(1) of T is the lexiclly smllest suffix, nd in generl suffix Pos(i) of T is lexiclly smller thn suffix Pos(i+1). As usul, we will ffix terminl symbol $ to the end of S, but now interpret it to be lexiclly less thn ny other chrcter in the lphbet. This is in contrst to its interprettion in the previous section. As n exmple of suffix rry, if T is mississippi, then the suffix rry Pos is 11, 8, 5, 2, 1, 10, 9, 7, 4, 6, 3. Figure 1 lists the eleven suffixes in lexicogrphic order. Notice tht the suffix rry holds only integers nd hence contins no informtion bout the lphbet used in string T. Therefore, the spce required by suffix rrys 1
2 11: i 8: ippi 5: issippi 2: ississippi 1: mississippi 10: pi 9: ppi 7: sippi 4: sisippi 6: ssippi 3: ssissippi Figure 1: The eleven suffixes of mississippi listed in lexicogrphic order. The strting positions of those suffixes define the suffix rry Pos. is modest for string of length m, the rry cn be stored in exctly m computer words, ssuming word size of t lest log m bits. When ugmented with n dditionl 2m vlues (clled Lcp vlues nd defined lter), the suffix rry cn be used to find ll the occurrences in T of pttern P in O(n + log 2 m) single-chrcter comprison nd bookkeeping opertions. Moreover, this bound is independent of the lphbet size. Since for most problems of interest log 2 m is O(n), the substring problem is solved by using suffix rrys s efficiently s by using suffix trees. 1.1 Suffix tree to suffix rry in liner time We ssume tht sufficient spce is vilble to build suffix tree for T (this is done once during preprocessing phse), but tht the suffix tree cnnot be kept intct to be used in the (mny) subsequent serches for ptterns in T. Insted we convert the suffix tree to the more spce efficient suffix rry. Exercises??,?? nd?? develop more spce efficient (but slower) method for building suffix rry. A suffix rry for T cn be obtined from the suffix tree T for T by performing lexicl depth-first trversl of T. Once the suffix rry is built, the suffix tree is discrded. Definition Define n edge (v, u) to be lexiclly less thn n edge (v, w) if nd only if the first chrcter on the (v, u) edge is lexiclly less thn the first chrcter on (v, w). (In this ppliction, the end of string chrcter $ is lexiclly less thn ny other chrcter.) Since no two edges out of v hve lbels beginning with the sme chrcter, there is strict lexicl ordering of the edges out of v. This ordering implies tht the pth 2
3 from the root of T following the lexiclly smllest edge out of ech encountered node leds to lef of T representing the lexiclly smllest suffix of T. More generlly, depth-first trversl of T which trverses the edges out of ech node v in their lexicl order will encounter the leves of T in the lexicl order of the suffixes they represent. Suffix rry Pos is therefore just the ordered list of suffix numbers encountered t the leves of T during the lexicl depth- first-serch. The suffix tree for T is constructed in liner time, nd the trversl lso tkes only liner time, so we hve the following Theorem 1.1 The suffix rry Pos for string T of length m cn be constructed in O(m) time. For exmple, the suffix tree for T = trtr is shown in Figure 2. The lexicl depth-first trversl visits the nodes in the order 5,2,6,3,4,1, defining the vlues of rry Pos. r $ 1 t t r $ 4 r t r r $ t 6 $ r 5 Figure 2: The lexicl depth-first trversl of the suffix tree visits the leves in order 5,2,6,3,4,1. As n implementtion detil, if the brnches out of ech node of the tree re orgnized in sorted linked list (s discussed in Section??, pge??), then the overhed to do lexicl depth-first serch is the sme s for ny depth-first serch. Every time the serch must choose n edge out of node v to trverse, it simply picks the next edge on v s linked list. 1.2 How to serch for pttern using suffix rry The suffix rry for string T llows very simple lgorithm to find ll occurrences of ny pttern P in T. The key is tht if P occurs in T then ll the loctions of those 3 $ $ 2 3
4 occurrences will be grouped consecutively in Pos. For exmple, P = issi occurs in mississippi strting t loctions 2 nd 5, which re indeed djcent in Pos (see Figure 1). So to serch for occurrences of P in T simply do binry serch over the suffix rry. In more detil, suppose tht P is lexiclly less thn the suffix in the middle position of Pos (i.e., suffix Pos( m/2 )). In tht cse, the first plce in Pos tht contins position where P occurs in T must be in the first hlf of Pos. Similrly, if P is lexiclly greter thn suffix Pos( m/2 ), then the plces where P occurs in T must be in the second hlf of Pos. Using binry serch, one cn therefore find the smllest index i in Pos (if ny) such tht P exctly mtches the first n chrcters of suffix Pos(i). Similrly, one cn find the lrgest index i with tht property. Then pttern P occurs in T strting t every loction given by Pos(i) through Pos(i ). The lexicl comprison of P to ny suffix tkes time proportionl to the length of the common prefix of those two strings. Tht prefix hs length t most n, hence Theorem 1.2 By using binry serch on rry Pos, ll the occurrences of P in T cn be found in O(n log m) time. Of course, the true behvior of the lgorithm depends on how mny long prefixes of P occur in T. If very few long prefixes of P occur in T then it will rrely hppen tht specific lexicl comprison ctully tkes Θ(n) time nd generlly the O(n logm) bound is quite pessimistic. In rndom strings (even on lrge lphbets) this method should run in O(n + log m) expected time. In cses where mny long prefixes of P do occur in T, then the method cn be improved with the following two tricks. 1.3 A simple ccelernt As the binry serch proceeds, let L nd R denote the left nd right boundries of the current serch intervl. At the strt, L equls 1 nd R equls m. Then in ech itertion of the binry serch, query is mde t loction M = (R + L)/2 of Pos. The serch lgorithm keeps trck of the longest prefixes of Pos(L) nd Pos(R) tht mtch prefix of P. Let l nd r denote those two prefix lengths respectively, nd let mlr = min(l, r). The vlue mlr cn be used to ccelerte the lexicl comprison of P nd suffix Pos(M). Since rry Pos gives the lexicl ordering of the suffixes of T, if i is ny index between L nd R, the first mlr chrcters of suffix Pos(i) must be the sme s the first mlr chrcters of suffix Pos(L), nd hence of P. Therefore, the lexicl comprison of P nd suffix Pos(M) cn begin from position mlr+1 of the two strings, rther thn strting from the first position. Mintining mlr during the binry serch dds little dditionl overhed to the lgorithm but voids mny redundnt comprisons. At the strt of the serch, when L = 1 nd R = m, explicitly compre P to suffix Pos(1) nd suffix Pos(m) to find l nd r nd mlr. However, the worst cse time for this revised method is still O(n log m). Myers nd Mnber report tht the use of mlr lone llows the serch to 4
5 run s fst in prctice s the O(n+log m) worst cse method tht we first dvertised. Still, if only becuse of its elegnce, we present the full method tht gurntees tht better worst cse bound. 1.4 A super-ccelernt Cll n exmintion of chrcter in P redundnt if tht chrcter hs been exmined before. The gol of the ccelertion is to reduce the number of redundnt chrcter exmintions to t most one per itertion of the binry serch hence O(log m) in ll. The desired time bound, O(n+log m), follows immeditely. The use of mlr lone does not chieve this gol. Since mlr is the minimum of l nd r, whenever l r, ll chrcters in P from mlr + 1 to the mximum of l nd r will hve lredy been exmined. So ny comprisons of those chrcters will be redundnt. Wht is needed is wy to begin comprisons t the mximum of l nd r. Definition Lcp(i,j) is the length of the longest common prefix of the suffixes specified in positions i nd j of Pos. Tht is, Lcp(i,j) is the length of the longest prefix common to suffix Pos(i) nd suffix Pos(j). The term Lcp stnds for longest common prefix. For exmple, when T = mississippi, suffix Pos(3) is issippi, suffix Pos(4) is ississippi, nd so Lcp(3,4) is four (see Figure 1). To speed up the serch, the lgorithm uses Lcp(L,M) nd Lcp(M,R) for ech triple (L, M, R) tht rises during the execution of the binry serch. For now, we ssume tht these vlues cn be obtined in constnt time when needed, nd show how they help the serch. Lter we will show how to compute the prticulr Lcp vlues needed by the binry serch during the preprocessing of T. How to use Lcp vlues Simplest cse: In ny itertion of the binry serch, if l = r, then compre P to suffix Pos(M) strting from position mlr + 1 = l + 1 = r + 1, s before. Generl cse: When l r, let us ssume without loss of generlity tht l > r. Then there re three subcses: If Lcp(L,M) >l, then the common prefix of suffix Pos(L) nd suffix Pos(M) is longer thn the common prefix of P nd Pos(L). Therefore P grees with suffix Pos(M) up through chrcter l. In other words, chrcters l +1 of suffix Pos(L) nd suffix Pos(M) re identicl nd lexiclly less thn chrcter l + 1 of P (the lst fct follows since P is lexiclly greter thn suffix Pos(L)). Hence ll (if ny) strting loctions of P in T must occur to the right of position M in Pos. So in ny itertion of the binry serch where this cse occurs, no exmintions 5
6 of P re needed; L just gets chnged to M, nd l nd r remin unchnged. (See Figure 3). If Lcp(L,M) <l, then the common prefix of suffix Pos(L) nd Pos(M) is smller thn the common prefix of suffix Pos(L) nd P. Therefore P grees with suffix Pos(M) up through chrcter Lcp(L,M). The Lcp(L,M)+1 chrcters of P nd suffix Pos(L) re identicl nd lexiclly less thn chrcter Lcp(L,M)+1 of suffix Pos(M). Hence ll (if ny) strting loctions of P in T must occur to the left of position M in Pos. So in ny itertion of the binry serch where this cse occurs, no exmintions of P re needed; r is chnged to Lcp(L,M), l remins unchnged, nd R is chnged to M. If Lcp(L,M) = l, then P grees with suffix Pos(M) up to chrcter l. The lgorithm then lexiclly compres P to suffix Pos(M) strting from position l + 1. In the usul mnner, the outcome of tht lexicl comprison determines which of L or R chnge, long with the corresponding chnge of l or r. lcp(l,m) n g g z m f f y e d e d l e d x d r c c c c b b b b P L M R Figure 3: Subcse 1 of the super-ccelernt. Pttern P is bcdemn shown verticlly running upwrds from the first chrcter. The suffixes Pos(L), Pos(M), Pos(R) re lso shown verticlly. In this cse, Lcp(L,M) > 0, nd l > r. Any strting loction of P in T must occur in Pos to the right of M, since P grees with suffix Pos(M) only up to chrcter l. Theorem 1.3 Using the Lcp vlues, the serch lgorithm does t most O(n + log m) comprisons, nd runs in tht time. Proof First, by simple cse nlysis it is esy to verify tht neither l nor r ever decreses during the binry serch. Also, every itertion of the binry serch either termintes the serch, or exmines no chrcters of P, or ends fter the first mismtch occurs in tht itertion. In the two cses (l = r or Lcp(L,M) = l > r) where the lgorithm exmines chrcter during the itertion, the comprisons strt with chrcter mx(l,r) of P. 6
7 Suppose there re k chrcters of P exmined in tht itertion. Then there re k 1 mtches during the itertion, nd t the end of the itertion mx(l,r) increses by k 1 (either l or r is chnged to tht vlue). Hence t the strt of ny itertion, chrcter mx(l,r) of P my hve lredy been exmined, but the next chrcter in P hs not been. Tht mens t most one redundnt comprison per itertion is done. So no more thn log 2 m redundnt comprisons re done overll. There re t most n non-redundnt comprisons of chrcters of P, giving totl bound of n + log m comprisons. All the other work in the lgorithm cn clerly be done in time proportionl to these comprisons. 1.5 How to obtin the Lcp vlues The originl method of Mnber nd Myers hs been improved nd notes on it will be distributed. 1.6 Where do lrge lphbet problems rise? A lrge prt of the motivtion for suffix rrys comes from problems tht rise in using suffix trees when the underlying lphbet is lrge. So it is nturl to sk where lrge lphbets occur. First, there re nturl lnguges, such s Chinese, with lrge lphbets (using some computer representtion of the Chinese pictogrms.) But most lrge lphbets of interest to us rise becuse the string contins numbers, ech of which is treted s chrcter. One simple exmple is string tht comes from picture where ech chrcter in the string gives the color or grey level of pixel. String nd substring mtching problems where the lphbet contins numbers, nd where P nd T re lrge, lso rise in computtionl problems in moleculr biology. One exmple is the mp mtching problem. A restriction enzyme mp for single enzyme specifies the loctions in DNA string where copies of certin substring ( restriction enzyme recognition site) occurs. Ech such site my be seprted from the next one by mny thousnds of bses. Hence, the restriction enzyme mp for tht single enzyme is represented s string consisting of sequence of integers specifying the distnces between successive enzyme sites. Considered s string, ech integer is chrcter of (huge) underlying lphbet. More generlly, mp my disply the sites of mny different ptterns of interest (whether or not they re restriction enzyme sites), so the string (mp) consists of chrcters from finite lphbet (representing the known ptterns of interest) lternting with integers giving the distnces between such sites. The lphbet is huge becuse the rnge of integers is huge, nd since distnces re often known with high precision, rounding is not used. Moreover, the vriety of known ptterns of interest is itself lrge (see [3] for librry of severl hundred known, significnt ptterns). It often hppens tht DNA substring is obtined nd studied without knowing 7
8 where tht DNA is locted in the genome or whether tht substring hs been previously reserched. If both the new nd the previously studied DNA re fully sequenced nd put in dtbse, then the issue of previous work or loctions would be solved by exct string mtching. But most DNA substrings tht re studied re not fully sequenced mps re esier nd cheper to obtin thn sequences. So the following mtching problem on mps rises nd trnsltes to n mtching problem on strings with lrge lphbets: Given n estblished (restriction enzyme) mp for lrge DNA string, nd mp from smller string, determine if the smller string is substring of the lrger one. Since ech mp is represented s n lternting string of chrcters nd integers, the underlying lphbet is huge. This provides one motivtion for using suffix rrys for mtching or substring serching in plce of suffix trees. Of course, the problems become more difficult in the presence of errors, when the integers in the strings my not be exct, or when sites re missing or spuriously dded. Tht problem, clled mp lignment, is discussed in Section??. References [1] U. Mnber nd G. Myers. Suffix rrys: new method for on-line serch. SIAM J. on Computing, pges , [2] H. Mrtinez. An efficient method for finding repets in moleculr sequences. Nucl. Acids Res., 11: , [3] E. Trifonov nd V. Brendel. Gnomic: A dictionry of genetic codes. VCH Press, Deerfield, Florid,
Module 9: Tries and String Matching
Module 9: Tries nd String Mtching CS 240 - Dt Structures nd Dt Mngement Sjed Hque Veronik Irvine Tylor Smith Bsed on lecture notes by mny previous cs240 instructors Dvid R. Cheriton School of Computer
More informationModule 9: Tries and String Matching
Module 9: Tries nd String Mtching CS 240 - Dt Structures nd Dt Mngement Sjed Hque Veronik Irvine Tylor Smith Bsed on lecture notes by mny previous cs240 instructors Dvid R. Cheriton School of Computer
More informationWhere did dynamic programming come from?
Where did dynmic progrmming come from? String lgorithms Dvid Kuchk cs302 Spring 2012 Richrd ellmn On the irth of Dynmic Progrmming Sturt Dreyfus http://www.eng.tu.c.il/~mi/cd/ or50/1526-5463-2002-50-01-0048.pdf
More informationThe Regulated and Riemann Integrals
Chpter 1 The Regulted nd Riemnn Integrls 1.1 Introduction We will consider severl different pproches to defining the definite integrl f(x) dx of function f(x). These definitions will ll ssign the sme vlue
More informationAlignment of Long Sequences. BMI/CS Spring 2016 Anthony Gitter
Alignment of Long Sequences BMI/CS 776 www.biostt.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostt.wisc.edu Gols for Lecture Key concepts how lrge-scle lignment differs from the simple cse the
More informationp-adic Egyptian Fractions
p-adic Egyptin Frctions Contents 1 Introduction 1 2 Trditionl Egyptin Frctions nd Greedy Algorithm 2 3 Set-up 3 4 p-greedy Algorithm 5 5 p-egyptin Trditionl 10 6 Conclusion 1 Introduction An Egyptin frction
More informationTheoretical foundations of Gaussian quadrature
Theoreticl foundtions of Gussin qudrture 1 Inner product vector spce Definition 1. A vector spce (or liner spce) is set V = {u, v, w,...} in which the following two opertions re defined: (A) Addition of
More information19 Optimal behavior: Game theory
Intro. to Artificil Intelligence: Dle Schuurmns, Relu Ptrscu 1 19 Optiml behvior: Gme theory Adversril stte dynmics hve to ccount for worst cse Compute policy π : S A tht mximizes minimum rewrd Let S (,
More information2. Udi Manber, Gene Myers: Suffix arrays: A new method for online string searching, SIAM Journal on Computing 22:935-48,1993
7 Suffix rrys This exposition ws developed y Clemens Gröpl nd Knut Reinert. It is sed on the following sources, which re ll recommended reding: 1. Simon J. Puglisi, W. F. Smyth, nd Andrew Turpin, A txonomy
More informationState space systems analysis (continued) Stability. A. Definitions A system is said to be Asymptotically Stable (AS) when it satisfies
Stte spce systems nlysis (continued) Stbility A. Definitions A system is sid to be Asymptoticlly Stble (AS) when it stisfies ut () = 0, t > 0 lim xt () 0. t A system is AS if nd only if the impulse response
More information4.4 Areas, Integrals and Antiderivatives
. res, integrls nd ntiderivtives 333. Ares, Integrls nd Antiderivtives This section explores properties of functions defined s res nd exmines some connections mong res, integrls nd ntiderivtives. In order
More informationI1 = I2 I1 = I2 + I3 I1 + I2 = I3 + I4 I 3
2 The Prllel Circuit Electric Circuits: Figure 2- elow show ttery nd multiple resistors rrnged in prllel. Ech resistor receives portion of the current from the ttery sed on its resistnce. The split is
More informationConvert the NFA into DFA
Convert the NF into F For ech NF we cn find F ccepting the sme lnguge. The numer of sttes of the F could e exponentil in the numer of sttes of the NF, ut in prctice this worst cse occurs rrely. lgorithm:
More informationNew data structures to reduce data size and search time
New dt structures to reduce dt size nd serch time Tsuneo Kuwbr Deprtment of Informtion Sciences, Fculty of Science, Kngw University, Hirtsuk-shi, Jpn FIT2018 1D-1, No2, pp1-4 Copyright (c)2018 by The Institute
More informationBalanced binary search trees
02110 Inge Li Gørtz Overview Blnced binry serch trees: Red-blck trees nd 2-3-4 trees Amortized nlysis Dynmic progrmming Network flows String mtching String indexing Computtionl geometry Introduction to
More informationUNIFORM CONVERGENCE. Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3
UNIFORM CONVERGENCE Contents 1. Uniform Convergence 1 2. Properties of uniform convergence 3 Suppose f n : Ω R or f n : Ω C is sequence of rel or complex functions, nd f n f s n in some sense. Furthermore,
More informationMath 61CM - Solutions to homework 9
Mth 61CM - Solutions to homework 9 Cédric De Groote November 30 th, 2018 Problem 1: Recll tht the left limit of function f t point c is defined s follows: lim f(x) = l x c if for ny > 0 there exists δ
More informationAnatomy of a Deterministic Finite Automaton. Deterministic Finite Automata. A machine so simple that you can understand it in less than one minute
Victor Admchik Dnny Sletor Gret Theoreticl Ides In Computer Science CS 5-25 Spring 2 Lecture 2 Mr 3, 2 Crnegie Mellon University Deterministic Finite Automt Finite Automt A mchine so simple tht you cn
More informationRiemann Sums and Riemann Integrals
Riemnn Sums nd Riemnn Integrls Jmes K. Peterson Deprtment of Biologicl Sciences nd Deprtment of Mthemticl Sciences Clemson University August 26, 2013 Outline 1 Riemnn Sums 2 Riemnn Integrls 3 Properties
More informationReview of basic calculus
Review of bsic clculus This brief review reclls some of the most importnt concepts, definitions, nd theorems from bsic clculus. It is not intended to tech bsic clculus from scrtch. If ny of the items below
More informationRecitation 3: More Applications of the Derivative
Mth 1c TA: Pdric Brtlett Recittion 3: More Applictions of the Derivtive Week 3 Cltech 2012 1 Rndom Question Question 1 A grph consists of the following: A set V of vertices. A set E of edges where ech
More informationThe Wave Equation I. MA 436 Kurt Bryan
1 Introduction The Wve Eqution I MA 436 Kurt Bryn Consider string stretching long the x xis, of indeterminte (or even infinite!) length. We wnt to derive n eqution which models the motion of the string
More informationAPPROXIMATE INTEGRATION
APPROXIMATE INTEGRATION. Introduction We hve seen tht there re functions whose nti-derivtives cnnot be expressed in closed form. For these resons ny definite integrl involving these integrnds cnnot be
More informationRiemann Sums and Riemann Integrals
Riemnn Sums nd Riemnn Integrls Jmes K. Peterson Deprtment of Biologicl Sciences nd Deprtment of Mthemticl Sciences Clemson University August 26, 203 Outline Riemnn Sums Riemnn Integrls Properties Abstrct
More informationReview of Calculus, cont d
Jim Lmbers MAT 460 Fll Semester 2009-10 Lecture 3 Notes These notes correspond to Section 1.1 in the text. Review of Clculus, cont d Riemnn Sums nd the Definite Integrl There re mny cses in which some
More informationW. We shall do so one by one, starting with I 1, and we shall do it greedily, trying
Vitli covers 1 Definition. A Vitli cover of set E R is set V of closed intervls with positive length so tht, for every δ > 0 nd every x E, there is some I V with λ(i ) < δ nd x I. 2 Lemm (Vitli covering)
More informationMath 1B, lecture 4: Error bounds for numerical methods
Mth B, lecture 4: Error bounds for numericl methods Nthn Pflueger 4 September 0 Introduction The five numericl methods descried in the previous lecture ll operte by the sme principle: they pproximte the
More informationNUMERICAL INTEGRATION. The inverse process to differentiation in calculus is integration. Mathematically, integration is represented by.
NUMERICAL INTEGRATION 1 Introduction The inverse process to differentition in clculus is integrtion. Mthemticlly, integrtion is represented by f(x) dx which stnds for the integrl of the function f(x) with
More informationLecture 1. Functional series. Pointwise and uniform convergence.
1 Introduction. Lecture 1. Functionl series. Pointwise nd uniform convergence. In this course we study mongst other things Fourier series. The Fourier series for periodic function f(x) with period 2π is
More informationDATA Search I 魏忠钰. 复旦大学大数据学院 School of Data Science, Fudan University. March 7 th, 2018
DATA620006 魏忠钰 Serch I Mrch 7 th, 2018 Outline Serch Problems Uninformed Serch Depth-First Serch Bredth-First Serch Uniform-Cost Serch Rel world tsk - Pc-mn Serch problems A serch problem consists of:
More informationand that at t = 0 the object is at position 5. Find the position of the object at t = 2.
7.2 The Fundmentl Theorem of Clculus 49 re mny, mny problems tht pper much different on the surfce but tht turn out to be the sme s these problems, in the sense tht when we try to pproimte solutions we
More informationLecture 14: Quadrature
Lecture 14: Qudrture This lecture is concerned with the evlution of integrls fx)dx 1) over finite intervl [, b] The integrnd fx) is ssumed to be rel-vlues nd smooth The pproximtion of n integrl by numericl
More informationMinimal DFA. minimal DFA for L starting from any other
Miniml DFA Among the mny DFAs ccepting the sme regulr lnguge L, there is exctly one (up to renming of sttes) which hs the smllest possile numer of sttes. Moreover, it is possile to otin tht miniml DFA
More information1.3 Regular Expressions
56 1.3 Regulr xpressions These hve n importnt role in describing ptterns in serching for strings in mny pplictions (e.g. wk, grep, Perl,...) All regulr expressions of lphbet re 1.Ønd re regulr expressions,
More informationLecture 20: Numerical Integration III
cs4: introduction to numericl nlysis /8/0 Lecture 0: Numericl Integrtion III Instructor: Professor Amos Ron Scribes: Mrk Cowlishw, Yunpeng Li, Nthnel Fillmore For the lst few lectures we hve discussed
More informationDuality # Second iteration for HW problem. Recall our LP example problem we have been working on, in equality form, is given below.
Dulity #. Second itertion for HW problem Recll our LP emple problem we hve been working on, in equlity form, is given below.,,,, 8 m F which, when written in slightly different form, is 8 F Recll tht we
More informationLecture Note 9: Orthogonal Reduction
MATH : Computtionl Methods of Liner Algebr 1 The Row Echelon Form Lecture Note 9: Orthogonl Reduction Our trget is to solve the norml eution: Xinyi Zeng Deprtment of Mthemticl Sciences, UTEP A t Ax = A
More information1.4 Nonregular Languages
74 1.4 Nonregulr Lnguges The number of forml lnguges over ny lphbet (= decision/recognition problems) is uncountble On the other hnd, the number of regulr expressions (= strings) is countble Hence, ll
More informationNumerical integration
2 Numericl integrtion This is pge i Printer: Opque this 2. Introduction Numericl integrtion is problem tht is prt of mny problems in the economics nd econometrics literture. The orgniztion of this chpter
More informationx = b a N. (13-1) The set of points used to subdivide the range [a, b] (see Fig. 13.1) is
Jnury 28, 2002 13. The Integrl The concept of integrtion, nd the motivtion for developing this concept, were described in the previous chpter. Now we must define the integrl, crefully nd completely. According
More informationMath 8 Winter 2015 Applications of Integration
Mth 8 Winter 205 Applictions of Integrtion Here re few importnt pplictions of integrtion. The pplictions you my see on n exm in this course include only the Net Chnge Theorem (which is relly just the Fundmentl
More informationCMSC 330: Organization of Programming Languages. DFAs, and NFAs, and Regexps (Oh my!)
CMSC 330: Orgniztion of Progrmming Lnguges DFAs, nd NFAs, nd Regexps (Oh my!) CMSC330 Spring 2018 Types of Finite Automt Deterministic Finite Automt (DFA) Exctly one sequence of steps for ech string All
More informationInfinite Geometric Series
Infinite Geometric Series Finite Geometric Series ( finite SUM) Let 0 < r < 1, nd let n be positive integer. Consider the finite sum It turns out there is simple lgebric expression tht is equivlent to
More informationFingerprint idea. Assume:
Fingerprint ide Assume: We cn compute fingerprint f(p) of P in O(m) time. If f(p) f(t[s.. s+m 1]), then P T[s.. s+m 1] We cn compre fingerprints in O(1) We cn compute f = f(t[s+1.. s+m]) from f(t[s.. s+m
More informationFinite Automata. Informatics 2A: Lecture 3. John Longley. 22 September School of Informatics University of Edinburgh
Lnguges nd Automt Finite Automt Informtics 2A: Lecture 3 John Longley School of Informtics University of Edinburgh jrl@inf.ed.c.uk 22 September 2017 1 / 30 Lnguges nd Automt 1 Lnguges nd Automt Wht is
More informationReview of Gaussian Quadrature method
Review of Gussin Qudrture method Nsser M. Asi Spring 006 compiled on Sundy Decemer 1, 017 t 09:1 PM 1 The prolem To find numericl vlue for the integrl of rel vlued function of rel vrile over specific rnge
More informationIntermediate Math Circles Wednesday, November 14, 2018 Finite Automata II. Nickolas Rollick a b b. a b 4
Intermedite Mth Circles Wednesdy, Novemer 14, 2018 Finite Automt II Nickols Rollick nrollick@uwterloo.c Regulr Lnguges Lst time, we were introduced to the ide of DFA (deterministic finite utomton), one
More informationChapter 3 Polynomials
Dr M DRAIEF As described in the introduction of Chpter 1, pplictions of solving liner equtions rise in number of different settings In prticulr, we will in this chpter focus on the problem of modelling
More informationAlgorithms in Computational. Biology. More on BWT
Algorithms in Computtionl Biology More on BWT tody Plese Lst clss! don't forget to submit And by next (vi emil, repo ) implementtion week or shre prgectfltw get Not I would like reding overview! Discuss
More informationGoals: Determine how to calculate the area described by a function. Define the definite integral. Explore the relationship between the definite
Unit #8 : The Integrl Gols: Determine how to clculte the re described by function. Define the definite integrl. Eplore the reltionship between the definite integrl nd re. Eplore wys to estimte the definite
More information(e) if x = y + z and a divides any two of the integers x, y, or z, then a divides the remaining integer
Divisibility In this note we introduce the notion of divisibility for two integers nd b then we discuss the division lgorithm. First we give forml definition nd note some properties of the division opertion.
More information5.7 Improper Integrals
458 pplictions of definite integrls 5.7 Improper Integrls In Section 5.4, we computed the work required to lift pylod of mss m from the surfce of moon of mss nd rdius R to height H bove the surfce of the
More informationPhysics 116C Solution of inhomogeneous ordinary differential equations using Green s functions
Physics 6C Solution of inhomogeneous ordinry differentil equtions using Green s functions Peter Young November 5, 29 Homogeneous Equtions We hve studied, especilly in long HW problem, second order liner
More informationGRADE 4. Division WORKSHEETS
GRADE Division WORKSHEETS Division division is shring nd grouping Division cn men shring or grouping. There re cndies shred mong kids. How mny re in ech shre? = 3 There re 6 pples nd go into ech bsket.
More informationMath 360: A primitive integral and elementary functions
Mth 360: A primitive integrl nd elementry functions D. DeTurck University of Pennsylvni October 16, 2017 D. DeTurck Mth 360 001 2017C: Integrl/functions 1 / 32 Setup for the integrl prtitions Definition:
More informationTypes of Finite Automata. CMSC 330: Organization of Programming Languages. Comparing DFAs and NFAs. NFA for (a b)*abb.
CMSC 330: Orgniztion of Progrmming Lnguges Finite Automt 2 Types of Finite Automt Deterministic Finite Automt () Exctly one sequence of steps for ech string All exmples so fr Nondeterministic Finite Automt
More informationChapter 0. What is the Lebesgue integral about?
Chpter 0. Wht is the Lebesgue integrl bout? The pln is to hve tutoril sheet ech week, most often on Fridy, (to be done during the clss) where you will try to get used to the ides introduced in the previous
More informationCMSC 330: Organization of Programming Languages
CMSC 330: Orgniztion of Progrmming Lnguges Finite Automt 2 CMSC 330 1 Types of Finite Automt Deterministic Finite Automt (DFA) Exctly one sequence of steps for ech string All exmples so fr Nondeterministic
More informationTypes of Finite Automata. CMSC 330: Organization of Programming Languages. Comparing DFAs and NFAs. Comparing DFAs and NFAs (cont.) Finite Automata 2
CMSC 330: Orgniztion of Progrmming Lnguges Finite Automt 2 Types of Finite Automt Deterministic Finite Automt () Exctly one sequence of steps for ech string All exmples so fr Nondeterministic Finite Automt
More informationUSA Mathematical Talent Search Round 1 Solutions Year 21 Academic Year
1/1/21. Fill in the circles in the picture t right with the digits 1-8, one digit in ech circle with no digit repeted, so tht no two circles tht re connected by line segment contin consecutive digits.
More informationA REVIEW OF CALCULUS CONCEPTS FOR JDEP 384H. Thomas Shores Department of Mathematics University of Nebraska Spring 2007
A REVIEW OF CALCULUS CONCEPTS FOR JDEP 384H Thoms Shores Deprtment of Mthemtics University of Nebrsk Spring 2007 Contents Rtes of Chnge nd Derivtives 1 Dierentils 4 Are nd Integrls 5 Multivrite Clculus
More informationf(x) dx, If one of these two conditions is not met, we call the integral improper. Our usual definition for the value for the definite integral
Improper Integrls Every time tht we hve evluted definite integrl such s f(x) dx, we hve mde two implicit ssumptions bout the integrl:. The intervl [, b] is finite, nd. f(x) is continuous on [, b]. If one
More informationThe graphs of Rational Functions
Lecture 4 5A: The its of Rtionl Functions s x nd s x + The grphs of Rtionl Functions The grphs of rtionl functions hve severl differences compred to power functions. One of the differences is the behvior
More informationCS 275 Automata and Formal Language Theory
CS 275 Automt nd Forml Lnguge Theory Course Notes Prt II: The Recognition Problem (II) Chpter II.6.: Push Down Automt Remrk: This mteril is no longer tught nd not directly exm relevnt Anton Setzer (Bsed
More informationA recursive construction of efficiently decodable list-disjunct matrices
CSE 709: Compressed Sensing nd Group Testing. Prt I Lecturers: Hung Q. Ngo nd Atri Rudr SUNY t Bufflo, Fll 2011 Lst updte: October 13, 2011 A recursive construction of efficiently decodble list-disjunct
More informationFarey Fractions. Rickard Fernström. U.U.D.M. Project Report 2017:24. Department of Mathematics Uppsala University
U.U.D.M. Project Report 07:4 Frey Frctions Rickrd Fernström Exmensrete i mtemtik, 5 hp Hledre: Andres Strömergsson Exmintor: Jörgen Östensson Juni 07 Deprtment of Mthemtics Uppsl University Frey Frctions
More informationLecture 19: Continuous Least Squares Approximation
Lecture 19: Continuous Lest Squres Approximtion 33 Continuous lest squres pproximtion We begn 31 with the problem of pproximting some f C[, b] with polynomil p P n t the discrete points x, x 1,, x m for
More informationHow do we solve these things, especially when they get complicated? How do we know when a system has a solution, and when is it unique?
XII. LINEAR ALGEBRA: SOLVING SYSTEMS OF EQUATIONS Tody we re going to tlk bout solving systems of liner equtions. These re problems tht give couple of equtions with couple of unknowns, like: 6 2 3 7 4
More informationWeek 10: Line Integrals
Week 10: Line Integrls Introduction In this finl week we return to prmetrised curves nd consider integrtion long such curves. We lredy sw this in Week 2 when we integrted long curve to find its length.
More informationGlobal alignment. Genome Rearrangements Finding preserved genes. Lecture 18
Computt onl Biology Leture 18 Genome Rerrngements Finding preserved genes We hve seen before how to rerrnge genome to obtin nother one bsed on: Reversls Knowledge of preserved bloks (or genes) Now we re
More informationProperties of Integrals, Indefinite Integrals. Goals: Definition of the Definite Integral Integral Calculations using Antiderivatives
Block #6: Properties of Integrls, Indefinite Integrls Gols: Definition of the Definite Integrl Integrl Clcultions using Antiderivtives Properties of Integrls The Indefinite Integrl 1 Riemnn Sums - 1 Riemnn
More information7.2 The Definite Integral
7.2 The Definite Integrl the definite integrl In the previous section, it ws found tht if function f is continuous nd nonnegtive, then the re under the grph of f on [, b] is given by F (b) F (), where
More informationThe steps of the hypothesis test
ttisticl Methods I (EXT 7005) Pge 78 Mosquito species Time of dy A B C Mid morning 0.0088 5.4900 5.5000 Mid Afternoon.3400 0.0300 0.8700 Dusk 0.600 5.400 3.000 The Chi squre test sttistic is the sum of
More informationChapter Five: Nondeterministic Finite Automata. Formal Language, chapter 5, slide 1
Chpter Five: Nondeterministic Finite Automt Forml Lnguge, chpter 5, slide 1 1 A DFA hs exctly one trnsition from every stte on every symol in the lphet. By relxing this requirement we get relted ut more
More informationDIRECT CURRENT CIRCUITS
DRECT CURRENT CUTS ELECTRC POWER Consider the circuit shown in the Figure where bttery is connected to resistor R. A positive chrge dq will gin potentil energy s it moves from point to point b through
More informationHandout: Natural deduction for first order logic
MATH 457 Introduction to Mthemticl Logic Spring 2016 Dr Json Rute Hndout: Nturl deduction for first order logic We will extend our nturl deduction rules for sententil logic to first order logic These notes
More informationNumerical Analysis: Trapezoidal and Simpson s Rule
nd Simpson s Mthemticl question we re interested in numericlly nswering How to we evlute I = f (x) dx? Clculus tells us tht if F(x) is the ntiderivtive of function f (x) on the intervl [, b], then I =
More informationNatural examples of rings are the ring of integers, a ring of polynomials in one variable, the ring
More generlly, we define ring to be non-empty set R hving two binry opertions (we ll think of these s ddition nd multipliction) which is n Abelin group under + (we ll denote the dditive identity by 0),
More informationCompiler Design. Fall Lexical Analysis. Sample Exercises and Solutions. Prof. Pedro C. Diniz
University of Southern Cliforni Computer Science Deprtment Compiler Design Fll Lexicl Anlysis Smple Exercises nd Solutions Prof. Pedro C. Diniz USC / Informtion Sciences Institute 4676 Admirlty Wy, Suite
More informationDISCRETE MATHEMATICS HOMEWORK 3 SOLUTIONS
DISCRETE MATHEMATICS 21228 HOMEWORK 3 SOLUTIONS JC Due in clss Wednesdy September 17. You my collborte but must write up your solutions by yourself. Lte homework will not be ccepted. Homework must either
More informationSection 6: Area, Volume, and Average Value
Chpter The Integrl Applied Clculus Section 6: Are, Volume, nd Averge Vlue Are We hve lredy used integrls to find the re etween the grph of function nd the horizontl xis. Integrls cn lso e used to find
More information20 MATHEMATICS POLYNOMIALS
0 MATHEMATICS POLYNOMIALS.1 Introduction In Clss IX, you hve studied polynomils in one vrible nd their degrees. Recll tht if p(x) is polynomil in x, the highest power of x in p(x) is clled the degree of
More information2.4 Linear Inequalities and Interval Notation
.4 Liner Inequlities nd Intervl Nottion We wnt to solve equtions tht hve n inequlity symol insted of n equl sign. There re four inequlity symols tht we will look t: Less thn , Less thn or
More informationCMDA 4604: Intermediate Topics in Mathematical Modeling Lecture 19: Interpolation and Quadrature
CMDA 4604: Intermedite Topics in Mthemticl Modeling Lecture 19: Interpoltion nd Qudrture In this lecture we mke brief diversion into the res of interpoltion nd qudrture. Given function f C[, b], we sy
More informationFirst Midterm Examination
Çnky University Deprtment of Computer Engineering 203-204 Fll Semester First Midterm Exmintion ) Design DFA for ll strings over the lphet Σ = {,, c} in which there is no, no nd no cc. 2) Wht lnguge does
More informationChapter 2 Finite Automata
Chpter 2 Finite Automt 28 2.1 Introduction Finite utomt: first model of the notion of effective procedure. (They lso hve mny other pplictions). The concept of finite utomton cn e derived y exmining wht
More informationJim Lambers MAT 169 Fall Semester Lecture 4 Notes
Jim Lmbers MAT 169 Fll Semester 2009-10 Lecture 4 Notes These notes correspond to Section 8.2 in the text. Series Wht is Series? An infinte series, usully referred to simply s series, is n sum of ll of
More informationIs there an easy way to find examples of such triples? Why yes! Just look at an ordinary multiplication table to find them!
PUSHING PYTHAGORAS 009 Jmes Tnton A triple of integers ( bc,, ) is clled Pythgoren triple if exmple, some clssic triples re ( 3,4,5 ), ( 5,1,13 ), ( ) fond of ( 0,1,9 ) nd ( 119,10,169 ). + b = c. For
More informationLecture 3. In this lecture, we will discuss algorithms for solving systems of linear equations.
Lecture 3 3 Solving liner equtions In this lecture we will discuss lgorithms for solving systems of liner equtions Multiplictive identity Let us restrict ourselves to considering squre mtrices since one
More information1 Probability Density Functions
Lis Yn CS 9 Continuous Distributions Lecture Notes #9 July 6, 28 Bsed on chpter by Chris Piech So fr, ll rndom vribles we hve seen hve been discrete. In ll the cses we hve seen in CS 9, this ment tht our
More informationWe will see what is meant by standard form very shortly
THEOREM: For fesible liner progrm in its stndrd form, the optimum vlue of the objective over its nonempty fesible region is () either unbounded or (b) is chievble t lest t one extreme point of the fesible
More informationp(t) dt + i 1 re it ireit dt =
Note: This mteril is contined in Kreyszig, Chpter 13. Complex integrtion We will define integrls of complex functions long curves in C. (This is bit similr to [relvlued] line integrls P dx + Q dy in R2.)
More informationParse trees, ambiguity, and Chomsky normal form
Prse trees, miguity, nd Chomsky norml form In this lecture we will discuss few importnt notions connected with contextfree grmmrs, including prse trees, miguity, nd specil form for context-free grmmrs
More information221B Lecture Notes WKB Method
Clssicl Limit B Lecture Notes WKB Method Hmilton Jcobi Eqution We strt from the Schrödinger eqution for single prticle in potentil i h t ψ x, t = [ ] h m + V x ψ x, t. We cn rewrite this eqution by using
More information1 Structural induction, finite automata, regular expressions
Discrete Structures Prelim 2 smple uestions s CS2800 Questions selected for spring 2017 1 Structurl induction, finite utomt, regulr expressions 1. We define set S of functions from Z to Z inductively s
More informationP 3 (x) = f(0) + f (0)x + f (0) 2. x 2 + f (0) . In the problem set, you are asked to show, in general, the n th order term is a n = f (n) (0)
1 Tylor polynomils In Section 3.5, we discussed how to pproximte function f(x) round point in terms of its first derivtive f (x) evluted t, tht is using the liner pproximtion f() + f ()(x ). We clled this
More informationDynamic Fully-Compressed Suffix Trees
Motivtion Dynmic FCST s Conclusions Dynmic Fully-Compressed Suffix Trees Luís M. S. Russo Gonzlo Nvrro Arlindo L. Oliveir INESC-ID/IST {lsr,ml}@lgos.inesc-id.pt Dept. of Computer Science, University of
More informationSUMMER KNOWHOW STUDY AND LEARNING CENTRE
SUMMER KNOWHOW STUDY AND LEARNING CENTRE Indices & Logrithms 2 Contents Indices.2 Frctionl Indices.4 Logrithms 6 Exponentil equtions. Simplifying Surds 13 Opertions on Surds..16 Scientific Nottion..18
More informationMORE FUNCTION GRAPHING; OPTIMIZATION. (Last edited October 28, 2013 at 11:09pm.)
MORE FUNCTION GRAPHING; OPTIMIZATION FRI, OCT 25, 203 (Lst edited October 28, 203 t :09pm.) Exercise. Let n be n rbitrry positive integer. Give n exmple of function with exctly n verticl symptotes. Give
More information1 The Riemann Integral
The Riemnn Integrl. An exmple leding to the notion of integrl (res) We know how to find (i.e. define) the re of rectngle (bse height), tringle ( (sum of res of tringles). But how do we find/define n re
More information