Learning to model sequences generated by switching distributions

Size: px
Start display at page:

Download "Learning to model sequences generated by switching distributions"

Transcription

1 earning to model sequenes generated by swithing distributions Yoav Freund A Bell abs 00 Mountain Ave Murray Hill NJ USA Dana on omputer Siene nstitute Hebrew University Jerusalem srael Abstrat We study eiient algorithms or solving the ollowing problem whih we all the swithing distributions learning problem A sequene over a inite alphabet is generated in the ollowing way he sequene is a onatenation o runs eah o whih is a onseutive subsequene ah run is generated by independent rom draws rom a distribution over where is an element in a set o distributions he learning algorithm is given this sequene its goal is to ind approximations o the distributions give an approximate segmentation o the sequene into its onstituting runs We give an eiient algorithm or solving this problem show onditions under whih the algorithm is guaranteed to work with high probability 1 ntrodution Our work is motivated by the Hidden Markov Model HMM) he HMM is a model or the distribution o sequenes over a inite alphabet $ An ' HMM onsists o a inite number o eah o whih is assoiated with a hidden states! # distribution over the alphabet here is a transition probability *) assoiated with eah pair o states he HMM an be seen as a model whih generates ininite sequenes as ollows At eah time step the model generates a single harater rom aording to where is its urrent hidden state it then makes a transition to a new state with probability *) HMMs are a popular model in the ontext o speeh analysis One an view the hidden state as representing the state o the voal trat o the speaker whih is not diretly observable but ontrols the distribution o the observable sounds he Baum- Welh algorithm [2] is the predominant algorithm or learning HMMs rom examples produes n many real-world ases this algorithm produes aurate hypotheses ater a small num- ber o iterations here is almost no theory or explaining why Baum-Welh perorms so well in some ases badly in others he theoretial results regarding the problem o learning HMMs o whih we are aware are mostly negative Abe Warmuth [1] Gillman Sipser [5] show that learning HMMs is N-hard under various onditions he model o sequenesthat we onsider is similar to the HMM with the restrition that the transition probabilities assign a very high value to the transition rom eah hidden state to itsel n other words the model tends to stay at the same hidden state or long periods o time swith rom state to state only inrequently Suh an assumption an be justiied in the ontext o speeh analysis beause the time sale in whih speeh is sampled is usually an order o magnitude smaller than the time sale o hanges in the voal trat he assumption o the inrequent transitions lets us alleviate the problem o estimating the transition probabilities rephrase the learning problem in a slightly dierent way n our new ormulation the learning problem onsists o two interdependent problems: the modeling problem whih is to estimate the distributions the segmentation problem whih is to partition the sequene into short runs that orrespond to the dierent distributions Given a solution to the segmentation problem the transition probabilities an be easily estimated We deine the swithing distributions learning problem as ollows he learning target onsists o distributions a segmentation sequene over the integers he segmentation sequene is a onatenation o runs eah o whih is a repetition o a single index he 0 element is the index o the distribution that generates the th element 45 8 o the sequene 2 We assume that we are given a sequene over a inite alphabet We as- :9 sume that ie that the same distribution is used in many dierent runs he learner reeives a single sequene o length that is generated by the target its goal is to generate a hypothesis segmentation a set o hypothesis distributions whih are lose to those o the target his problem is related to the problem o learning swithing onepts studied by Blum halasani [3] However in their setup the swithing entities are onepts ie mappings rom some domain to =< while in our setup the swithing enti- For an introdution on HMMs their use in speeh analysis the use o Hidden Markov Model see abiner Juang []

2 2 ties are distributions over a single spae n this work we give an eiient algorithm or learning swithing distributions We desribe several variants o the algorithm eah o whih is guaranteed to sueed under slightly dierent onditions regarding the proess whih is generating the sequene Our algorithm works in the ollowing general way t starts by inding rough approximations o the distributions his is done by inding short subsequenes o whih with high probability are generated mostly by a single distribution Starting with these approximations o the distributions the algorithm iterates the ollowing two steps whih are very similar to the expetation maximization steps o the Baum- Welh algorithm 1 Using the approximate distributions the algorithm inds an approximate segmentation o the sequene 2 From the approximate segmentation the algorithm inds new estimates o the distributions Our analysis shows that i the errors in the initial estimates o the distributions are suiiently small then the re-estimation proess desribed above onverges rapidly Speiially i the errors in the initial estimates are smaller than a onstant ator o the distane between the target distributions then the number o mistakes in the segmentation is with high probability smaller than the error in the re-estimated distributions is n other words i the sequene is long enough with respet to the number o runs other parameters o the soure whih we speiy later) then even rough initial estimates o the target distributions are suiient to ahieve very aurate estimates within a single iteration o the algorithm he paper is organized as ollows n the Setion 2 we give the exat statement o the swithing distributions problem n Setion 3 we desribe the general algorithm that we use to solve the problem he details o the algorithm depend on the number o distributions on the amount o inormation given to the algorithm onerning the dierent parameters o the problem n Setion 4 we present our main results Our strongest result is or is given in Setion n Setion we give a general algorithm or in Setion 8 we desribe how to treat unspeiied parameters 2 Desription o the roblem We are interested in the ollowing problem et 0 80 be$ an alphabet o size et 0 $ be the target segmentation sequene ontaining 0 at0 most runs where a run is a onseutive subsequene! onsisting o! # repetitions o a single index in $ et - be the set o target probability vetors where or eah #4$ is a dimensional probability vetor deined over We denote by the probability assigned to by ie the oordinate o ) 2 We assume that a single sequene o elements rom is generated aording to the target - in the ollowing manner For eah the element is hosen independently at rom aording to the distribution deined by ' *) We are interested in algorithms whih given suh a sequene onstrut a hypothesis 3 - where is a hypothesis segmentation sequene - is a set o distributions he error o a hypothesis = - with respet to the target - is deined as ollows -10 = ' - 8#9 :) < '1 *)*= is a measure o divergene between distribution vetors We give results with respet to three divergene measures: = A B D AF B 3 = A B D AG B H = A B J AK B We use H to denote the errors o a hypothesis with respet to H respetively As or any distribution vetors A B = A B M = A B N H = A B -1 is the most sensitive measure o the error o the hypothesis - H is the least sensitive measure ntuitively a hypothesis with small error is one whih deines a sequene o distributions that is very similar to the one whih generated the sequene the target distributions - are all ar rom eah other then the at that the error o a hypothesis is small implies that to eah target distribution there orresponds a hypothesis distribution whih is lose to it that this hypothesis distribution is mathed to it on most o the sequene his means that a hypothesis whih has small error solves both the segmentation the modeling problems desribed above A learning algorithm or the swithing distributions problem reeives as input a single sequene together with an auray parameter O < a reliability parameter < Ater time polynomial in 1O 1 the algorithm outputs a hypothesis We require that there exists a polynomial 1O Q suh that i O then with probability at least the error o the hypothesis is smaller than O 3 he general algorithm n this setion we desribe an eiient algorithm or solving the swithing distributions learning problem Some elements o the algorithm are let unspeiied hese elements are implemented dierently or the ase o two target distributions ) or the general ase o more than two distributions are desribed in detail in the ollowing setions he reason or the two dierent implementations is that we were able to derive better results or the ase by using proedures whih exploit the at that the sequene is generated by no more than two distributions he algorithm is desribed in Figure 1 t onsists o two parts n the irst part the algorithm inds rough approximations o the target distributions his is implemented as will be shown in more detail in the ollowing setions by loating short subsequenes that appear to be generated eah by a single distribution he seond part o the algorithm is an iterative part in whih the approximate distributions are used to generate an approximate segmentation this segmentation is then used to re-estimate the distributions hese two steps are repeated S times or On iteration a ost vetor UW some S is assoiated with eah approximate distribution his ost vetor is used to alulate a total ost or any hypothesis segmentation o the sequene as deined in step 2b o the algorithm he algorithm

3 3 O O General algorithm or learning swithing distributions nput: A sequene - he size o the alphabet - he number o unknown distributions - the maximal number o runs - the minimal ration o the sequene that orresponds to eah distribution 1 nitialization: Find initial approximations o the target distributions: 2 Do or!# : a) Set the ost vetors ' $ $ as untions o the distributions b) Find the segmentation ) * - ' whih minimizes the total ost: 954 $ 8 ) = $ <=? 9A@ - 9 9:8 ) alulate the new estimates o the distributions For BDF HG set J to the empirial distribution o the elements o or whih ) -2K #B 3 Output the segmentation )MON the distributions Figure 1: he general learning algorithm selets the segmentation that ahieves the minimal ost Using this segmentation the algorithm generates new estimates o the distributions the proess repeats Finally ater suh iterations the hypothesis QS - is output We were not able to demonstrate that several iterations ahieve substantially better perormane than a single one so our analysis onentrates on the ase S However in pratie it seems likely that additional iterations would improve the auray o the hypothesis We return to this point at the end o Setion Finding the segmentation with at most runs that minimizes the total ost or a given set o ost vetors an be perormed in time Q H using a dynami programming tehnique whih is essentially the same as the well known iterbi algorithm [8] he ost vetors are hosen so that with high probability the segmentation with the lowest total ost does not dier signiiantly rom the target segmentation More speiially our goal is to selet ost vetors that satisy the ollowing two properties: 1) he expeted ost o the target segmentation is smaller than the expeted ost o any other segmentation with at most runs) 2) With high probability the segmentation that minimizes the ost on a sample sequene has a small number o segmentation errors One the segmentation with the lowest total ost is ound the probability distributions are re-estimated the proess is repeated he key property o this iterative proess as we shall show in the ollowing setions is that i the error o the initial estimates o the distributions is smaller than some threshold then the iterative proess inreases the auray o the models very rapidly We shall show that this threshold need not depend on the approximation parameter O but rather is o the order o the smallest distane between any pair o target distributions 4 Summary o esults Beore we present our results we add the ollowing notation For # we use to denote the number o elements in orresponding to the distribution deine U FWJ 2435 n other words U is the minimal ration o elements in orresponding to any single distributions We use Y to denote the length o the shortest run in We summarize our results in our theorems whih orrespond to dierent variants o the algorithm desribed in Setion 3 n the irst variant we assume that there are only two target distributions that the algorithm reeives as input U he error o the algorithm s hypothesis is measured with respet to whih is the most sensitive distane measure we use heorem 1 Main heorem or the ase o two distributions) here exists a swithing distributions learning algorithm suh that or any target * deining sequenes whose length satisies [Z*\ ^]`_ \ Q a a ^]b_ d eu U g then the algorithm when reeiving a sequene generated aording to the target together with U outputs a hypothesis 1 suh that with probability at least -1 1 O n the seond variant we remove the assumption that is at most two but we assume that the algorithm is given Y We would like to note that we do not assume the algorithm knows n this ase we require that the target distributions are well separated We measure the separation between the target distributions by 2 35 hwij - `k `k Our requirements on are that U are polynomially related to O that Y grows logarithmially with he error o the algorithm s hypothesis is measured with respet to t atually suies that the algorithm reeive only an upper bound on a lower bound on n suh a ase in the theorem below are simply exhanged by these bounds A similar statement holds in the ase o heorem 2 where we may assume that the algorithm reeives only a lower bound on l as well as an upper bound on )

4 U U O a O heorem 2 Main heorem or general ) here exists a swithing distributions learning algorithm suh that or any 5 Useul nequalities target - deining sequenes whose length satisies [Z*\ a FWJ O H O? are a onatenation o runs o length at least a Y Y *Z \ then the algorithm when reeiving a sequene generated aording to the target together with Y outputs a hypothesis - suh that with probability at least -1 - N O he third variant o our algorithm needs no input other than makes only the assumption that he error o the algorithm s hypothesis is measured with respet to H heorem 3 Main heorem or two distributions unspeiied parameters) here exists a swithing distributions learning algorithm suh that or any target * deining sequenes whose length satisies *Z*\ ^]`_ \ Q a a ]`_ eu U then the algorithm when reeiving a sequene aording to the target outputs a hypothesis that with probability at least -1 H 1 generated suh O n our ourth variant we do not assume that the algorithm is given any o the parameters o the problem However we still require the existene o a lower bound on Y whih grows logarithmially with that U are polynomially related to O he error o the algorithm s hypothesis is measured with respet to H heorem 4 Main heorem or general unspeiied parameters) here exists a swithing distributions learning algorithm suh that or any target - deining sequenes whose length satisies [Z \ FWJ O H O? are a onatenation o runs o length at least Y Y *Z \ then the algorithm when reeiving a sequene generated aording to the target outputs a hypothesis - suh that with probability at least -1 H 1 - < O he theorems presented above result rom an analysis o a single iteration o step 2 o the learning algorithm t is natural to ask whether by inreasing the number o iterations we ould signiiantly weaken the requirements on Our analysis does not support suh a laim we shall later disuss this question briely a n the proos o our theorems lemmas we apply several well known inequalities that are given here as lemmas he irst is a hernohoeding type bound derived by ittlestone [] the seond is due to Sanov [4]page 292) #$ emma 5 For < let be independent rom variables where et hen or < - -A \ - emma Sanov s nequality) For an alphabet o size let be a dimensional probability vetor deined over et S be a rom sample o size generated aording to let the type o S be a dimensional probability vetor deined as ollows: the th oordinate o requeny o the symbol in S hen or any! 1 where is the relative < is the Kullbak eibler K) divergene between the distributions is deined as ollows: 2435 ') One more useul inequality is the ollowing: emma et * $# be two probability vetors then: he ase o wo Distributions n this setion we onsider the ase in whih there are two target probability distributions over an alphabet o size he Y distane between the two vetors plays an important role in our analysis is denoted by We assume that the algorithm knows the number o swithes in the sequene U the minimum between the ration o elements in the target sequene generated by the ration generated by As noted in Setion 4 it suies that the algorithm have only an upper bound on a lower bound on U n Setion 8 we give bounds on the additional error inurred when we remove this assumption n Figure 2 we desribe how we get initial approximations - o respetively we deine the pair o ost vetors U U given approximations o he initial approximation proedure is based on the ollowing two ats he irst is that by deinition o U both or or there exists a subsequene o o length U that was generated solely aording to that probability distribution he seond at is that in expetation the distane between pairs o empirial distributions deined based on pairs

5 0 B nitial estimates or two distributions 1 Set 1 l the minimum length o any run in is known set - 1 l ) 2 For eah K ` 1! G eah ` G let 9 - be the ration o the elements in N whih are equal to et $ 9 denote the vetor 9 - '0 9 - `0 9-3 Find the pair o indies K K 1! or whih $ 9! $ 9 is maximized 4 Set 9 9 hoie o ost vetors or two distributions 1 Given estimates let! # 4 $ - ' - or Bg 4 2 et $ be deined as ollows: 1 e $ - - -!g 0 2! e $ - 3 Set $ $ $ $ let - )! # * Figure 2: he initialization proedure the hoie o the ost vetors or the ase o two distributions - o subsequenes is maximized when one o the subsequenes was generated aording to the seond aording to Using these two ats we show that the pair o distributions hosen are good initial approximations o Our main result or the ase o two distributions is stated in heorem 1 whose proo is divided into three lemmas We use the ollowing notation Similar to the deinition o whih is used with respet to the target segmentation we use 0 *43* to denote the number o sequene loations or whih 3* 0 )k *3* to denote the number o sequene loations or whih Observe that the deinition o the distribution o sample sequenes is invariant under renaming o the target distribution t is thus lear that the hypothesis generated by any learning algorithm an be lose to the target only up to an arbitrary permutation o the distributions his issue is side-stepped by our deinition o the error o a hypothesis but the lemmas that onstitute the proo o heorem 1 reer to the one-to-one mapping whih deines this permutation However as the proos are idential or any permutation we shall reer to this mapping only in the statements o the lemmas otherwise onsider only the ase in whih the names given to the distributions by the target by the hypothesis are idential First we show that i the length o the sequene is large enough then the initial estimates - o are guaranteed to have small error emma 8 nitialization error or two distributions) or some <89 the length o the sequene satisies F a U then with probability at least there exists a one-toone mapping :< = suh that or? ) A@- roo: For a given index $ AB 1 ] let *) DF 2435 DF let be suh that the ration o symbols in *) G)H that were generated by are respetively We say that a vetor is pure i either < or For every pair o indies G B let - J t is not hard to show that the expeted value o - ahieves the maximal value o or a pair o pure vetors satisying < ) < ) whih we set subse- We shall show that or some J[ quently) or every K 1) et D DM be the pair o vetors or whih -D DM is maximized Without loss o generality we assume that G M Based on our hoie o B there must exist a pair o pure vetors having < Sine or this pure pair - or the maximizing pair D G - D DM as well t ollows that D 3 N@O DM< t remains to show that the seletion o assumed in quation 1 exists Applying emma using the bound on the K divergene given in emma we have that or every < QB 4 S B U _W \ here are at most suh vetors hene by setting as in the statement o the lemma we ind that with probability at least 3 or every as required Seondly we show that i the errors o the estimates are not too large then with high probability only a small number o mistakes exist in the segmentation with the lowest ost deined using the orresponding ost vetors 2)

6 @ < 0 k U U k k emma 9 Segmentation error or two distributions) et FWJ ^]b_ ' ) 1) = where ranges over the two) one-to-one mappings rom to et : be a mapping that ahieves the minimum 1 then with probability at least )? ) 8 )? ) d Ë where is the th hypothesis segmentation roo: Assume without loss o generality that : is the identity mapping By our deinition o the ost vetors U4 UW we have that or any given segmentation 43 9 U U = 9 U U 3 = ) 3 ) * 3 3) where U UW is as deined in Figure 2 We irst veriy that 4< < hene or every segmentation 3 U U U U 3 For let - thus - we get - 4) \ 5) ' ) - ' ) ) \ ) 9 = - - ) Similarly 3 t remains to bound the probability that 8 U4 U )3 8 U U or a segmentation 3 suh that ) *43 8 ) a )D 3 2 )*) ) ost 8 U U )3 8 U U 0 ontributions o the elements o or whih 9) 10) he dierene in the total is a sum o the hese ontributions are independent the dierene between the two values that are possible or eah element is exatly 1 hus applying emma 5 the probability that the total ost o is smaller than that o is upper bounded by 9 U U 3 = 9 U U = -1A! -1A \ ) ) where is a weighted average o \ ) ) ) ) ) ) ) d ) ) $# ) ' 11) : 12) 13) From our assumption on the size o ) ) by substituting our bounds on the probability above is bounded by Sine the total number o possible segmentation ontaining at most runs is at most we get the statement o the lemma n the third lemma we show that i the segmentation small number o errors then good new estimates o an be omputed using has a emma 10 eevaluation error or two distributions) Sup- is a one to one mapping suh that pose :Q# = $ ^]`_ )? ) 1 U hen or? ) = where ) * 2U )? ) 1 ) roo: Assume without loss o generality that : is the identity mapping For a given segmentation 3 or let be the empirial distribution o the elements in 3 We deine the deviation - 3* o mean as ollows: - * \ ) *3* ) 3 ) 3 8 ) 3 We show that there exists or whih rom its 14) whih is set subsequently) suh that or every 3 having ]`_ d ) 3Q ) *3Q - *3* - 3*< hus in partiular - - N8 ) ) ) 8 ) K ) 1 ) ) 1 ) ) ) K 15) 1) 1) where the last inequality ollows rom our assumptions on he same bound on is obtained analogously =

7 U 9 Y t remains to bound Applying emma using the bound on the K divergene given in emma we have that with probability at least or every segmentation 3 having ^]b_ d ) *43* ) 3QD or - * * 3 < 41 2U 18) 8 roo Sketh o heorem 1: -1 n order to show that < O we separate our argument into three parts aording to the values o U First we onsider the ase in whih both U are greater than O n this ase we show that the segmentation error is O that the estimation error is O t is not hard to veriy that or the right hoie o onstants the total error is then bounded by O he ondition on the length o the sequene together with emma 8 guarantee that with high probability the initialization proedure generates estimates $ whose error is smaller than Using this in emma 9 we get that the segmentation error is O U From our assumption on we have that this expression is upper bounded by O as desired Using our assumption on the bound in emma 10 we get that the error o the re-estimated probabilities is O? as required U O then it is not hard to veriy that under our assumption on the size o with probability at least the hypothesis whih onstitutes o a single run together with the orresponding probability distribution deined based on the omplete sequene) has error bounded by O As U is part o the input to the algorithm the algorithm heks or this ondition output the single-run hypothesis O then the single-run hypothesis has small error as well However as is not part o the input the algorithm annot hek or this ondition Nonetheless it is easy to hek that in this situation any hypothesis segmentation 3 or whih hwj d *3Q *43* U together with the orresponding estimated probability distributions) has error at most O hus as a last step o the algorithm we hek i FWJ d U this ondition holds then we simply output Otherwise we output the single-run hypothesis t ollows rom the irst part o this proo that i then the ondition holds with high probability O? heorem 1 summarizes the onvergene properties o the algorithm when step 3 o the algorithm is exeuted one t is interesting to onsider the onvergene properties that are implied by emmas 9 10 in a little more detail Aording to emma 9 i the errors o the estimates o are smaller than a onstant ration o 3 then the number o segmentation errors an be dereased to an arbitrarily small ration o by inreasing this in turn dereases the error o the new estimates o the distributions that result rom the segmentation ntuitively this means that there is a basin o attration o the estimates o the distributions whose size is proportional to the distane the algorithm starts with an estimate o the distribution that is within this basin o attration then the estimate it gives in the ollowing iteration is very aurate On the other h our analysis predits that iterating the algorithm more than one will not signiiantly improve the segmentation error his is beause even i the estimates o the distributions are peret there will be segmentation errors as a result o the romness o the sequene Next we onsider the error in the estimates o the distributions From our analysis we see that given our lower bound on the length o the sequene the estimation error ater the initialization step ish O G re-estimation step is O the error ater a single A General Number o Distributions n this setion we onsider the ase in whih there are target probability distributions n this ase the problem o inding good ost vetors to be assoiated with the dierent distributions is more ompliated We have to hoose ost vetors one per distribution but we also have to satisy = sets o requirements beause eah pair o distributions has to be distinguished well by the orresponding pair o ost vetors he hoie o the ost vetors that we have ound is desribed in Figure 3 his hoie is a generalization o the squared loss in the binary ase ) allows us to bound the error o the algorithm aording to -1 whih is weaker ie less sensitive than -1 ) he initialization proedure that is used in the two-distribution ase annot be applied to the general ase he initialization proedure that we suggest requires that the segmentation sequene is suh that all o the runs in are longer than some integer parameter Y his allows the algorithm to assume that in eah segment o o length Y there is at most one swith between distributions onsequently it an identiy i both parts were generated almost solely by the same probability distribution he initialization proedure is desribed in Figure 3 We assume that the algorithm reeives the parameters Y as input his assumption is removed at some additional ost) in the next setion he roo o heorem 2 is very similar to the proo o heorem 1 ollows rom the lemmas given below emma 11 nitialization error or general ) or some the minimal length Y o runs in satisies Y 9 a = then with probability at least the initialization proedure desribed in Figure 3 generates a set o estimates whih approximates in the ollowing $ $ sense here exists a one-to-one mapping : rom to suh that or all? 1) roo: he initialization proedure starts by onsidering all pairs o windows o the orm D DF gnoring the dependene on

8 nitial estimates o the distributions general ase) 1 Set 2 Set 3 For eah K 1! eah F G let 9 - be the ration o the elements in N whih are equal to et $ 9 denote the vetor et be the set o vetors - $ 9 $ 9 or all K 1! suh that $ 9! $ Start with as an empty set repeat the ollowing until beomes empty: Selet an element $ rom add it to emove rom all elements $ suh that $ 0! $ Output the set as the set o initial distribution estimates hoie o ost vetors general ase) With eah estimate assoiate the ost vetor $ - * - - k 8 -! - * k 8 - -!^ - Figure 3: he initialization proedure the hoie o the ost vetors or the general ase GF DF D he assumption on the minimal length o runs in implies that eah suh pair overlaps with at most one swith between runs in We onentrate on some partiular pair o windows whose orresponding estimates are DF assume without loss o generality that the swith is in the seond window et be the distribution whih generates the elements beore the swith be the distribution ater the swith hen the expeted value o is the expeted value o or DF is some < Using emma our requirement on Y we show that or all KB the atual value o whih is assoiated with eah window is lose to its expeted value Speiially with probability at least the Y distane between eah estimate its expeted value is at most We thus get that the pair GF is added to the set only i so that DF his implies that the estimate that is added to the set ase satisies DF hus the estimates in the set target distributions are all 19) 20) in this o atual On the other h we are guaranteed that eah run ontains at least one pair o estimates that are both pure it is easy to hek that the auray o the estimates guarantees that this pair will be aepted into hus eah target distribution has a least one representative in he goal o step 4 o the initialization proedure is to ind a single representative or eah target distribution Simple arguments show that the distane between two dierent representatives o the same distribution is at the distane between two representatives o two dierent distributions is at From the requirement on Y in the statement o the lemma we get hus step 4 generates a set o distributions with one representative per target distribution as required in the statement o the lemma emma 12 Segmentation error or general ) et hwj ^]`_ ' ) ) ) where ranges over all one-to-one mappings rom $ to $ minimum J then with probability at least the segmentation that minimizes the total ost satisies? )`k 1) 1 N a - j k d! Assume that : is the mapping that ahieves the roo: Assume without loss o generality that : is the identity mapping Assume some element in the sequene is generated by the distribution then assigned a ost U rom the ost vetor U k whih orresponds to the approximated distribution k Deine U# k hen the expeted value o U is - $ ' k - k k!*)

9 9 U - - W ) hus the expeted ontribution o any element to the total ost is a sum o two terms he irst term depends only on the underlying target distribution the seond term depends on the Y norm o the approximation error `k Similarly to the analysis in the proo o emma 9 we now onsider the dierene between the total osts orresponding to the approximate ost vetors o two dierent segmentations he irst is the orret segmentation the seond is the best segmentation whih minimizes the total ost on learly only the elements on whih dier ontribute to the dierene in the total ost From quation 21) we get that the expeted total dierene is 9 U - = 9 U - 3 = ) k k - j `k = hus i k K or all -3 22) then the expeted ost dierene is guaranteed to be negative Using the triangle inequality or the Y norm we ind that the onditions on the minimal separation on the maximal error imply that -3 `k We thus get that the expeted total dierene in osts is bounded by 9 U - = 9 U - 3 = k - j ) k 23) We want to bound the probability that a segmentation with many errors has a total ost whih is smaller than that o the orret segmentation he ost dierene is a sum o the ost dierenes in the plaes where the segmentations disagree hese are independent rom variables t is easy to hek that the oordinates o any ost vetor are bounded in the range < We an thus apply emma 5 get that or any individual segmentation the probability that the segmentation ahieves smaller total ost than the orret segmentation is upper bounded by thus the ost dierene is bounded in 9 U - 3 = U _ W d 9 U - = k - j )k ' 24) here are less than segmentations with runs ombining this with the last equation we get the statement o the lemma emma 13 ' eevaluation error or general ) Suppose : $ = $ is a one to one mapping suh that ]`_? 1) - j k )`k W U then with probability at least there exists a one-to-one mapping :A# = ' $ suh that or every #? ) ) where ) * D 8 2U 8 1 he roo o emma 13 is the same as the proo o emma 10 exept or the use o the Y norm in plae o the Y norm 8 reating Unspeiied parameters So ar we have assumed that our learning algorithms reeive as input parameters that desribe properties o the sequene n the two distribution ase the algorithm reeives as input U in the multiple distribution ase the algorithm reeives Y n this setion we show that these assumptions an be removed with some inrease in the error i we measure the error o the hypothesis with - H eall that -1 H is always smaller than - - thus the bounds we previously got or ases where the algorithm reeives additional input an be used here he idea is to try all possible settings o the unknown parameters then selet the best resulting hypothesis While the dierent algorithms reeive as input dierent subsets o U Y the only variable that is ontrolled by these parameters is B the length o the segments that are used or initialization As there are at most possible settings or B we need to run the algorithm times then ompare the hypotheses he dierent hypotheses are ompared in terms o the total ost deined in Figure 3 the hypothesis with the minimal total ost is seleted More ormally assume that we have hypotheses ) For eah 9 aording to the ost vetors U ) - we alulate the total ost U ) - = as deined in Figure 3 or the ase o more than two distributions We then selet the hypothesis with the minimal total ost as our inal hypothesis he ollowing lemma shows that the error o the seleted hypothesis is with high probability only slightly larger than that o the best hypothesis in the set emma 14 et be a sequene generated aording to the segmentation the distributions et ) - - be a set o hypotheses let argmin 8 U ) - hen with probability at least with respet to the distribution o -1 H )3 - N FWJ - -1 H ) - ) where )

10 2 roo: We irst onsider the expeted value o the total ost o the sequene or some ixed hypothesis = - et be the th element in hus is generated by the distribution '1 *) then assigned a ost U aording to the ost vetor U ' :) whih orresponds to the approximated distribution ' :) Deine - ' *) < ' *) hen similarly to quation 21) the expeted value o U is U ' *) - 25) Summing the expeted value o U over that the expeted total ost o is - - we get Where is a onstant independent o the hypothesis he range o all o the U s is < they are independent rom variables hus the deviation o the ost o the sequene rom its expeted value an be bounded using emma 5 as ollows 2 - U - U U _ W \ We now return to the set o hypotheses As eah o these hypotheses is seleted rom a set o at most segmentations we ind that by setting to be!k we get that with probability at least the total osts o all M possible hypotheses are within o their respetive expeted values hus the expeted total ost o the segmentation that appears to be best is within o the expeted value o the atual best segmentation Whih gives the statement o the lemma Applying emma 14 to heorem 1 we get heorem 3 n this ase we add to the idate hypotheses that orrespond to the hoies o B the hypothesis * H where H onsists o a single run is the empirial distribution o the sequene his hypothesis is aurate i either U or are smaller than O thus we an remove any assumption on U Applying emma 14 to heorem 2 we get heorem 4 n this ase we still need to assume a lower bound on Y in order to assure that at least one o the exeutions o the algorithm sueeds n both ases the requirement on that we have to make in order that the extra error be smaller than O is Z 9 O 1 = his requirement is subsumed by the requirements in heorems 1 2 Aknowledgments We would like to thank Yoram Singer ali ishby or disussions that motivated this work Dana on would like to thank the shkol Fellowship or its support art o this work was done while Dana on was visiting A Bell aboratories eerenes [1] N Abe M K Warmuth On the omputational omplexity o approximating distributions by probabilisti automata Mahine earning 923) 1992 Speial issue or O90 [2] eonard Baum J A agon An inequality with appliations to statistial estimation or probabilisti untions o markov proesses to a model or eology Bulletin o the Amerian Mathematial Soiety 3: [3] Avrim Blum rasad halasani earning swithing onepts n roeedings o the Fith Annual AM Workshop on omputational earning heory pages July 1992 [4] homas M over Joy A homas lements o normation heory Wiley 1991 [5] David Gillman Mihael Sipser nerene minimization o hidden markov hains n roeedings o the Seventh Annual AM onerene on omputational earning heory pages July 1994 [] Nik ittlestone Notes on the derivation o herno-type bounds or sums o rom variables Unpublished Manusript 1990 [] abiner B H Juang An introdution to hidden markov models ASS Magazine 31):4 1 January 198 [8] AJ iterbi rror bounds or onvulutional odes an asymptotially optimal deoding algorithm rans norm heory 13: Open roblems t is lear that improvements in our upper bounds should be possible n addition it would be interesting to searh or mathing lower bounds sine no lower bounds on the ahievable error are known t would be interesting to show whether our bounds an be improved i we use the likelihood ost as done in the Baum-Welh algorithm rather than the osts we invented A natural saling o the parameters is to ix $ the swithing rate to let = Our analysis breaks down in this ase

Complexity of Regularization RBF Networks

Complexity of Regularization RBF Networks Complexity of Regularization RBF Networks Mark A Kon Department of Mathematis and Statistis Boston University Boston, MA 02215 mkon@buedu Leszek Plaskota Institute of Applied Mathematis University of Warsaw

More information

Hankel Optimal Model Order Reduction 1

Hankel Optimal Model Order Reduction 1 Massahusetts Institute of Tehnology Department of Eletrial Engineering and Computer Siene 6.245: MULTIVARIABLE CONTROL SYSTEMS by A. Megretski Hankel Optimal Model Order Redution 1 This leture overs both

More information

Acoustic Attenuation Performance of Helicoidal Resonator Due to Distance Change from Different Cross-sectional Elements of Cylindrical Ducts

Acoustic Attenuation Performance of Helicoidal Resonator Due to Distance Change from Different Cross-sectional Elements of Cylindrical Ducts Exerpt rom the Proeedings o the COMSOL Conerene 1 Paris Aousti Attenuation Perormane o Helioidal Resonator Due to Distane Change rom Dierent Cross-setional Elements o Cylindrial Duts Wojieh ŁAPKA* Division

More information

Nonreversibility of Multiple Unicast Networks

Nonreversibility of Multiple Unicast Networks Nonreversibility of Multiple Uniast Networks Randall Dougherty and Kenneth Zeger September 27, 2005 Abstrat We prove that for any finite direted ayli network, there exists a orresponding multiple uniast

More information

ON THE FOUR-COLOUR CONJECTURE. By W. T. TUTTE. [Received 27 November 1945 Read 13 December 1945]

ON THE FOUR-COLOUR CONJECTURE. By W. T. TUTTE. [Received 27 November 1945 Read 13 December 1945] ON THE FOUR COLOUR CONJECTURE 37 ON THE FOUR-COLOUR CONJECTURE By W. T. TUTTE [Reeived 7 November 945 Read 3 Deember 945]. Introdution The maps disussed in this paper are dissetions o suraes into simple

More information

The Effectiveness of the Linear Hull Effect

The Effectiveness of the Linear Hull Effect The Effetiveness of the Linear Hull Effet S. Murphy Tehnial Report RHUL MA 009 9 6 Otober 009 Department of Mathematis Royal Holloway, University of London Egham, Surrey TW0 0EX, England http://www.rhul.a.uk/mathematis/tehreports

More information

Millennium Relativity Acceleration Composition. The Relativistic Relationship between Acceleration and Uniform Motion

Millennium Relativity Acceleration Composition. The Relativistic Relationship between Acceleration and Uniform Motion Millennium Relativity Aeleration Composition he Relativisti Relationship between Aeleration and niform Motion Copyright 003 Joseph A. Rybzyk Abstrat he relativisti priniples developed throughout the six

More information

A simple expression for radial distribution functions of pure fluids and mixtures

A simple expression for radial distribution functions of pure fluids and mixtures A simple expression for radial distribution funtions of pure fluids and mixtures Enrio Matteoli a) Istituto di Chimia Quantistia ed Energetia Moleolare, CNR, Via Risorgimento, 35, 56126 Pisa, Italy G.

More information

Lightpath routing for maximum reliability in optical mesh networks

Lightpath routing for maximum reliability in optical mesh networks Vol. 7, No. 5 / May 2008 / JOURNAL OF OPTICAL NETWORKING 449 Lightpath routing for maximum reliability in optial mesh networks Shengli Yuan, 1, * Saket Varma, 2 and Jason P. Jue 2 1 Department of Computer

More information

Computer Science 786S - Statistical Methods in Natural Language Processing and Data Analysis Page 1

Computer Science 786S - Statistical Methods in Natural Language Processing and Data Analysis Page 1 Computer Siene 786S - Statistial Methods in Natural Language Proessing and Data Analysis Page 1 Hypothesis Testing A statistial hypothesis is a statement about the nature of the distribution of a random

More information

PREDICTION OF CONCRETE COMPRESSIVE STRENGTH

PREDICTION OF CONCRETE COMPRESSIVE STRENGTH PREDICTION OF CONCRETE COMPRESSIVE STRENGTH Dunja Mikuli (1), Ivan Gabrijel (1) and Bojan Milovanovi (1) (1) Faulty o Civil Engineering, University o Zagreb, Croatia Abstrat A ompressive strength o onrete

More information

A Queueing Model for Call Blending in Call Centers

A Queueing Model for Call Blending in Call Centers A Queueing Model for Call Blending in Call Centers Sandjai Bhulai and Ger Koole Vrije Universiteit Amsterdam Faulty of Sienes De Boelelaan 1081a 1081 HV Amsterdam The Netherlands E-mail: {sbhulai, koole}@s.vu.nl

More information

CMSC 451: Lecture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017

CMSC 451: Lecture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017 CMSC 451: Leture 9 Greedy Approximation: Set Cover Thursday, Sep 28, 2017 Reading: Chapt 11 of KT and Set 54 of DPV Set Cover: An important lass of optimization problems involves overing a ertain domain,

More information

Likelihood-confidence intervals for quantiles in Extreme Value Distributions

Likelihood-confidence intervals for quantiles in Extreme Value Distributions Likelihood-onfidene intervals for quantiles in Extreme Value Distributions A. Bolívar, E. Díaz-Franés, J. Ortega, and E. Vilhis. Centro de Investigaión en Matemátias; A.P. 42, Guanajuato, Gto. 36; Méxio

More information

Reliability Estimation of Solder Joints Under Thermal Fatigue with Varying Parameters by using FORM and MCS

Reliability Estimation of Solder Joints Under Thermal Fatigue with Varying Parameters by using FORM and MCS Proeedings o the World Congress on Engineering 2007 Vol II Reliability Estimation o Solder Joints Under Thermal Fatigue with Varying Parameters by using FORM and MCS Ouk Sub Lee, Yeon Chang Park, and Dong

More information

Singular Event Detection

Singular Event Detection Singular Event Detetion Rafael S. Garía Eletrial Engineering University of Puerto Rio at Mayagüez Rafael.Garia@ee.uprm.edu Faulty Mentor: S. Shankar Sastry Researh Supervisor: Jonathan Sprinkle Graduate

More information

23.1 Tuning controllers, in the large view Quoting from Section 16.7:

23.1 Tuning controllers, in the large view Quoting from Section 16.7: Lesson 23. Tuning a real ontroller - modeling, proess identifiation, fine tuning 23.0 Context We have learned to view proesses as dynami systems, taking are to identify their input, intermediate, and output

More information

Supplementary Materials

Supplementary Materials Supplementary Materials Neural population partitioning and a onurrent brain-mahine interfae for sequential motor funtion Maryam M. Shanehi, Rollin C. Hu, Marissa Powers, Gregory W. Wornell, Emery N. Brown

More information

Methods of evaluating tests

Methods of evaluating tests Methods of evaluating tests Let X,, 1 Xn be i.i.d. Bernoulli( p ). Then 5 j= 1 j ( 5, ) T = X Binomial p. We test 1 H : p vs. 1 1 H : p>. We saw that a LRT is 1 if t k* φ ( x ) =. otherwise (t is the observed

More information

The Hanging Chain. John McCuan. January 19, 2006

The Hanging Chain. John McCuan. January 19, 2006 The Hanging Chain John MCuan January 19, 2006 1 Introdution We onsider a hain of length L attahed to two points (a, u a and (b, u b in the plane. It is assumed that the hain hangs in the plane under a

More information

ON THE LEAST PRIMITIVE ROOT EXPRESSIBLE AS A SUM OF TWO SQUARES

ON THE LEAST PRIMITIVE ROOT EXPRESSIBLE AS A SUM OF TWO SQUARES #A55 INTEGERS 3 (203) ON THE LEAST PRIMITIVE ROOT EPRESSIBLE AS A SUM OF TWO SQUARES Christopher Ambrose Mathematishes Institut, Georg-August Universität Göttingen, Göttingen, Deutshland ambrose@uni-math.gwdg.de

More information

A NETWORK SIMPLEX ALGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM

A NETWORK SIMPLEX ALGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM NETWORK SIMPLEX LGORITHM FOR THE MINIMUM COST-BENEFIT NETWORK FLOW PROBLEM Cen Çalışan, Utah Valley University, 800 W. University Parway, Orem, UT 84058, 801-863-6487, en.alisan@uvu.edu BSTRCT The minimum

More information

arxiv: v2 [math.pr] 9 Dec 2016

arxiv: v2 [math.pr] 9 Dec 2016 Omnithermal Perfet Simulation for Multi-server Queues Stephen B. Connor 3th Deember 206 arxiv:60.0602v2 [math.pr] 9 De 206 Abstrat A number of perfet simulation algorithms for multi-server First Come First

More information

Doppler effect of the rupture process of the great M W 7.9 Wenchuan earthquake

Doppler effect of the rupture process of the great M W 7.9 Wenchuan earthquake Earthq Si (1)3: 535 539 535 539 Doi: 1.17/s11589-1-75-4 Doppler eet o the rupture proess o the great M W 7.9 Wenhuan earthquake Ge Jin 1,, Youai Tang 1 Shiyong Zhou 1 and Yongshun John Chen 1 1 Institute

More information

Convergence of reinforcement learning with general function approximators

Convergence of reinforcement learning with general function approximators Convergene of reinforement learning with general funtion approximators assilis A. Papavassiliou and Stuart Russell Computer Siene Division, U. of California, Berkeley, CA 94720-1776 fvassilis,russellg@s.berkeley.edu

More information

Journal of Inequalities in Pure and Applied Mathematics

Journal of Inequalities in Pure and Applied Mathematics Journal of Inequalities in Pure and Applied Mathematis A NEW ARRANGEMENT INEQUALITY MOHAMMAD JAVAHERI University of Oregon Department of Mathematis Fenton Hall, Eugene, OR 97403. EMail: javaheri@uoregon.edu

More information

Remark 4.1 Unlike Lyapunov theorems, LaSalle s theorem does not require the function V ( x ) to be positive definite.

Remark 4.1 Unlike Lyapunov theorems, LaSalle s theorem does not require the function V ( x ) to be positive definite. Leture Remark 4.1 Unlike Lyapunov theorems, LaSalle s theorem does not require the funtion V ( x ) to be positive definite. ost often, our interest will be to show that x( t) as t. For that we will need

More information

UPPER-TRUNCATED POWER LAW DISTRIBUTIONS

UPPER-TRUNCATED POWER LAW DISTRIBUTIONS Fratals, Vol. 9, No. (00) 09 World Sientifi Publishing Company UPPER-TRUNCATED POWER LAW DISTRIBUTIONS STEPHEN M. BURROUGHS and SARAH F. TEBBENS College of Marine Siene, University of South Florida, St.

More information

Optimization of Statistical Decisions for Age Replacement Problems via a New Pivotal Quantity Averaging Approach

Optimization of Statistical Decisions for Age Replacement Problems via a New Pivotal Quantity Averaging Approach Amerian Journal of heoretial and Applied tatistis 6; 5(-): -8 Published online January 7, 6 (http://www.sienepublishinggroup.om/j/ajtas) doi:.648/j.ajtas.s.65.4 IN: 36-8999 (Print); IN: 36-96 (Online)

More information

Sensor management for PRF selection in the track-before-detect context

Sensor management for PRF selection in the track-before-detect context Sensor management for PRF seletion in the tra-before-detet ontext Fotios Katsilieris, Yvo Boers, and Hans Driessen Thales Nederland B.V. Haasbergerstraat 49, 7554 PA Hengelo, the Netherlands Email: {Fotios.Katsilieris,

More information

max min z i i=1 x j k s.t. j=1 x j j:i T j

max min z i i=1 x j k s.t. j=1 x j j:i T j AM 221: Advaned Optimization Spring 2016 Prof. Yaron Singer Leture 22 April 18th 1 Overview In this leture, we will study the pipage rounding tehnique whih is a deterministi rounding proedure that an be

More information

arxiv:gr-qc/ v2 6 Feb 2004

arxiv:gr-qc/ v2 6 Feb 2004 Hubble Red Shift and the Anomalous Aeleration of Pioneer 0 and arxiv:gr-q/0402024v2 6 Feb 2004 Kostadin Trenčevski Faulty of Natural Sienes and Mathematis, P.O.Box 62, 000 Skopje, Maedonia Abstrat It this

More information

A Characterization of Wavelet Convergence in Sobolev Spaces

A Characterization of Wavelet Convergence in Sobolev Spaces A Charaterization of Wavelet Convergene in Sobolev Spaes Mark A. Kon 1 oston University Louise Arakelian Raphael Howard University Dediated to Prof. Robert Carroll on the oasion of his 70th birthday. Abstrat

More information

Finite Formulation of Electromagnetic Field

Finite Formulation of Electromagnetic Field Finite Formulation o Eletromagneti Field Enzo TONTI Dept.Civil Engin., Univ. o Trieste, Piazzale Europa 1, 34127 Trieste, Italia. e-mail: tonti@univ.trieste.it Otober 16, 2000 Abstrat This paper shows

More information

ETNA Kent State University

ETNA Kent State University % { Eletroni Transations on Numerial Analysis. Volume 21, pp. 20-27, 2005. Copyright 2005,. ISSN 1068-9613. QR FACTORIZATIONS USING A RESTRICTED SET OF ROTATIONS DIANNE P. O LEARY AND STEPHEN S. BULLOCK

More information

Eigenvalues of tridiagonal matrix using Strum Sequence and Gerschgorin theorem

Eigenvalues of tridiagonal matrix using Strum Sequence and Gerschgorin theorem Eigenvalues o tridiagonal matrix using Strum Sequene and Gershgorin theorem T.D.Roopamala Department o Computer Siene and Engg., Sri Jayahamarajendra College o Engineering Mysore INDIA roopa_td@yahoo.o.in

More information

Chapter 8 Hypothesis Testing

Chapter 8 Hypothesis Testing Leture 5 for BST 63: Statistial Theory II Kui Zhang, Spring Chapter 8 Hypothesis Testing Setion 8 Introdution Definition 8 A hypothesis is a statement about a population parameter Definition 8 The two

More information

Most results in this section are stated without proof.

Most results in this section are stated without proof. Leture 8 Level 4 v2 he Expliit formula. Most results in this setion are stated without proof. Reall that we have shown that ζ (s has only one pole, a simple one at s =. It has trivial zeros at the negative

More information

DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS

DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS CHAPTER 4 DIGITAL DISTANCE RELAYING SCHEME FOR PARALLEL TRANSMISSION LINES DURING INTER-CIRCUIT FAULTS 4.1 INTRODUCTION Around the world, environmental and ost onsiousness are foring utilities to install

More information

Lecture 7: Sampling/Projections for Least-squares Approximation, Cont. 7 Sampling/Projections for Least-squares Approximation, Cont.

Lecture 7: Sampling/Projections for Least-squares Approximation, Cont. 7 Sampling/Projections for Least-squares Approximation, Cont. Stat60/CS94: Randomized Algorithms for Matries and Data Leture 7-09/5/013 Leture 7: Sampling/Projetions for Least-squares Approximation, Cont. Leturer: Mihael Mahoney Sribe: Mihael Mahoney Warning: these

More information

Control Theory association of mathematics and engineering

Control Theory association of mathematics and engineering Control Theory assoiation of mathematis and engineering Wojieh Mitkowski Krzysztof Oprzedkiewiz Department of Automatis AGH Univ. of Siene & Tehnology, Craow, Poland, Abstrat In this paper a methodology

More information

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fischer) January 21,

Sequence Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fischer) January 21, Sequene Analysis, WS 14/15, D. Huson & R. Neher (this part by D. Huson & J. Fisher) January 21, 201511 9 Suffix Trees and Suffix Arrays This leture is based on the following soures, whih are all reommended

More information

Error Bounds for Context Reduction and Feature Omission

Error Bounds for Context Reduction and Feature Omission Error Bounds for Context Redution and Feature Omission Eugen Bek, Ralf Shlüter, Hermann Ney,2 Human Language Tehnology and Pattern Reognition, Computer Siene Department RWTH Aahen University, Ahornstr.

More information

A Spatiotemporal Approach to Passive Sound Source Localization

A Spatiotemporal Approach to Passive Sound Source Localization A Spatiotemporal Approah Passive Sound Soure Loalization Pasi Pertilä, Mikko Parviainen, Teemu Korhonen and Ari Visa Institute of Signal Proessing Tampere University of Tehnology, P.O.Box 553, FIN-330,

More information

Development of moving sound source localization system

Development of moving sound source localization system Eletroni Journal «Tehnial Aoustis» http://www.ejta.org 2006, 11 Doh-Hyoung Kim 1, Youngjin Park 2 Korea Advaned Institute o Siene and Tehnology, ME3076, 373-1, Guseong-Dong, Yuseong-Gu, Daejon, 305-701,

More information

LECTURE NOTES FOR , FALL 2004

LECTURE NOTES FOR , FALL 2004 LECTURE NOTES FOR 18.155, FALL 2004 83 12. Cone support and wavefront set In disussing the singular support of a tempered distibution above, notie that singsupp(u) = only implies that u C (R n ), not as

More information

Physical Laws, Absolutes, Relative Absolutes and Relativistic Time Phenomena

Physical Laws, Absolutes, Relative Absolutes and Relativistic Time Phenomena Page 1 of 10 Physial Laws, Absolutes, Relative Absolutes and Relativisti Time Phenomena Antonio Ruggeri modexp@iafria.om Sine in the field of knowledge we deal with absolutes, there are absolute laws that

More information

f 2 f n where m is the total mass of the object. Expression (6a) is plotted in Figure 8 for several values of damping ( ).

f 2 f n where m is the total mass of the object. Expression (6a) is plotted in Figure 8 for several values of damping ( ). F o F o / k A = = 6 k 1 + 1 + n r n n n RESONANCE It is seen in Figure 7 that displaement and stress levels tend to build up greatly when the oring requeny oinides with the natural requeny, the buildup

More information

An I-Vector Backend for Speaker Verification

An I-Vector Backend for Speaker Verification An I-Vetor Bakend for Speaker Verifiation Patrik Kenny, 1 Themos Stafylakis, 1 Jahangir Alam, 1 and Marel Kokmann 2 1 CRIM, Canada, {patrik.kenny, themos.stafylakis, jahangir.alam}@rim.a 2 VoieTrust, Canada,

More information

3 Tidal systems modelling: ASMITA model

3 Tidal systems modelling: ASMITA model 3 Tidal systems modelling: ASMITA model 3.1 Introdution For many pratial appliations, simulation and predition of oastal behaviour (morphologial development of shorefae, beahes and dunes) at a ertain level

More information

An Adaptive Optimization Approach to Active Cancellation of Repeated Transient Vibration Disturbances

An Adaptive Optimization Approach to Active Cancellation of Repeated Transient Vibration Disturbances An aptive Optimization Approah to Ative Canellation of Repeated Transient Vibration Disturbanes David L. Bowen RH Lyon Corp / Aenteh, 33 Moulton St., Cambridge, MA 138, U.S.A., owen@lyonorp.om J. Gregory

More information

The Laws of Acceleration

The Laws of Acceleration The Laws of Aeleration The Relationships between Time, Veloity, and Rate of Aeleration Copyright 2001 Joseph A. Rybzyk Abstrat Presented is a theory in fundamental theoretial physis that establishes the

More information

Danielle Maddix AA238 Final Project December 9, 2016

Danielle Maddix AA238 Final Project December 9, 2016 Struture and Parameter Learning in Bayesian Networks with Appliations to Prediting Breast Caner Tumor Malignany in a Lower Dimension Feature Spae Danielle Maddix AA238 Final Projet Deember 9, 2016 Abstrat

More information

arxiv: v1 [physics.gen-ph] 5 Jan 2018

arxiv: v1 [physics.gen-ph] 5 Jan 2018 The Real Quaternion Relativity Viktor Ariel arxiv:1801.03393v1 [physis.gen-ph] 5 Jan 2018 In this work, we use real quaternions and the basi onept of the final speed of light in an attempt to enhane the

More information

Robust Recovery of Signals From a Structured Union of Subspaces

Robust Recovery of Signals From a Structured Union of Subspaces Robust Reovery of Signals From a Strutured Union of Subspaes 1 Yonina C. Eldar, Senior Member, IEEE and Moshe Mishali, Student Member, IEEE arxiv:87.4581v2 [nlin.cg] 3 Mar 29 Abstrat Traditional sampling

More information

Estimating the probability law of the codelength as a function of the approximation error in image compression

Estimating the probability law of the codelength as a function of the approximation error in image compression Estimating the probability law of the odelength as a funtion of the approximation error in image ompression François Malgouyres Marh 7, 2007 Abstrat After some reolletions on ompression of images using

More information

Motor Sizing Application Note

Motor Sizing Application Note PAE-TILOGY Linear Motors 70 Mill orest d. Webster, TX 77598 (8) 6-7750 ax (8) 6-7760 www.trilogysystems.om E-mail emn_support_trilogy@parker.om Motor Sizing Appliation Note By Jak Marsh Introdution Linear

More information

Product Policy in Markets with Word-of-Mouth Communication. Technical Appendix

Product Policy in Markets with Word-of-Mouth Communication. Technical Appendix rodut oliy in Markets with Word-of-Mouth Communiation Tehnial Appendix August 05 Miro-Model for Inreasing Awareness In the paper, we make the assumption that awareness is inreasing in ustomer type. I.e.,

More information

REFINED UPPER BOUNDS FOR THE LINEAR DIOPHANTINE PROBLEM OF FROBENIUS. 1. Introduction

REFINED UPPER BOUNDS FOR THE LINEAR DIOPHANTINE PROBLEM OF FROBENIUS. 1. Introduction Version of 5/2/2003 To appear in Advanes in Applied Mathematis REFINED UPPER BOUNDS FOR THE LINEAR DIOPHANTINE PROBLEM OF FROBENIUS MATTHIAS BECK AND SHELEMYAHU ZACKS Abstrat We study the Frobenius problem:

More information

Packing Plane Spanning Trees into a Point Set

Packing Plane Spanning Trees into a Point Set Paking Plane Spanning Trees into a Point Set Ahmad Biniaz Alfredo Garía Abstrat Let P be a set of n points in the plane in general position. We show that at least n/3 plane spanning trees an be paked into

More information

Case I: 2 users In case of 2 users, the probability of error for user 1 was earlier derived to be 2 A1

Case I: 2 users In case of 2 users, the probability of error for user 1 was earlier derived to be 2 A1 MUTLIUSER DETECTION (Letures 9 and 0) 6:33:546 Wireless Communiations Tehnologies Instrutor: Dr. Narayan Mandayam Summary By Shweta Shrivastava (shwetash@winlab.rutgers.edu) bstrat This artile ontinues

More information

Average Rate Speed Scaling

Average Rate Speed Scaling Average Rate Speed Saling Nikhil Bansal David P. Bunde Ho-Leung Chan Kirk Pruhs May 2, 2008 Abstrat Speed saling is a power management tehnique that involves dynamially hanging the speed of a proessor.

More information

Answers to test yourself questions

Answers to test yourself questions Answers to test yoursel questions Topi.1 Osilliations 1 a A n osillation is any motion in whih the displaement o a partile rom a ixed point keeps hanging diretion and there is a periodiity in the motion

More information

Inference-based Ambiguity Management in Decentralized Decision-Making: Decentralized Diagnosis of Discrete Event Systems

Inference-based Ambiguity Management in Decentralized Decision-Making: Decentralized Diagnosis of Discrete Event Systems Inerene-based Ambiguity Management in Deentralized Deision-Making: Deentralized Diagnosis o Disrete Event Systems Ratnesh Kumar and Shigemasa Takai Abstrat The task o deentralized deision-making involves

More information

IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE MAJOR STREET AT A TWSC INTERSECTION

IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE MAJOR STREET AT A TWSC INTERSECTION 09-1289 Citation: Brilon, W. (2009): Impedane Effets of Left Turners from the Major Street at A TWSC Intersetion. Transportation Researh Reord Nr. 2130, pp. 2-8 IMPEDANCE EFFECTS OF LEFT TURNERS FROM THE

More information

CSC2515 Winter 2015 Introduc3on to Machine Learning. Lecture 5: Clustering, mixture models, and EM

CSC2515 Winter 2015 Introduc3on to Machine Learning. Lecture 5: Clustering, mixture models, and EM CSC2515 Winter 2015 Introdu3on to Mahine Learning Leture 5: Clustering, mixture models, and EM All leture slides will be available as.pdf on the ourse website: http://www.s.toronto.edu/~urtasun/ourses/csc2515/

More information

Maximum Entropy and Exponential Families

Maximum Entropy and Exponential Families Maximum Entropy and Exponential Families April 9, 209 Abstrat The goal of this note is to derive the exponential form of probability distribution from more basi onsiderations, in partiular Entropy. It

More information

Lecture 3 - Lorentz Transformations

Lecture 3 - Lorentz Transformations Leture - Lorentz Transformations A Puzzle... Example A ruler is positioned perpendiular to a wall. A stik of length L flies by at speed v. It travels in front of the ruler, so that it obsures part of the

More information

A Small Footprint i-vector Extractor

A Small Footprint i-vector Extractor A Small Footprint i-vetor Extrator Patrik Kenny Centre de reherhe informatique de Montréal (CRIM) Patrik.Kenny@rim.a Abstrat Both the memory and omputational requirements of algorithms traditionally used

More information

Tests of fit for symmetric variance gamma distributions

Tests of fit for symmetric variance gamma distributions Tests of fit for symmetri variane gamma distributions Fragiadakis Kostas UADPhilEon, National and Kapodistrian University of Athens, 4 Euripidou Street, 05 59 Athens, Greee. Keywords: Variane Gamma Distribution,

More information

Dynamic Programming and Multi Objective Linear Programming approaches

Dynamic Programming and Multi Objective Linear Programming approaches 0 NSP Vol. () (0), 9 Dynami Programming and Multi Objetive Linear Programming approahes P. K. De and Amita Bhinher Department o Mathematis, National Institute o Tehnology SILCHAR - 788 00 (Assam) India

More information

Model-based mixture discriminant analysis an experimental study

Model-based mixture discriminant analysis an experimental study Model-based mixture disriminant analysis an experimental study Zohar Halbe and Mayer Aladjem Department of Eletrial and Computer Engineering, Ben-Gurion University of the Negev P.O.Box 653, Beer-Sheva,

More information

SQUARE ROOTS AND AND DIRECTIONS

SQUARE ROOTS AND AND DIRECTIONS SQUARE ROOS AND AND DIRECIONS We onstrut a lattie-like point set in the Eulidean plane that eluidates the relationship between the loal statistis of the frational parts of n and diretions in a shifted

More information

Einstein s Three Mistakes in Special Relativity Revealed. Copyright Joseph A. Rybczyk

Einstein s Three Mistakes in Special Relativity Revealed. Copyright Joseph A. Rybczyk Einstein s Three Mistakes in Speial Relativity Revealed Copyright Joseph A. Rybzyk Abstrat When the evidene supported priniples of eletromagneti propagation are properly applied, the derived theory is

More information

Advances in Radio Science

Advances in Radio Science Advanes in adio Siene 2003) 1: 99 104 Copernius GmbH 2003 Advanes in adio Siene A hybrid method ombining the FDTD and a time domain boundary-integral equation marhing-on-in-time algorithm A Beker and V

More information

Analysis of discretization in the direct simulation Monte Carlo

Analysis of discretization in the direct simulation Monte Carlo PHYSICS OF FLUIDS VOLUME 1, UMBER 1 OCTOBER Analysis of disretization in the diret simulation Monte Carlo iolas G. Hadjionstantinou a) Department of Mehanial Engineering, Massahusetts Institute of Tehnology,

More information

Probabilistic Simulation Approach to Evaluate the Tooth-Root Strength of Spur Gears with FEM-Based Verification

Probabilistic Simulation Approach to Evaluate the Tooth-Root Strength of Spur Gears with FEM-Based Verification Engineering, 2011, 3, 1137-1148 doi:10.4236/eng.2011.312142 Published Online Deember 2011 (http://www.sirp.org/journal/eng) Abstrat Probabilisti Simulation Approah to Evaluate the Tooth-Root Strength o

More information

Array Design for Superresolution Direction-Finding Algorithms

Array Design for Superresolution Direction-Finding Algorithms Array Design for Superresolution Diretion-Finding Algorithms Naushad Hussein Dowlut BEng, ACGI, AMIEE Athanassios Manikas PhD, DIC, AMIEE, MIEEE Department of Eletrial Eletroni Engineering Imperial College

More information

7 Max-Flow Problems. Business Computing and Operations Research 608

7 Max-Flow Problems. Business Computing and Operations Research 608 7 Max-Flow Problems Business Computing and Operations Researh 68 7. Max-Flow Problems In what follows, we onsider a somewhat modified problem onstellation Instead of osts of transmission, vetor now indiates

More information

Frequency hopping does not increase anti-jamming resilience of wireless channels

Frequency hopping does not increase anti-jamming resilience of wireless channels Frequeny hopping does not inrease anti-jamming resiliene of wireless hannels Moritz Wiese and Panos Papadimitratos Networed Systems Seurity Group KTH Royal Institute of Tehnology, Stoholm, Sweden {moritzw,

More information

Measuring & Inducing Neural Activity Using Extracellular Fields I: Inverse systems approach

Measuring & Inducing Neural Activity Using Extracellular Fields I: Inverse systems approach Measuring & Induing Neural Ativity Using Extraellular Fields I: Inverse systems approah Keith Dillon Department of Eletrial and Computer Engineering University of California San Diego 9500 Gilman Dr. La

More information

Phase Diffuser at the Transmitter for Lasercom Link: Effect of Partially Coherent Beam on the Bit-Error Rate.

Phase Diffuser at the Transmitter for Lasercom Link: Effect of Partially Coherent Beam on the Bit-Error Rate. Phase Diffuser at the Transmitter for Laserom Link: Effet of Partially Coherent Beam on the Bit-Error Rate. O. Korotkova* a, L. C. Andrews** a, R. L. Phillips*** b a Dept. of Mathematis, Univ. of Central

More information

Searching All Approximate Covers and Their Distance using Finite Automata

Searching All Approximate Covers and Their Distance using Finite Automata Searhing All Approximate Covers and Their Distane using Finite Automata Ondřej Guth, Bořivoj Melihar, and Miroslav Balík České vysoké učení tehniké v Praze, Praha, CZ, {gutho1,melihar,alikm}@fel.vut.z

More information

The universal model of error of active power measuring channel

The universal model of error of active power measuring channel 7 th Symposium EKO TC 4 3 rd Symposium EKO TC 9 and 5 th WADC Workshop nstrumentation for the CT Era Sept. 8-2 Kosie Slovakia The universal model of error of ative power measuring hannel Boris Stogny Evgeny

More information

ON A PROCESS DERIVED FROM A FILTERED POISSON PROCESS

ON A PROCESS DERIVED FROM A FILTERED POISSON PROCESS ON A PROCESS DERIVED FROM A FILTERED POISSON PROCESS MARIO LEFEBVRE and JEAN-LUC GUILBAULT A ontinuous-time and ontinuous-state stohasti proess, denoted by {Xt), t }, is defined from a proess known as

More information

MOLECULAR ORBITAL THEORY- PART I

MOLECULAR ORBITAL THEORY- PART I 5.6 Physial Chemistry Leture #24-25 MOLECULAR ORBITAL THEORY- PART I At this point, we have nearly ompleted our rash-ourse introdution to quantum mehanis and we re finally ready to deal with moleules.

More information

Wavetech, LLC. Ultrafast Pulses and GVD. John O Hara Created: Dec. 6, 2013

Wavetech, LLC. Ultrafast Pulses and GVD. John O Hara Created: Dec. 6, 2013 Ultrafast Pulses and GVD John O Hara Created: De. 6, 3 Introdution This doument overs the basi onepts of group veloity dispersion (GVD) and ultrafast pulse propagation in an optial fiber. Neessarily, it

More information

Stochastic Adaptive Learning Rate in an Identification Method: An Approach for On-line Drilling Processes Monitoring

Stochastic Adaptive Learning Rate in an Identification Method: An Approach for On-line Drilling Processes Monitoring 9 Amerian Control Conerene Hyatt Regeny Riverront, St. Louis, MO, USA June -, 9 FrB3. Stohasti Adaptive Learning Rate in an Identiiation Method: An Approah or On-line Drilling Proesses Monitoring A. BA,

More information

Relativistic Dynamics

Relativistic Dynamics Chapter 7 Relativisti Dynamis 7.1 General Priniples of Dynamis 7.2 Relativisti Ation As stated in Setion A.2, all of dynamis is derived from the priniple of least ation. Thus it is our hore to find a suitable

More information

Taste for variety and optimum product diversity in an open economy

Taste for variety and optimum product diversity in an open economy Taste for variety and optimum produt diversity in an open eonomy Javier Coto-Martínez City University Paul Levine University of Surrey Otober 0, 005 María D.C. Garía-Alonso University of Kent Abstrat We

More information

Combined Electric and Magnetic Dipoles for Mesoband Radiation, Part 2

Combined Electric and Magnetic Dipoles for Mesoband Radiation, Part 2 Sensor and Simulation Notes Note 53 3 May 8 Combined Eletri and Magneti Dipoles for Mesoband Radiation, Part Carl E. Baum University of New Mexio Department of Eletrial and Computer Engineering Albuquerque

More information

RESEARCH ON RANDOM FOURIER WAVE-NUMBER SPECTRUM OF FLUCTUATING WIND SPEED

RESEARCH ON RANDOM FOURIER WAVE-NUMBER SPECTRUM OF FLUCTUATING WIND SPEED The Seventh Asia-Paifi Conferene on Wind Engineering, November 8-1, 9, Taipei, Taiwan RESEARCH ON RANDOM FORIER WAVE-NMBER SPECTRM OF FLCTATING WIND SPEED Qi Yan 1, Jie Li 1 Ph D. andidate, Department

More information

LOGISTIC REGRESSION IN DEPRESSION CLASSIFICATION

LOGISTIC REGRESSION IN DEPRESSION CLASSIFICATION LOGISIC REGRESSIO I DEPRESSIO CLASSIFICAIO J. Kual,. V. ran, M. Bareš KSE, FJFI, CVU v Praze PCP, CS, 3LF UK v Praze Abstrat Well nown logisti regression and the other binary response models an be used

More information

Sufficient Conditions for a Flexible Manufacturing System to be Deadlocked

Sufficient Conditions for a Flexible Manufacturing System to be Deadlocked Paper 0, INT 0 Suffiient Conditions for a Flexile Manufaturing System to e Deadloked Paul E Deering, PhD Department of Engineering Tehnology and Management Ohio University deering@ohioedu Astrat In reent

More information

Geometry of Transformations of Random Variables

Geometry of Transformations of Random Variables Geometry of Transformations of Random Variables Univariate distributions We are interested in the problem of finding the distribution of Y = h(x) when the transformation h is one-to-one so that there is

More information

On the Quantum Theory of Radiation.

On the Quantum Theory of Radiation. Physikalishe Zeitshrift, Band 18, Seite 121-128 1917) On the Quantum Theory of Radiation. Albert Einstein The formal similarity between the hromati distribution urve for thermal radiation and the Maxwell

More information

Critical Reflections on the Hafele and Keating Experiment

Critical Reflections on the Hafele and Keating Experiment Critial Refletions on the Hafele and Keating Experiment W.Nawrot In 1971 Hafele and Keating performed their famous experiment whih onfirmed the time dilation predited by SRT by use of marosopi loks. As

More information

Flexible Word Design and Graph Labeling

Flexible Word Design and Graph Labeling Flexible Word Design and Graph Labeling Ming-Yang Kao Manan Sanghi Robert Shweller Abstrat Motivated by emerging appliations for DNA ode word design, we onsider a generalization of the ode word design

More information

A Functional Representation of Fuzzy Preferences

A Functional Representation of Fuzzy Preferences Theoretial Eonomis Letters, 017, 7, 13- http://wwwsirporg/journal/tel ISSN Online: 16-086 ISSN Print: 16-078 A Funtional Representation of Fuzzy Preferenes Susheng Wang Department of Eonomis, Hong Kong

More information

estimated pulse sequene we produe estimates of the delay time struture of ripple-red events. Formulation of the Problem We model the k th seismi trae

estimated pulse sequene we produe estimates of the delay time struture of ripple-red events. Formulation of the Problem We model the k th seismi trae Bayesian Deonvolution of Seismi Array Data Using the Gibbs Sampler Eri A. Suess Robert Shumway, University of California, Davis Rong Chen, Texas A&M University Contat: Eri A. Suess, Division of Statistis,

More information