arxiv: v1 [cs.lg] 4 Jul 2018 Doina Precup McGill University / DeepMind Montreal


Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning

Guillaume Rabusseau, McGill University
Tianyu Li, McGill University
Doina Precup, McGill University / DeepMind Montreal, dprecup@cs.mcgill.ca

Abstract

In this paper, we unravel a fundamental connection between weighted finite automata (WFAs) and second-order recurrent neural networks (2-RNNs): in the case of sequences of discrete symbols, WFAs and 2-RNNs with linear activation functions are expressively equivalent. Motivated by this result, we build upon a recent extension of the spectral learning algorithm to vector-valued WFAs and propose the first provable learning algorithm for linear 2-RNNs defined over sequences of continuous input vectors. This algorithm relies on estimating low rank sub-blocks of the so-called Hankel tensor, from which the parameters of a linear 2-RNN can be provably recovered. The performance of the proposed method is assessed in a simulation study.

1 Introduction

Many tasks in natural language processing, computational biology, reinforcement learning, and time series analysis rely on learning with sequential data, i.e. estimating functions defined over sequences of observations from training data. Weighted finite automata (WFAs) and recurrent neural networks (RNNs) are two powerful and flexible classes of models which can efficiently represent such functions. On the one hand, WFAs are tractable, they encompass a wide range of machine learning models (they can for example compute any probability distribution defined by a hidden Markov model (HMM) [10] and can model the transition and observation behavior of partially observable Markov decision processes [33]) and they offer appealing theoretical guarantees. In particular, the so-called spectral methods for learning HMMs [19], WFAs [3, 4] and related models [6, 16] provide an alternative to EM based algorithms that is both computationally efficient and consistent.
On the other hand, RNNs are remarkably expressive models (they can represent any computable function [32]) and they have successfully tackled many practical problems in speech and audio recognition [17, 24, 13], but their theoretical analysis is difficult. Even though recent work provides interesting results on their expressive power [21, 37] as well as alternative training algorithms coming with learning guarantees [31], the theoretical understanding of RNNs is still limited.

In this work, we bridge a gap between these two classes of models by unraveling a fundamental connection between WFAs and second-order RNNs (2-RNNs): when considering input sequences of discrete symbols, 2-RNNs with linear activation functions and WFAs are one and the same, i.e. they are expressively equivalent and there exists a one-to-one mapping between the two classes (moreover, this mapping conserves model sizes). While connections between finite state machines (e.g. deterministic finite automata) and recurrent neural networks have been noticed and investigated in the past (see e.g. [14, 25]), to the best of our knowledge this is the first time that such a clear equivalence between linear 2-RNNs and weighted automata is explicitly formalized. This result naturally leads to the observation that linear 2-RNNs are a natural generalization of WFAs (which take sequences of discrete observations as inputs) to sequences of continuous vectors, and raises the question of whether the spectral learning algorithm for WFAs can be extended to linear 2-RNNs.

The second contribution of this paper is to show that the answer is in the positive: building upon the spectral learning algorithm for vector-valued WFAs introduced recently in [29], we propose the first provable learning algorithm for second-order RNNs with linear activation functions. Our learning algorithm relies on estimating sub-blocks of the so-called Hankel tensor, from which the parameters of a linear 2-RNN can be recovered using basic linear algebra operations. One of the key technical difficulties in designing this algorithm resides in estimating these sub-blocks from training data where the inputs are sequences of continuous vectors. We leverage multilinear properties of linear 2-RNNs and the fact that the Hankel sub-blocks can be reshaped into higher-order tensors of low tensor train rank (a result we believe is of independent interest) to perform this estimation efficiently using matrix sensing and tensor recovery techniques. As a proof of concept, we validate our theoretical findings in a simulation study on toy examples where we experimentally compare different recovery methods and investigate the robustness of our algorithm to noise and rank mis-specification.

Preprint. Work in progress.

Related work. Combining the spectral learning algorithm for WFAs with matrix completion techniques (a problem which is closely related to matrix sensing) has been theoretically investigated in [5].
The connections between tensors and RNNs have been previously leveraged to study the expressive power of RNNs in [21] and to achieve model compression in [37, 36, 34]. The relationship between RNNs and automata has been explored for interpretability purposes in [35], and the ability of RNNs to learn classes of formal languages has been investigated in [2]. The predictive state RNN model introduced in [11] is closely related to 2-RNNs and the authors propose to use the spectral learning algorithm for predictive state representations to initialize a gradient based algorithm; their approach however comes without theoretical guarantees. Lastly, a provable algorithm for RNNs relying on the tensor method of moments has been proposed in [31] but it is limited to first-order RNNs with quadratic activation functions (which do not encompass linear 2-RNNs).

The proofs of the results given in the paper can be found in the supplementary material.

2 Preliminaries

In this section, we first present basic notions of tensor algebra before introducing second-order recurrent neural networks, weighted finite automata and the spectral learning algorithm. We start by introducing some notation. For any integer $k$ we use $[k]$ to denote the set of integers from $1$ to $k$. We use $\lceil l \rceil$ to denote the smallest integer greater or equal to $l$. For any set $S$, we denote by $S^* = \bigcup_{k \in \mathbb{N}} S^k$ the set of all finite-length sequences of elements of $S$ (in particular, $\Sigma^*$ will denote the set of strings on a finite alphabet $\Sigma$). We use lower case bold letters for vectors (e.g. $v \in \mathbb{R}^{d_1}$), upper case bold letters for matrices (e.g. $M \in \mathbb{R}^{d_1 \times d_2}$) and bold calligraphic letters for higher order tensors (e.g. $\mathcal{T} \in \mathbb{R}^{d_1 \times d_2 \times d_3}$). We use $e_i$ to denote the $i$th canonical basis vector of $\mathbb{R}^d$ (where the dimension $d$ will always appear clearly from context). The $d \times d$ identity matrix will be written as $I_d$. The $i$th row (resp. column) of a matrix $M$ will be denoted by $M_{i,:}$ (resp. $M_{:,i}$). This notation is extended to slices of a tensor in the straightforward way.
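If $v_1 \in \mathbb{R}^{d_1}$ and $v_2 \in \mathbb{R}^{d_2}$, we use $v_1 \otimes v_2 \in \mathbb{R}^{d_1 d_2}$ to denote the Kronecker product between vectors, and its straightforward extension to matrices and tensors. Given a matrix $M \in \mathbb{R}^{d_1 \times d_2}$, we use $\mathrm{vec}(M) \in \mathbb{R}^{d_1 d_2}$ to denote the column vector obtained by concatenating the columns of $M$. The inverse of $M$ is denoted by $M^{-1}$, its Moore-Penrose pseudo-inverse by $M^\dagger$, and the transpose of its inverse by $M^{-\top}$; the Frobenius norm is denoted by $\|M\|_F$ and the nuclear norm by $\|M\|_*$.

Tensors. We first recall basic definitions of tensor algebra; more details can be found in [22]. A tensor $\mathcal{T} \in \mathbb{R}^{d_1 \times \cdots \times d_p}$ can simply be seen as a multidimensional array $(\mathcal{T}_{i_1,\cdots,i_p} : i_n \in [d_n], n \in [p])$. The mode-$n$ fibers of $\mathcal{T}$ are the vectors obtained by fixing all indices except the $n$th one, e.g. $\mathcal{T}_{:,i_2,\cdots,i_p} \in \mathbb{R}^{d_1}$. The $n$th mode matricization of $\mathcal{T}$ is the matrix having the mode-$n$ fibers of $\mathcal{T}$ for columns and is denoted by $\mathcal{T}_{(n)} \in \mathbb{R}^{d_n \times d_1 \cdots d_{n-1} d_{n+1} \cdots d_p}$. The vectorization of a tensor is defined by $\mathrm{vec}(\mathcal{T}) = \mathrm{vec}(\mathcal{T}_{(1)})$. In the following $\mathcal{T}$ always denotes a tensor of size $d_1 \times \cdots \times d_p$.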

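Figure 1: Tensor network representation of a rank $R$ tensor train decomposition (nodes represent tensors and an edge between two nodes represents a contraction between the corresponding modes of the two tensors).

The mode-$n$ matrix product of the tensor $\mathcal{T}$ and a matrix $X \in \mathbb{R}^{m \times d_n}$ is a tensor denoted by $\mathcal{T} \times_n X$. It is of size $d_1 \times \cdots \times d_{n-1} \times m \times d_{n+1} \times \cdots \times d_p$ and is defined by the relation $\mathcal{Y} = \mathcal{T} \times_n X \Leftrightarrow \mathcal{Y}_{(n)} = X \mathcal{T}_{(n)}$. The mode-$n$ vector product of the tensor $\mathcal{T}$ and a vector $v \in \mathbb{R}^{d_n}$ is a tensor defined by $\mathcal{T} \bullet_n v = \mathcal{T} \times_n v^\top \in \mathbb{R}^{d_1 \times \cdots \times d_{n-1} \times d_{n+1} \times \cdots \times d_p}$. It is easy to check that the $n$-mode product satisfies $(\mathcal{T} \times_n A) \times_n B = \mathcal{T} \times_n BA$ where we assume compatible dimensions of the tensor $\mathcal{T}$ and the matrices $A$ and $B$.

Given strictly positive integers $n_1, \cdots, n_k$ satisfying $\sum_i n_i = p$, we use the notation $(\mathcal{T})_{\langle\langle n_1, n_2, \cdots, n_k \rangle\rangle}$ to denote the $k$th order tensor obtained by reshaping $\mathcal{T}$ into a tensor of size
$$\Big(\prod_{i_1=1}^{n_1} d_{i_1}\Big) \times \Big(\prod_{i_2=1}^{n_2} d_{n_1+i_2}\Big) \times \cdots \times \Big(\prod_{i_k=1}^{n_k} d_{n_1+\cdots+n_{k-1}+i_k}\Big).$$
In particular we have $(\mathcal{T})_{\langle\langle p \rangle\rangle} = \mathrm{vec}(\mathcal{T})$ and $(\mathcal{T})_{\langle\langle 1, p-1 \rangle\rangle} = \mathcal{T}_{(1)}$.

A rank $R$ tensor train (TT) decomposition [26] of a tensor $\mathcal{T} \in \mathbb{R}^{d_1 \times \cdots \times d_p}$ consists in factorizing $\mathcal{T}$ into the product of $p$ core tensors $\mathcal{G}_1 \in \mathbb{R}^{d_1 \times R}$, $\mathcal{G}_2 \in \mathbb{R}^{R \times d_2 \times R}$, $\cdots$, $\mathcal{G}_{p-1} \in \mathbb{R}^{R \times d_{p-1} \times R}$, $\mathcal{G}_p \in \mathbb{R}^{R \times d_p}$, and is defined by
$$\mathcal{T}_{i_1,\cdots,i_p} = \sum_{r_1,\cdots,r_{p-1}=1}^{R} (\mathcal{G}_1)_{i_1,r_1} (\mathcal{G}_2)_{r_1,i_2,r_2} \cdots (\mathcal{G}_{p-1})_{r_{p-2},i_{p-1},r_{p-1}} (\mathcal{G}_p)_{r_{p-1},i_p}$$
for all indices $i_1 \in [d_1], \cdots, i_p \in [d_p]$; we will use the notation $\mathcal{T} = \langle\langle \mathcal{G}_1, \cdots, \mathcal{G}_p \rangle\rangle$ to denote such a decomposition. A tensor network representation of this decomposition is shown in Figure 1. While the problem of finding the best approximation of TT-rank $R$ of a given tensor is NP-hard [18], a quasi-optimal SVD based compression algorithm (TT-SVD) has been proposed in [26]. It is worth mentioning that the TT decomposition is invariant under change of basis in the sense that, for any invertible matrix $M$ and any core tensors $\mathcal{G}_1, \mathcal{G}_2, \cdots, \mathcal{G}_p$, we have
$$\langle\langle \mathcal{G}_1, \cdots, \mathcal{G}_p \rangle\rangle = \langle\langle \mathcal{G}_1 M^{-1}, \mathcal{G}_2 \times_1 M \times_3 M^{-\top}, \cdots, \mathcal{G}_{p-1} \times_1 M \times_3 M^{-\top}, \mathcal{G}_p \times_1 M \rangle\rangle.$$

Second-order RNNs.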
A second-order recurrent neural network (2-RNN) [15, 27, 23] with $n$ hidden units can be defined as a tuple $M = (h_0, \mathcal{A}, \Omega)$ where $h_0 \in \mathbb{R}^n$ is the initial state, $\mathcal{A} \in \mathbb{R}^{n \times d \times n}$ is the transition tensor, and $\Omega \in \mathbb{R}^{p \times n}$ is the output matrix, with $d$ and $p$ being the input and output dimensions respectively. A 2-RNN maps any sequence of inputs $x_1, \cdots, x_k \in \mathbb{R}^d$ to a sequence of outputs $y_1, \cdots, y_k \in \mathbb{R}^p$ defined by
$$y_t = z_2(\Omega h_t) \quad \text{with} \quad h_t = z_1(\mathcal{A} \bullet_1 h_{t-1} \bullet_2 x_t) \quad \text{for } t = 1, \cdots, k \qquad (1)$$
where $z_1 : \mathbb{R}^n \to \mathbb{R}^n$ and $z_2 : \mathbb{R}^p \to \mathbb{R}^p$ are activation functions. Alternatively, one can think of a 2-RNN as computing a function $f_M : (\mathbb{R}^d)^* \to \mathbb{R}^p$ mapping each input sequence $x_1, \cdots, x_k$ to the corresponding final output $y_k$. While $z_1$ and $z_2$ are usually non-linear component-wise functions, we consider in this paper the case where both $z_1$ and $z_2$ are the identity, and we refer to the resulting model as a linear 2-RNN. For a linear 2-RNN $M$, the function $f_M$ is multilinear in the sense that, for any integer $l$, its restriction to the domain $(\mathbb{R}^d)^l$ is multilinear. Another useful observation is that linear 2-RNNs are invariant under change of basis: for any invertible matrix $P$, the linear 2-RNN $\tilde{M} = (P^{-\top} h_0, \mathcal{A} \times_1 P \times_3 P^{-\top}, \Omega P^\top)$ is such that $f_{\tilde{M}} = f_M$. We will say that a linear 2-RNN $M$ with $n$ hidden units is minimal if its number of hidden units is minimal (i.e. any linear 2-RNN computing $f_M$ has at least $n$ hidden units).

1 Note that the specific ordering used to perform matricization, vectorization and such a reshaping is not relevant as long as it is consistent across all operations.
2 The classical definition of the TT-decomposition allows the rank $R$ to be different for each mode, but this definition is sufficient for the purpose of this paper.
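The recursion of Eq. (1) in the linear case can be sketched in a few lines of NumPy. This is only an illustration: the function name `linear_2rnn` and the index convention `A[i, k, j]` (previous state, input, next state) are assumptions made for this example, not notation from the paper.

```python
import numpy as np

def linear_2rnn(h0, A, Omega, xs):
    """Final output of a linear 2-RNN (h0, A, Omega) on the input sequence xs.

    h0: (n,) initial state; A: (n, d, n) transition tensor;
    Omega: (p, n) output matrix; xs: iterable of (d,) input vectors.
    """
    h = np.asarray(h0, dtype=float)
    for x in xs:
        # Contract mode 1 with the previous state and mode 2 with the input:
        # h_t[j] = sum_{i,k} A[i, k, j] * h_{t-1}[i] * x_t[k]
        h = np.einsum('ikj,i,k->j', A, h, x)
    return Omega @ h
```

Because each step is bilinear in the previous state and the current input, the output is linear in each input vector separately, which is the multilinearity property used throughout the paper.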

Weighted automata and spectral learning. Vector-valued weighted finite automata (vv-WFAs) have been introduced in [29] as a natural generalization of weighted automata from scalar-valued functions to vector-valued ones. A $p$-dimensional vv-WFA with $n$ states is a tuple $A = (\alpha, \{A^\sigma\}_{\sigma \in \Sigma}, \Omega)$ where $\alpha \in \mathbb{R}^n$ is the initial weights vector, $\Omega \in \mathbb{R}^{p \times n}$ is the matrix of final weights, and $A^\sigma \in \mathbb{R}^{n \times n}$ is the transition matrix for each symbol $\sigma$ in a finite alphabet $\Sigma$. A vv-WFA $A$ computes a function $f_A : \Sigma^* \to \mathbb{R}^p$ defined by
$$f_A(x) = \Omega (A^{x_1} A^{x_2} \cdots A^{x_k})^\top \alpha$$
for each word $x = x_1 x_2 \cdots x_k \in \Sigma^*$. We call a vv-WFA minimal if its number of states is minimal. Given a function $f : \Sigma^* \to \mathbb{R}^p$ we denote by $\mathrm{rank}(f)$ the number of states of a minimal vv-WFA computing $f$ (which is set to $\infty$ if $f$ cannot be computed by a vv-WFA).

The spectral learning algorithm for vv-WFAs relies on the following fundamental theorem relating the rank of a function $f : \Sigma^* \to \mathbb{R}^p$ to its Hankel tensor $\mathcal{H} \in \mathbb{R}^{\Sigma^* \times \Sigma^* \times p}$, which is defined by $\mathcal{H}_{u,v,:} = f(uv)$ for all $u, v \in \Sigma^*$.

Theorem 1 ([29]). Let $f : \Sigma^* \to \mathbb{R}^p$ and let $\mathcal{H}$ be its Hankel tensor. Then $\mathrm{rank}(f) = \mathrm{rank}(\mathcal{H}_{(1)})$.

The vv-WFA learning algorithm leverages the fact that the proof of this theorem is constructive: one can recover a vv-WFA computing $f$ from any low rank factorization of $\mathcal{H}_{(1)}$. In practice, a finite sub-block $\mathcal{H}_{\mathcal{P},\mathcal{S}} \in \mathbb{R}^{\mathcal{P} \times \mathcal{S} \times p}$ of the Hankel tensor is used to recover the vv-WFA, where $\mathcal{P}, \mathcal{S} \subset \Sigma^*$ are finite sets of prefixes and suffixes forming a complete basis for $f$, i.e. such that $\mathrm{rank}((\mathcal{H}_{\mathcal{P},\mathcal{S}})_{(1)}) = \mathrm{rank}(\mathcal{H}_{(1)})$. More details can be found in [29].

3 A Fundamental Relation between WFAs and Linear 2-RNNs

We start by unraveling a fundamental connection between vv-WFAs and linear 2-RNNs: vv-WFAs and linear 2-RNNs are expressively equivalent for representing functions defined over sequences of discrete symbols. More formally, we have the following theorem.

Theorem 2. Any function that can be computed by a vv-WFA with $n$ states can be computed by a linear 2-RNN with $n$ hidden units. Conversely, any function that can be computed by a linear 2-RNN with $n$ hidden units on sequences of one-hot vectors (i.e. canonical basis vectors) can be computed by a WFA with $n$ states. More precisely, the WFA $A = (\alpha, \{A^\sigma\}_{\sigma \in \Sigma}, \Omega)$ with $n$ states and the linear 2-RNN $M = (\alpha, \mathcal{A}, \Omega)$ with $n$ hidden units, where $\mathcal{A} \in \mathbb{R}^{n \times \Sigma \times n}$ is defined by $\mathcal{A}_{:,\sigma,:} = A^\sigma$ for all $\sigma \in \Sigma$, are such that $f_A(\sigma_1 \sigma_2 \cdots \sigma_k) = f_M(x_1, x_2, \cdots, x_k)$ for all sequences of input symbols $\sigma_1, \cdots, \sigma_k \in \Sigma$, where for each $i \in [k]$ the input vector $x_i \in \mathbb{R}^\Sigma$ is the one-hot encoding of the symbol $\sigma_i$.

This result first implies that linear 2-RNNs defined over sequences of discrete symbols (using one-hot encoding) can be provably learned using the spectral learning algorithm for WFAs/vv-WFAs; indeed, these algorithms have been proved to return consistent estimators. Furthermore, Theorem 2 reveals that linear 2-RNNs are a natural generalization of classical weighted automata to functions defined over sequences of continuous vectors instead of discrete symbols. This spontaneously raises the question of whether the spectral learning algorithms for WFAs and vv-WFAs can be extended to the general setting of linear 2-RNNs; we show that the answer is in the positive in the next section.

4 Spectral Learning of Linear 2-RNNs

In this section, we extend the learning algorithm for vv-WFAs to linear 2-RNNs, thus providing the first provable learning algorithm for linear second-order RNNs.

4.1 Recovering 2-RNNs from Hankel Tensors

We first present an identifiability result showing how one can recover a linear 2-RNN computing a function $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$ from observable tensors extracted from some Hankel tensor associated with $f$. Intuitively, we obtain this result by reducing the problem to the one of learning a vv-WFA. This is done by considering the restriction of $f$ to canonical basis vectors; loosely speaking, since the domain of this restricted function is isomorphic to $[d]^*$, this allows us to fall back onto the setting of sequences of discrete symbols.
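The correspondence of Theorem 2 between a vv-WFA and a linear 2-RNN on one-hot inputs can be illustrated numerically. The sketch below is a minimal example under assumed conventions: symbols are encoded as integer indices, and the helper names (`wfa_output`, `wfa_to_linear_2rnn`, `linear_2rnn`) are hypothetical.

```python
import numpy as np

def wfa_output(alpha, transitions, Omega, word):
    """vv-WFA value Omega (A^{s_1} ... A^{s_k})^T alpha, word given as symbol indices."""
    prod = np.eye(len(alpha))
    for s in word:
        prod = prod @ transitions[s]
    return Omega @ prod.T @ alpha

def wfa_to_linear_2rnn(alpha, transitions, Omega):
    """Stack the transition matrices so that A[:, s, :] equals A^s."""
    return alpha, np.stack(transitions, axis=1), Omega

def linear_2rnn(h0, A, Omega, xs):
    """Final output of a linear 2-RNN with h_t[j] = sum_{i,k} A[i,k,j] h_{t-1}[i] x_t[k]."""
    h = np.asarray(h0, dtype=float)
    for x in xs:
        h = np.einsum('ikj,i,k->j', A, h, x)
    return Omega @ h

def one_hot(word, alphabet_size):
    """Rows are the canonical basis vectors encoding the symbols of the word."""
    return np.eye(alphabet_size)[list(word)]
```

With a one-hot input $e_\sigma$, the contraction over the input mode selects the slice $\mathcal{A}_{:,\sigma,:} = A^\sigma$, so both models apply the same sequence of transition matrices to the initial vector.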

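Figure 2: Tensor network representation of the TT decomposition of the Hankel tensor $\mathcal{H}^{(4)}_f$ induced by a linear 2-RNN $(\alpha, \mathcal{A}, \Omega)$.

Given a function $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$, we define its Hankel tensor $\mathcal{H}_f \in \mathbb{R}^{[d]^* \times [d]^* \times p}$ by
$$(\mathcal{H}_f)_{i_1 \cdots i_s,\, j_1 \cdots j_t,\, :} = f(e_{i_1}, \cdots, e_{i_s}, e_{j_1}, \cdots, e_{j_t})$$
for all $i_1, \cdots, i_s, j_1, \cdots, j_t \in [d]$, which is infinite in two of its modes. It is easy to see that $\mathcal{H}_f$ is also the Hankel tensor associated with the function $\tilde{f} : [d]^* \to \mathbb{R}^p$ mapping any sequence $i_1 i_2 \cdots i_k \in [d]^*$ to $f(e_{i_1}, \cdots, e_{i_k})$. Moreover, in the special case where $f$ can be computed by a linear 2-RNN, one can use the multilinearity of $f$ to show that
$$f(x_1, \cdots, x_k) = \sum_{i_1, \cdots, i_k = 1}^{d} (x_1)_{i_1} \cdots (x_k)_{i_k}\, \tilde{f}(i_1 \cdots i_k),$$
making it intuitively clear that one can learn $f$ by learning a vv-WFA computing $\tilde{f}$ using the spectral learning algorithm. That is, given a large enough sub-block $\mathcal{H}_{\mathcal{P},\mathcal{S}} \in \mathbb{R}^{\mathcal{P} \times \mathcal{S} \times p}$ of $\mathcal{H}_f$ for some prefix and suffix sets $\mathcal{P}, \mathcal{S} \subseteq [d]^*$, one should be able to recover a vv-WFA computing $\tilde{f}$ and consequently a linear 2-RNN computing $f$ using Theorem 2.

For the sake of clarity, we present the learning algorithm for the particular case where there exists an $L$ such that the prefix and suffix sets consisting of all sequences of length $L$, i.e. $\mathcal{P} = \mathcal{S} = [d]^L$, form a complete basis for $\tilde{f}$. This assumption allows us to present in a simpler way all the key elements of the algorithm; the technical details needed to lift this assumption are given in the supplementary material.

For any integer $l$, we define the finite tensor $\mathcal{H}^{(l)}_f \in \mathbb{R}^{d \times \cdots \times d \times p}$ of order $l + 1$ by
$$(\mathcal{H}^{(l)}_f)_{i_1, \cdots, i_l, :} = f(e_{i_1}, \cdots, e_{i_l}) \quad \text{for all } i_1, \cdots, i_l \in [d].$$
Observe that for any integer $l$, the tensor $\mathcal{H}^{(l)}_f$ can be obtained by reshaping a finite sub-block of the Hankel tensor $\mathcal{H}_f$. When $f$ is computed by a linear 2-RNN, we have the useful property that, for any integer $l$,
$$f(x_1, \cdots, x_l) = \mathcal{H}^{(l)}_f \bullet_1 x_1 \bullet_2 \cdots \bullet_l x_l \qquad (2)$$
for any sequence of inputs $x_1, \cdots, x_l \in \mathbb{R}^d$ (which can easily be shown using the multilinearity of $f$). Before formalizing the identifiability result for linear 2-RNNs in Theorem 3 below, we remark that the tensors $\mathcal{H}^{(l)}_f$ are of low tensor train rank.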
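Indeed, for any $l$, it is easy to check that
$$\mathcal{H}^{(l)}_f = \langle\langle \mathcal{A} \bullet_1 \alpha, \underbrace{\mathcal{A}, \cdots, \mathcal{A}}_{l-1 \text{ times}}, \Omega^\top \rangle\rangle$$
(the tensor network representation of this decomposition is shown in Figure 2). This property will be particularly relevant to the learning algorithm we design in the following section, but it is also a fundamental relation that deserves some attention on its own: it implies in particular that, beyond the classical relation between the rank of the Hankel matrix $H_f$ and the number of states of a minimal WFA computing $f$, the Hankel matrix possesses a deeper structure intrinsically connecting weighted automata to the tensor train decomposition. While this relation has been noticed previously (see e.g. [8, 9, 28]), we believe that it could be successfully leveraged to design more efficient learning algorithms (both in terms of sample and computational complexity) for WFAs and related models.

Theorem 3. Let $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$ be a function computed by a minimal linear 2-RNN with $n$ hidden units and let $L$ be an integer such that $\mathrm{rank}((\mathcal{H}^{(2L)}_f)_{\langle\langle L, L+1 \rangle\rangle}) = n$. Then, for any $P \in \mathbb{R}^{d^L \times n}$ and $S \in \mathbb{R}^{n \times d^L p}$ such that $(\mathcal{H}^{(2L)}_f)_{\langle\langle L, L+1 \rangle\rangle} = PS$, the linear 2-RNN $M = (\alpha, \mathcal{A}, \Omega)$ defined by
$$\alpha = (S^\dagger)^\top (\mathcal{H}^{(L)}_f)_{\langle\langle L+1 \rangle\rangle}, \quad \mathcal{A} = (\mathcal{H}^{(2L+1)}_f)_{\langle\langle L, 1, L+1 \rangle\rangle} \times_1 P^\dagger \times_3 (S^\dagger)^\top, \quad \Omega^\top = P^\dagger (\mathcal{H}^{(L)}_f)_{\langle\langle L, 1 \rangle\rangle}$$
is a minimal linear 2-RNN computing $f$.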
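The recovery formulas of Theorem 3 can be tried out on exact Hankel tensors. The following NumPy sketch is an illustration under stated assumptions: the names `hankel` and `spectral_recovery` are hypothetical, the factorization $PS$ is obtained from a truncated SVD (any rank $n$ factorization would do), and all reshapes use NumPy's row-major ordering consistently.

```python
import numpy as np

def hankel(alpha, A, Omega, length):
    """H^(length)[i_1, ..., i_length, :] = f(e_{i_1}, ..., e_{i_length}).

    Built by chaining the cores (A . alpha, A, ..., A, Omega^T) of the TT decomposition.
    """
    states = alpha  # shape (n,)
    for _ in range(length):
        # Append one input mode: states[..., k, j] = sum_i states[..., i] * A[i, k, j]
        states = np.einsum('...i,ikj->...kj', states, A)
    return np.einsum('...i,pi->...p', states, Omega)

def spectral_recovery(H_L, H_2L, H_2L1, L, n):
    """Recover a linear 2-RNN (alpha, A, Omega) from H^(L), H^(2L) and H^(2L+1)."""
    d, p = H_L.shape[0], H_L.shape[-1]
    # Rank-n factorization P S of the (d^L) x (d^L p) reshaping of H^(2L).
    U, s, Vt = np.linalg.svd(H_2L.reshape(d**L, d**L * p), full_matrices=False)
    P, S = U[:, :n] * s[:n], Vt[:n]
    P_pinv, S_pinv = np.linalg.pinv(P), np.linalg.pinv(S)
    alpha = S_pinv.T @ H_L.reshape(-1)
    # Mode-1 product with P^+ and mode-3 product with (S^+)^T.
    A = np.einsum('iu,ukc,cj->ikj', P_pinv,
                  H_2L1.reshape(d**L, d, d**L * p), S_pinv)
    Omega = (P_pinv @ H_L.reshape(d**L, p)).T
    return alpha, A, Omega
```

The recovered parameters generally differ from the original ones by a change of basis, so the comparison should be made on the computed function (for instance on Hankel tensors of other lengths) rather than on the parameters themselves.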

First observe that such an integer $L$ exists under the assumption that $\mathcal{P} = \mathcal{S} = [d]^L$ forms a complete basis for $\tilde{f}$. It is also worth mentioning that a necessary condition for $\mathrm{rank}((\mathcal{H}^{(2L)}_f)_{\langle\langle L, L+1 \rangle\rangle}) = n$ is that $d^L \geq n$, i.e. $L$ must be of the order $\log_d(n)$.

4.2 Hankel Tensors Recovery from Linear Measurements

We showed in the previous section that, given the Hankel tensors $\mathcal{H}^{(L)}_f$, $\mathcal{H}^{(2L)}_f$ and $\mathcal{H}^{(2L+1)}_f$, one can recover a linear 2-RNN computing $f$ if it exists. This readily implies that the class of functions that can be computed by linear 2-RNNs is learnable in Angluin's exact learning model [1] where one has access to an oracle that can answer membership queries (e.g. what is the value computed by the target $f$ on $(x_1, \cdots, x_k)$?) and equivalence queries (e.g. is my current hypothesis $h$ equal to the target $f$?). While this fundamental result is of significant theoretical interest, assuming access to such an oracle is unrealistic. In this section, we show that a stronger learnability result can be obtained in a more realistic setting, where we only assume access to randomly generated input/output examples $((x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l), y^{(n)}) \in (\mathbb{R}^d)^l \times \mathbb{R}^p$ where $y^{(n)} = f(x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l)$.

The key observation is that such an input/output example can be seen as a linear measurement of the Hankel tensor $\mathcal{H}^{(l)}_f$. Indeed, we have
$$y^{(n)} = f(x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l) = \mathcal{H}^{(l)}_f \bullet_1 x^{(n)}_1 \bullet_2 \cdots \bullet_l x^{(n)}_l = (\mathcal{H}^{(l)}_f)^\top_{\langle\langle l, 1 \rangle\rangle}\, x^{(n)}$$
where $x^{(n)} = x^{(n)}_1 \otimes \cdots \otimes x^{(n)}_l \in \mathbb{R}^{d^l}$. Hence, by regrouping $N$ output examples $y^{(n)}$ into the matrix $Y \in \mathbb{R}^{N \times p}$ and the corresponding input vectors $x^{(n)}$ into the matrix $X \in \mathbb{R}^{N \times d^l}$, one can recover $\mathcal{H}^{(l)}_f$ by solving the linear system $Y = X (\mathcal{H}^{(l)}_f)_{\langle\langle l, 1 \rangle\rangle}$, which has a unique solution whenever $X$ is of full column rank. This naturally leads to the following theorem, whose proof relies on the fact that the matrix $X$ will be of full column rank whenever $N \geq d^l$ and the components of each $x^{(n)}_i$ for $i \in [l]$, $n \in [N]$ are drawn independently from a continuous distribution over $\mathbb{R}^d$ (w.r.t. the Lebesgue measure).

Theorem 4. Let $(h_0, \mathcal{A}, \Omega)$ be a minimal linear 2-RNN with $n$ hidden units computing a function $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$, and let $L$ be an integer such that $\mathrm{rank}((\mathcal{H}^{(2L)}_f)_{\langle\langle L, L+1 \rangle\rangle}) = n$. Suppose we have access to datasets
$$D_l = \{((x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l), y^{(n)})\}_{n=1}^{N_l} \subset (\mathbb{R}^d)^l \times \mathbb{R}^p \quad \text{for } l \in \{L, 2L, 2L+1\}$$
where the entries of each $x^{(n)}_i$ are drawn independently from the standard normal distribution and where each $y^{(n)} = f(x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l)$. Then, whenever $N_l \geq d^l$ for each $l \in \{L, 2L, 2L+1\}$, the linear 2-RNN $M$ returned by Algorithm 1 with the least-squares method satisfies $f_M = f$ with probability one.

A few remarks on this theorem are in order. The first observation is that the datasets $D_L$, $D_{2L}$ and $D_{2L+1}$ can either be drawn independently or not (e.g. the sequences in $D_L$ can be prefixes of the sequences in $D_{2L}$ but it is not necessary). In particular, the result still holds when the datasets $D_l$ are constructed from a unique dataset
$$S = \{((x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_T), (y^{(n)}_1, y^{(n)}_2, \cdots, y^{(n)}_T))\}_{n=1}^{N}$$
of input/output sequences with $T \geq 2L + 1$, where $y^{(n)}_t = f(x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_t)$ for any $t \in [T]$. Observe that having access to such input/output training sequences is not an unrealistic assumption: for example when training RNNs for language modeling the output $y_t$ is the next symbol conditional probability vector, and for classification tasks the output is the one-hot encoded label for all time steps. Lastly, when the outputs $y^{(n)}$ are noisy, one can solve the least-squares problem $\min \|Y - X (\mathcal{H}^{(l)})_{\langle\langle l, 1 \rangle\rangle}\|_F^2$ to approximate the Hankel tensors; we will empirically evaluate this approach in Section 5 and we defer its theoretical analysis in the noisy setting to future work.

While the least-squares method is sufficient to obtain the theoretical guarantees of Theorem 4, it does not leverage the low rank structure of the Hankel tensors $\mathcal{H}^{(l)}_f$.
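The least-squares recovery of a Hankel tensor from input/output examples can be sketched as follows. This is a minimal illustration in the noiseless setting; the function names (`kron_rows`, `recover_hankel_ls`, `linear_2rnn`) and the array layout of the dataset (an array of shape `(N, l, d)`) are assumptions made for the example.

```python
import numpy as np

def linear_2rnn(h0, A, Omega, xs):
    """Final output of a linear 2-RNN with h_t[j] = sum_{i,k} A[i,k,j] h_{t-1}[i] x_t[k]."""
    h = np.asarray(h0, dtype=float)
    for x in xs:
        h = np.einsum('ikj,i,k->j', A, h, x)
    return Omega @ h

def kron_rows(X_seq):
    """Map sequences of shape (N, l, d) to rows x_1 (x) x_2 (x) ... (x) x_l, shape (N, d**l)."""
    N, l, _ = X_seq.shape
    rows = np.ones((N, 1))
    for t in range(l):
        # Row-wise Kronecker product with the t-th input of every sequence.
        rows = np.einsum('na,nb->nab', rows, X_seq[:, t, :]).reshape(N, -1)
    return rows

def recover_hankel_ls(X_seq, Y):
    """Solve Y = X (H)_<<l,1>> in the least-squares sense and reshape to (d, ..., d, p)."""
    _, l, d = X_seq.shape
    X = kron_rows(X_seq)
    H_flat, *_ = np.linalg.lstsq(X, Y, rcond=None)  # shape (d**l, p)
    return H_flat.reshape((d,) * l + (Y.shape[1],))
```

With at least $d^l$ examples drawn from a continuous distribution, the system is determined and the entries of the recovered tensor are the values of the target on sequences of canonical basis vectors.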
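We now propose three alternative recovery methods to leverage this structure, whose efficiency will be assessed in a simulation study in Section 5 (deriving improved sample complexity guarantees using these methods is left for future work). In the noiseless setting, we first propose to replace solving the linear system $Y = X (\mathcal{H}^{(l)})_{\langle\langle l, 1 \rangle\rangle}$ with a nuclear norm minimization problem (see line 6 of Algorithm 1), thus leveraging the fact that $(\mathcal{H}^{(l)})_{\langle\langle \lceil l/2 \rceil,\, l - \lceil l/2 \rceil + 1 \rangle\rangle}$ is potentially of low matrix rank. We also propose to use iterative hard thresholding (IHT) [20] and its tensor counterpart TIHT [30], which are based on the classical projected gradient descent algorithm and have been shown to be robust to noise in practice. These two methods are implemented in lines 8-12 of Algorithm 1. There, the project method either projects $(\mathcal{H}^{(l)})_{\langle\langle \lceil l/2 \rceil,\, l - \lceil l/2 \rceil + 1 \rangle\rangle}$ onto the manifold of low rank matrices using SVD (IHT) or projects $\mathcal{H}^{(l)}$ onto the manifold of tensors with TT-rank $R$ (TIHT).

Algorithm 1 2RNN-SL: Spectral Learning of linear 2-RNNs
Input: Three training datasets $D_L$, $D_{2L}$, $D_{2L+1}$ with input sequences of length $L$, $2L$ and $2L+1$ respectively, a recovery_method, rank $R$ and learning rate $\gamma$ (for IHT/TIHT).
 1: for $l \in \{L, 2L, 2L+1\}$ do
 2:   From the dataset $D_l = \{((x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l), y^{(n)})\}_{n=1}^{N_l} \subset (\mathbb{R}^d)^l \times \mathbb{R}^p$, build $X \in \mathbb{R}^{N_l \times d^l}$ with rows $x^{(n)}_1 \otimes x^{(n)}_2 \otimes \cdots \otimes x^{(n)}_l$ for $n \in [N_l]$ and $Y \in \mathbb{R}^{N_l \times p}$ with rows $y^{(n)}$ for $n \in [N_l]$.
 3:   if recovery_method = "Least-Squares" then
 4:     $\mathcal{H}^{(l)} = \arg\min_{\mathcal{T} \in \mathbb{R}^{d \times \cdots \times d \times p}} \|X (\mathcal{T})_{\langle\langle l, 1 \rangle\rangle} - Y\|_F^2$.
 5:   else if recovery_method = "Nuclear Norm Minimization" then
 6:     $\mathcal{H}^{(l)} = \arg\min_{\mathcal{T} \in \mathbb{R}^{d \times \cdots \times d \times p}} \|(\mathcal{T})_{\langle\langle \lceil l/2 \rceil,\, l - \lceil l/2 \rceil + 1 \rangle\rangle}\|_*$ subject to $X (\mathcal{T})_{\langle\langle l, 1 \rangle\rangle} = Y$.
 7:   else if recovery_method = "IHT" or "TIHT" then
 8:     Initialize $\mathcal{H}^{(l)} \in \mathbb{R}^{d \times \cdots \times d \times p}$ to $0$.
 9:     repeat
10:       $(\mathcal{H}^{(l)})_{\langle\langle l, 1 \rangle\rangle} = (\mathcal{H}^{(l)})_{\langle\langle l, 1 \rangle\rangle} + \gamma X^\top (Y - X (\mathcal{H}^{(l)})_{\langle\langle l, 1 \rangle\rangle})$
11:       $\mathcal{H}^{(l)} = \mathrm{project}(\mathcal{H}^{(l)}, R)$ (using either SVD for IHT or TT-SVD for TIHT)
12:     until convergence
13:   end if
14: end for
15: Let $(\mathcal{H}^{(2L)})_{\langle\langle L, L+1 \rangle\rangle} = PS$ be a rank $R$ factorization.
16: Return the linear 2-RNN $(h_0, \mathcal{A}, \Omega)$ where $h_0 = (S^\dagger)^\top (\mathcal{H}^{(L)})_{\langle\langle L+1 \rangle\rangle}$, $\mathcal{A} = (\mathcal{H}^{(2L+1)})_{\langle\langle L, 1, L+1 \rangle\rangle} \times_1 P^\dagger \times_3 (S^\dagger)^\top$, $\Omega^\top = P^\dagger (\mathcal{H}^{(L)})_{\langle\langle L, 1 \rangle\rangle}$.

3 Note that the theorem can be adapted if such an integer $L$ does not exist (see supplementary material).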
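The gradient step and the SVD projection of lines 8-12 of Algorithm 1 can be sketched for the matrix case (IHT) as follows. This is a simplified illustration and not the TIHT variant: the function names, the fixed number of iterations used in place of a convergence test, and the `mat_shape` argument (the shape in which the unknown is assumed to be of low rank) are choices made for this example.

```python
import numpy as np

def project_rank(M, R):
    """Best rank-R approximation of the matrix M, obtained by truncating its SVD."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U[:, :R] * s[:R]) @ Vt[:R]

def iht(X, Y, mat_shape, R, gamma, n_iter=200):
    """Iterative hard thresholding for Y = X @ H, with H of rank R once reshaped.

    X: (N, D) measurement matrix; Y: (N, p) outputs;
    mat_shape: (rows, cols) with rows * cols == D * p.
    Returns the estimate of H with shape (D, p).
    """
    D, p = X.shape[1], Y.shape[1]
    H = np.zeros((D, p))
    for _ in range(n_iter):
        # Gradient step on the least-squares objective (line 10).
        H = H + gamma * X.T @ (Y - X @ H)
        # Projection onto matrices of rank at most R (line 11, SVD case).
        H = project_rank(H.reshape(mat_shape), R).reshape(D, p)
    return H
```

A step size of the order of $1/N$ is a natural starting point when the rows of `X` have roughly identity covariance, since $X^\top X / N$ is then close to the identity; in general $\gamma$ has to be tuned.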
5 Experiments

In this section, we perform experiments on synthetic data to compare how the choice of the recovery method (LeastSquares, NuclearNorm, IHT and TIHT) affects the sample efficiency of the learning algorithm (see footnote 4), and to assess the robustness of 2RNN-SL to noise and the robustness of the IHT and TIHT recovery methods to rank mis-specification.

We perform two experiments. In the first one, we randomly generated a linear 2-RNN with 5 units computing a function $f$ by drawing the entries of all parameters $(h_0, \mathcal{A}, \Omega)$ independently from a zero-mean normal distribution. The training data consists of independently drawn sets $D_l = \{((x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l), y^{(n)})\}_{n=1}^{N_l} \subset (\mathbb{R}^d)^l \times \mathbb{R}^p$ for $l \in \{L, 2L, 2L+1\}$ with $L = 2$, where each $x^{(n)}_i \sim \mathcal{N}(0, I)$ and where the outputs can be noisy, i.e. $y^{(n)} = f(x^{(n)}_1, x^{(n)}_2, \cdots, x^{(n)}_l) + \xi^{(n)}$ where $\xi^{(n)} \sim \mathcal{N}(0, \sigma^2)$ for some noise variance $\sigma^2$.

In the second experiment, the goal is to learn a simple arithmetic function that computes the sum of the running differences between the two components of a sequence of 2-dimensional vectors, i.e. $f(x_1, \cdots, x_k) = \sum_{i=1}^{k} v^\top x_i$ where $v^\top = (-1 \;\; 1)$. The training datasets are generated using the same process as above and a constant entry equal to one is added to all the input vectors to encode a bias term (one can check that the resulting function can be computed by a 2-RNN with 2 hidden units).

4 The NuclearNorm method was only used in the noiseless setting (though it could be extended to the noisy recovery setting in practice).
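One possible linear 2-RNN with 2 hidden units computing the arithmetic function of the second experiment is sketched below. This is a hand-built construction for illustration, not one taken from the paper: it assumes $v^\top = (-1 \;\; 1)$, inputs augmented as $(x_{i,1}, x_{i,2}, 1)$, and a hidden state holding the constant 1 together with the running sum. All function names are hypothetical.

```python
import numpy as np

def linear_2rnn(h0, A, Omega, xs):
    """Final output of a linear 2-RNN with h_t[j] = sum_{i,k} A[i,k,j] h_{t-1}[i] x_t[k]."""
    h = np.asarray(h0, dtype=float)
    for x in xs:
        h = np.einsum('ikj,i,k->j', A, h, x)
    return Omega @ h

def add_bias(xs):
    """Append a constant entry equal to one to every input vector."""
    xs = np.asarray(xs, dtype=float)
    return np.hstack([xs, np.ones((len(xs), 1))])

def running_difference_rnn():
    """Linear 2-RNN with hidden state h = (1, running sum of x[1] - x[0])."""
    A = np.zeros((2, 3, 2))
    A[0, 2, 0] = 1.0    # constant unit times bias input keeps the constant at 1
    A[1, 2, 1] = 1.0    # running sum times bias input carries the sum forward
    A[0, 0, 1] = -1.0   # constant unit times first component subtracts x[0]
    A[0, 1, 1] = 1.0    # constant unit times second component adds x[1]
    h0 = np.array([1.0, 0.0])
    Omega = np.array([[0.0, 1.0]])  # read out the running sum
    return h0, A, Omega
```

The bias input is what allows a purely multiplicative (second-order) update to implement an additive recursion: multiplying the state by the constant input copies it forward, while multiplying the constant hidden unit by the other inputs adds the new term.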

Figure 3: Average MSE as a function of the training set size for the first experiment (learning a random linear 2-RNN) for different values of output noise. Curves: TIHT and IHT (R = 5 and R = 6), LeastSquares, NuclearNorm.

Figure 4: Average MSE as a function of the training set size for the second experiment (learning a simple arithmetic function) for different values of output noise. Curves: TIHT and IHT (two values of the rank parameter, including R = 4), LeastSquares, NuclearNorm.

We run the experiments for different sizes of datasets $N$ ranging from 0 to 5,000 (we set $N_L = N_{2L} = N_{2L+1} = N$) and we compare the different methods in terms of mean squared error (MSE, see footnote 5) on a test set of size 5,000 containing sequences of length 6 generated in the same way as the training data (note that the training data only contains sequences of lengths up to 5). The results are reported in Figures 3 and 4, where we see that all recovery methods lead to consistent estimates of the target function given enough training data. This is the case even in the presence of noise (in which case more samples are needed to achieve the same accuracy, as expected). We can also see that IHT and TIHT are overall more sample efficient than the other methods (especially with noisy data), showing that taking the low rank structure of the Hankel tensors into account is profitable. Moreover, TIHT tends to perform better than its matrix counterpart, confirming our intuition that leveraging the tensor train structure is beneficial. Lastly, as one can expect, when the rank parameter $R$ is over-estimated IHT and TIHT still converge to the target but they need more samples (when the rank parameter was underestimated both algorithms did not learn at all). The running times of all methods were overall of the same order.

6 Conclusion and Future Directions

In this paper, we proposed the first provable learning algorithm for second-order RNNs with linear activation functions.
The elaboration of this meaningful learnability result relied on showing that linear 2-RNNs are a natural extension of vv-WFAs, and on extending the vv-WFA spectral learning algorithm to linear 2-RNNs. We believe that the results presented in this paper open a number of exciting and promising research directions on both the theoretical and practical perspectives. We first plan to use the spectral learning estimate as a starting point for gradient based methods to train 2-RNNs. More precisely, since linear 2-RNNs can be thought of as 2-RNNs using LeakyReLU activation functions with negative slope 1, one can use a linear 2-RNN as initialization before gradually reducing the negative slope parameter during training. The extension of spectral learning to linear 2-RNNs also opens the door to scaling up the classical spectral method to problems with large discrete alphabets (which is a known caveat of the spectral algorithm for WFAs) since it allows one to use low dimensional embeddings of large vocabularies (using e.g. word2vec or latent semantic analysis). From the theoretical perspective, we plan on deriving learning guarantees for linear 2-RNNs in the noisy setting (e.g. using the PAC learnability framework). Even though it is intuitive that such guarantees should hold (given the continuity of all operations used in our algorithm), we believe that such an analysis may entail results of independent interest. In particular, analogously to the matrix case studied in [7], obtaining rate optimal convergence rates for the recovery of the low TT-rank Hankel tensors from rank one measurements is an interesting direction; such a result could for example allow us to improve the generalization bounds provided in [5] for spectral learning of general WFAs.

5 Using the IHT/TIHT methods, it seldom happens on small datasets that 2RNN-SL returns aberrant models (due to numerical instabilities). We used the following scheme to circumvent this issue: when the training MSE of the hypothesis was greater than the one of the zero function, the zero function was returned instead.

References

[1] Dana Angluin. Queries and concept learning. Machine Learning, 2(4):319-342, 1988.
[2] Enes Avcu, Chihiro Shibata, and Jeffrey Heinz. Subregular complexity and deep learning. arXiv preprint, 2017.
[3] Raphaël Bailly, François Denis, and Liva Ralaivola. Grammatical inference as a principal component analysis problem. In ICML, pages 33-40, 2009.
[4] Borja Balle, Xavier Carreras, Franco M Luque, and Ariadna Quattoni. Spectral learning of weighted automata. Machine Learning, 96(1-2):33-63, 2014.
[5] Borja Balle and Mehryar Mohri. Spectral learning of general weighted automata via constrained matrix completion. In NIPS, pages 2159-2167, 2012.
[6] Byron Boots, Sajid M. Siddiqi, and Geoffrey J. Gordon. Closing the learning-planning loop with predictive state representations. International Journal of Robotics Research, 30(7), 2011.
[7] T Tony Cai, Anru Zhang, et al. ROP: Matrix recovery via rank-one projections. The Annals of Statistics, 43(1):102-138, 2015.
[8] Andrew Critch. Algebraic geometry of hidden Markov and related models. PhD thesis, University of California, Berkeley, 2013.
[9] Andrew Critch and Jason Morton. Algebraic geometry of matrix product states. SIGMA, 10(095):095, 2014.
[10] François Denis and Yann Esposito. On rational stochastic languages. Fundamenta Informaticae, 86(1, 2):41-77, 2008.
[11] Carlton Downey, Ahmed Hefny, Byron Boots, Geoffrey J Gordon, and Boyue Li. Predictive state recurrent neural networks. In NIPS, 2017.
[12] Herbert Federer. Geometric measure theory. Springer, 2014.
[13] Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10):2451-2471, 2000.
[14] C Lee Giles, Clifford B Miller, Dong Chen, Hsing-Hen Chen, Guo-Zheng Sun, and Yee-Chun Lee. Learning and extracting finite state automata with second-order recurrent neural networks. Neural Computation, 4(3):393-405, 1992.
[15] C Lee Giles, Guo-Zheng Sun, Hsing-Hen Chen, Yee-Chun Lee, and Dong Chen. Higher order recurrent networks and grammatical inference. In NIPS, pages 380-387, 1990.
[16] Hadrien Glaude and Olivier Pietquin. PAC learning of probabilistic automaton based on the method of moments. In ICML, pages 820-829, 2016.
[17] Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton. Speech recognition with deep recurrent neural networks. In ICASSP. IEEE, 2013.
[18] Christopher J Hillar and Lek-Heng Lim. Most tensor problems are NP-hard. Journal of the ACM (JACM), 60(6):45, 2013.
[19] Daniel J. Hsu, Sham M. Kakade, and Tong Zhang. A spectral algorithm for learning hidden Markov models. In COLT, 2009.
[20] Prateek Jain, Raghu Meka, and Inderjit S Dhillon. Guaranteed rank minimization via singular value projection. In NIPS, 2010.
[21] Valentin Khrulkov, Alexander Novikov, and Ivan Oseledets. Expressive power of recurrent neural networks. arXiv preprint arXiv:1711.00811, 2017.
[22] Tamara G Kolda and Brett W Bader. Tensor decompositions and applications. SIAM Review, 51(3), 2009.
[23] YC Lee, Gary Doolen, HH Chen, GZ Sun, Tom Maxwell, HY Lee, and C Lee Giles. Machine learning using a higher order correlation network. Physica D: Nonlinear Phenomena, 22(1-3):276-306, 1986.
[24] Tomáš Mikolov, Stefan Kombrink, Lukáš Burget, Jan Černocký, and Sanjeev Khudanpur. Extensions of recurrent neural network language model. In ICASSP. IEEE, 2011.
[25] Christian W Omlin and C Lee Giles. Constructing deterministic finite-state automata in recurrent neural networks. Journal of the ACM (JACM), 43(6):937-972, 1996.
[26] Ivan V Oseledets. Tensor-train decomposition. SIAM Journal on Scientific Computing, 33(5):2295-2317, 2011.
[27] Jordan B Pollack. The induction of dynamical recognizers. In Connectionist Approaches to Language Learning, pages 123-148. Springer, 1991.
[28] Guillaume Rabusseau. A Tensor Perspective on Weighted Automata, Low-Rank Regression and Algebraic Mixtures. PhD thesis, Aix-Marseille Université, 2016.
[29] Guillaume Rabusseau, Borja Balle, and Joelle Pineau. Multitask spectral learning of weighted automata. In NIPS, 2017.
[30] Holger Rauhut, Reinhold Schneider, and Željka Stojanac. Low rank tensor recovery via iterative hard thresholding. Linear Algebra and its Applications, 523:220-262, 2017.
[31] Hanie Sedghi and Anima Anandkumar. Training input-output recurrent neural networks through spectral methods. arXiv preprint, 2016.
[32] Hava T Siegelmann and Eduardo D Sontag. On the computational power of neural nets. In COLT. ACM, 1992.
[33] Michael Thon and Herbert Jaeger. Links between multiplicity automata, observable operator models and predictive state representations: a unified learning framework. Journal of Machine Learning Research, 16:103-147, 2015.
[34] Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura. Compressing recurrent neural network with tensor train. In IJCNN. IEEE, 2017.
[35] Gail Weiss, Yoav Goldberg, and Eran Yahav. Extracting automata from recurrent neural networks using queries and counterexamples. arXiv preprint, 2017.
[36] Yinchong Yang, Denis Krompass, and Volker Tresp. Tensor-train recurrent neural networks for video classification. arXiv preprint, 2017.
[37] Rose Yu, Stephan Zheng, Anima Anandkumar, and Yisong Yue. Long-term forecasting using tensor-train RNNs. arXiv preprint arXiv:1711.00073, 2017.

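A Proofs

A.1 Proof of Theorem 1

Theorem 1. Any function that can be computed by a vv-WFA with $n$ states can be computed by a linear 2-RNN with $n$ hidden units. Conversely, any function that can be computed by a linear 2-RNN with $n$ hidden units on sequences of one-hot vectors (i.e. canonical basis vectors) can be computed by a WFA with $n$ states. More precisely, the WFA $A = (\alpha, \{\mathbf{A}^\sigma\}_{\sigma \in \Sigma}, \Omega)$ with $n$ states and the linear 2-RNN $M = (\alpha, \mathcal{A}, \Omega^\top)$ with $n$ hidden units, where $\mathcal{A} \in \mathbb{R}^{n \times \Sigma \times n}$ is defined by $\mathcal{A}_{:,\sigma,:} = \mathbf{A}^\sigma$ for all $\sigma \in \Sigma$, are such that $f_A(\sigma_1 \sigma_2 \cdots \sigma_k) = f_M(x_1, x_2, \dots, x_k)$ for all sequences of input symbols $\sigma_1, \dots, \sigma_k \in \Sigma$, where for each $i \in [k]$ the input vector $x_i \in \mathbb{R}^\Sigma$ is the one-hot encoding of the symbol $\sigma_i$.

Proof. We first show by induction on $k$ that, for any sequence $\sigma_1 \cdots \sigma_k \in \Sigma^*$, the hidden state $h_k$ computed by $M$ (see the definition of 2-RNNs in the main text) on the corresponding one-hot encoded sequence $x_1, \dots, x_k \in \mathbb{R}^d$, with $d = |\Sigma|$, satisfies $h_k = (\mathbf{A}^{\sigma_1} \cdots \mathbf{A}^{\sigma_k})^\top \alpha$. The case $k = 0$ is immediate. Suppose the result is true for sequences of length up to $k$. One can easily check that $\mathcal{A} \bullet_2 x_i = \mathbf{A}^{\sigma_i}$ for any index $i$. Using the induction hypothesis, it then follows that

$$h_{k+1} = \mathcal{A} \bullet_1 h_k \bullet_2 x_{k+1} = \mathbf{A}^{\sigma_{k+1}} \bullet_1 h_k = (\mathbf{A}^{\sigma_{k+1}})^\top h_k = (\mathbf{A}^{\sigma_{k+1}})^\top (\mathbf{A}^{\sigma_1} \cdots \mathbf{A}^{\sigma_k})^\top \alpha = (\mathbf{A}^{\sigma_1} \cdots \mathbf{A}^{\sigma_{k+1}})^\top \alpha.$$

To conclude, we thus have

$$f_M(x_1, x_2, \dots, x_k) = \Omega^\top h_k = \Omega^\top (\mathbf{A}^{\sigma_1} \cdots \mathbf{A}^{\sigma_k})^\top \alpha = f_A(\sigma_1 \sigma_2 \cdots \sigma_k).$$

A.2 Proof of Theorem 3

Theorem 3. Let $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$ be a function computed by a minimal linear 2-RNN with $n$ hidden units and let $L$ be an integer such that $\mathrm{rank}((\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle}) = n$. Then, for any $\mathbf{P} \in \mathbb{R}^{d^L \times n}$ and $\mathbf{S} \in \mathbb{R}^{n \times d^L p}$ such that $(\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle} = \mathbf{P}\mathbf{S}$, the linear 2-RNN $M = (\alpha, \mathcal{A}, \Omega)$ defined by

$$\alpha = (\mathbf{S}^\dagger)^\top (\mathcal{H}_f^{(L)})_{\langle\!\langle L+1 \rangle\!\rangle}, \qquad \mathcal{A} = ((\mathcal{H}_f^{(2L+1)})_{\langle\!\langle L, 1, L+1 \rangle\!\rangle}) \times_1 \mathbf{P}^\dagger \times_3 (\mathbf{S}^\dagger)^\top, \qquad \Omega^\top = \mathbf{P}^\dagger (\mathcal{H}_f^{(L)})_{\langle\!\langle L, 1 \rangle\!\rangle}$$

is a minimal linear 2-RNN computing $f$.

Proof. Let $\mathbf{P} \in \mathbb{R}^{d^L \times n}$ and $\mathbf{S} \in \mathbb{R}^{n \times d^L p}$ be such that $(\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle} = \mathbf{P}\mathbf{S}$, and let $(\alpha^\star, \mathcal{A}^\star, \Omega^\star)$ denote the parameters of a minimal linear 2-RNN computing $f$. Define the tensors

$$\mathcal{P}^\star = [\![ \underbrace{\mathcal{A}^\star \bullet_1 \alpha^\star, \mathcal{A}^\star, \dots, \mathcal{A}^\star}_{L \text{ times}}, \mathbf{I}_n ]\!] \in \mathbb{R}^{d \times \cdots \times d \times n} \quad \text{and} \quad \mathcal{S}^\star = [\![ \mathbf{I}_n, \underbrace{\mathcal{A}^\star, \dots, \mathcal{A}^\star}_{L \text{ times}}, (\Omega^\star)^\top ]\!] \in \mathbb{R}^{n \times d \times \cdots \times d \times p}$$

of order $L+1$ and $L+2$ respectively, and let $\mathbf{P}^\star = (\mathcal{P}^\star)_{\langle\!\langle L, 1 \rangle\!\rangle} \in \mathbb{R}^{d^L \times n}$ and $\mathbf{S}^\star = (\mathcal{S}^\star)_{\langle\!\langle 1, L+1 \rangle\!\rangle} \in \mathbb{R}^{n \times d^L p}$.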
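Using the identity $\mathcal{H}_f^{(j)} = [\![ \underbrace{\mathcal{A}^\star \bullet_1 \alpha^\star, \mathcal{A}^\star, \dots, \mathcal{A}^\star}_{j \text{ times}}, (\Omega^\star)^\top ]\!]$ for any $j$, one can easily check the following identities:

$$(\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle} = \mathbf{P}^\star \mathbf{S}^\star, \qquad (\mathcal{H}_f^{(2L+1)})_{\langle\!\langle L, 1, L+1 \rangle\!\rangle} = \mathcal{A}^\star \times_1 \mathbf{P}^\star \times_3 (\mathbf{S}^\star)^\top,$$

$$(\mathcal{H}_f^{(L)})_{\langle\!\langle L, 1 \rangle\!\rangle} = \mathbf{P}^\star (\Omega^\star)^\top, \qquad (\mathcal{H}_f^{(L)})_{\langle\!\langle L+1 \rangle\!\rangle} = (\mathbf{S}^\star)^\top \alpha^\star.$$

Let $\mathbf{M} = \mathbf{P}^\dagger \mathbf{P}^\star$. We will show that $\alpha = \mathbf{M}^{-\top} \alpha^\star$, $\mathcal{A} = \mathcal{A}^\star \times_1 \mathbf{M} \times_3 \mathbf{M}^{-\top}$ and $\Omega^\top = \mathbf{M} (\Omega^\star)^\top$, which will entail the result since linear 2-RNNs are invariant under change of basis (as discussed in the main text). First observe that $\mathbf{M}^{-1} = \mathbf{S}^\star \mathbf{S}^\dagger$. Indeed, we have

$$\mathbf{P}^\dagger \mathbf{P}^\star \mathbf{S}^\star \mathbf{S}^\dagger = \mathbf{P}^\dagger (\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle} \mathbf{S}^\dagger = \mathbf{P}^\dagger \mathbf{P} \mathbf{S} \mathbf{S}^\dagger = \mathbf{I},$$

where we used the fact that $\mathbf{P}$ (resp. $\mathbf{S}$) is of full column rank (resp. row rank) for the last equality.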

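The following derivations then follow from basic tensor algebra:

$$\alpha = (\mathbf{S}^\dagger)^\top (\mathcal{H}_f^{(L)})_{\langle\!\langle L+1 \rangle\!\rangle} = (\mathbf{S}^\dagger)^\top (\mathbf{S}^\star)^\top \alpha^\star = (\mathbf{S}^\star \mathbf{S}^\dagger)^\top \alpha^\star = \mathbf{M}^{-\top} \alpha^\star,$$

$$\mathcal{A} = ((\mathcal{H}_f^{(2L+1)})_{\langle\!\langle L, 1, L+1 \rangle\!\rangle}) \times_1 \mathbf{P}^\dagger \times_3 (\mathbf{S}^\dagger)^\top = (\mathcal{A}^\star \times_1 \mathbf{P}^\star \times_3 (\mathbf{S}^\star)^\top) \times_1 \mathbf{P}^\dagger \times_3 (\mathbf{S}^\dagger)^\top = \mathcal{A}^\star \times_1 \mathbf{P}^\dagger \mathbf{P}^\star \times_3 (\mathbf{S}^\star \mathbf{S}^\dagger)^\top = \mathcal{A}^\star \times_1 \mathbf{M} \times_3 \mathbf{M}^{-\top},$$

$$\Omega^\top = \mathbf{P}^\dagger (\mathcal{H}_f^{(L)})_{\langle\!\langle L, 1 \rangle\!\rangle} = \mathbf{P}^\dagger \mathbf{P}^\star (\Omega^\star)^\top = \mathbf{M} (\Omega^\star)^\top,$$

which concludes the proof.

A.3 Proof of Theorem 4

Theorem 4. Let $(h_0, \mathcal{A}, \Omega)$ be a minimal linear 2-RNN with $n$ hidden units computing a function $f : (\mathbb{R}^d)^* \to \mathbb{R}^p$, and let $L$ be an integer$^{6}$ such that $\mathrm{rank}((\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle}) = n$. Suppose we have access to 3 datasets $D_\ell = \{((x_1^{(n)}, x_2^{(n)}, \dots, x_\ell^{(n)}), y^{(n)})\}_{n=1}^{N_\ell} \subset (\mathbb{R}^d)^\ell \times \mathbb{R}^p$ for $\ell \in \{L, 2L, 2L+1\}$, where the entries of each $x_i^{(n)}$ are drawn independently from the standard normal distribution and where each $y^{(n)} = f(x_1^{(n)}, x_2^{(n)}, \dots, x_\ell^{(n)})$. Then, whenever $N_\ell \geq d^\ell$ for each $\ell \in \{L, 2L, 2L+1\}$, the linear 2-RNN $M$ returned by Algorithm 1 with the least-squares method satisfies $f_M = f$ with probability one.

Proof. We just need to show for each $\ell \in \{L, 2L, 2L+1\}$ that, under the hypothesis of the theorem, the Hankel tensors $\hat{\mathcal{H}}^{(\ell)}$ computed in line 4 of Algorithm 1 are equal to the true Hankel tensors $\mathcal{H}^{(\ell)}$ with probability one. Recall that these tensors are computed by solving the least-squares problem

$$\hat{\mathcal{H}}^{(\ell)} = \arg\min_{\mathcal{T} \in \mathbb{R}^{d \times \cdots \times d \times p}} \| \mathbf{X} (\mathcal{T})_{\langle\!\langle \ell, 1 \rangle\!\rangle} - \mathbf{Y} \|_F^2$$

where $\mathbf{X} \in \mathbb{R}^{N_\ell \times d^\ell}$ is the matrix with rows $x_1^{(n)} \otimes x_2^{(n)} \otimes \cdots \otimes x_\ell^{(n)}$ for each $n \in [N_\ell]$. Since $\mathbf{X} (\mathcal{H}^{(\ell)})_{\langle\!\langle \ell, 1 \rangle\!\rangle} = \mathbf{Y}$ and since the solution of the least-squares problem is unique as soon as $\mathbf{X}$ is of full column rank, we just need to show that this is the case with probability one when the entries of the vectors $x_i^{(n)}$ are drawn at random from a standard normal distribution. The result will then directly follow by applying Theorem 3.

We will show that the set

$$\mathcal{S} = \{ (x_1^{(n)}, \dots, x_\ell^{(n)})_{n=1}^{N_\ell} \mid \dim(\mathrm{span}(\{ x_1^{(n)} \otimes x_2^{(n)} \otimes \cdots \otimes x_\ell^{(n)} \}_{n=1}^{N_\ell})) < d^\ell \}$$

has Lebesgue measure 0 in $((\mathbb{R}^d)^\ell)^{N_\ell} \simeq \mathbb{R}^{d \ell N_\ell}$ as soon as $N_\ell \geq d^\ell$, which will imply that it has probability 0 under any continuous probability, hence the result. For any $S = \{(x_1^{(n)}, \dots, x_\ell^{(n)})\}_{n=1}^{N_\ell}$, we denote by $\mathbf{X}_S \in \mathbb{R}^{N_\ell \times d^\ell}$ the matrix with rows $x_1^{(n)} \otimes x_2^{(n)} \otimes \cdots \otimes x_\ell^{(n)}$. One can easily check that $S \in \mathcal{S}$ if and only if $\mathbf{X}_S$ is of rank strictly less than $d^\ell$, which is equivalent to the determinant of $\mathbf{X}_S^\top \mathbf{X}_S$ being equal to 0.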
Since this determinant is a polynomial in the entries of the vectors $x_i^{(n)}$, $\mathcal{S}$ is an algebraic subvariety of $\mathbb{R}^{d \ell N_\ell}$. It is then easy to check that the polynomial $\det(\mathbf{X}_S^\top \mathbf{X}_S)$ is not identically zero when $N_\ell \geq d^\ell$. Indeed, it suffices to choose the vectors $x_i^{(n)}$ such that the family $(x_1^{(n)} \otimes x_2^{(n)} \otimes \cdots \otimes x_\ell^{(n)})_{n=1}^{N_\ell}$ spans the whole space $\mathbb{R}^{d^\ell}$ (which is possible since we can choose arbitrarily any of the $N_\ell \geq d^\ell$ elements of this family), hence the result. In conclusion, $\mathcal{S}$ is a proper algebraic subvariety of $\mathbb{R}^{d \ell N_\ell}$ and hence has Lebesgue measure zero [12, Section 2.6.5].

$^{6}$ Note that the theorem can be adapted if such an integer $L$ does not exist (see supplementary material).
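The argument above can be checked numerically on a small instance. The following is a minimal sketch, not the authors' implementation: it assumes a randomly drawn linear 2-RNN whose output is computed as $\Omega h_k$ with $\Omega \in \mathbb{R}^{p \times n}$, and the helper names (`rnn_output`, `hankel_tensor`, `recover_hankel`) as well as the sizes are hypothetical choices made for illustration. It draws exactly $N_\ell = d^\ell$ Gaussian input sequences, builds the matrix $\mathbf{X}$ of Kronecker products, and compares the least-squares solution with the true Hankel tensor.

```python
import numpy as np
from functools import reduce


def rnn_output(h0, A, Omega, xs):
    """Output of a linear 2-RNN (h0, A, Omega) after reading the sequence xs."""
    h = h0
    for x in xs:
        # h_t[k] = sum_{i,j} A[i, j, k] * h_{t-1}[i] * x_t[j]
        h = np.einsum("ijk,i,j->k", A, h, x)
    return Omega @ h


def hankel_tensor(h0, A, Omega, ell):
    """True Hankel tensor: entry (i_1, ..., i_ell, :) is f(e_{i_1}, ..., e_{i_ell})."""
    d, p = A.shape[1], Omega.shape[0]
    basis = np.eye(d)
    H = np.zeros((d,) * ell + (p,))
    for idx in np.ndindex(*([d] * ell)):
        H[idx] = rnn_output(h0, A, Omega, [basis[i] for i in idx])
    return H


def recover_hankel(inputs, Y):
    """Least-squares estimate of the Hankel tensor.

    inputs has shape (N, ell, d) and Y has shape (N, p).
    """
    N, ell, d = inputs.shape
    # Row n of X is the Kronecker product x_1 (x) x_2 (x) ... (x) x_ell.
    X = np.stack([reduce(np.kron, seq) for seq in inputs])
    T, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return X, T.reshape((d,) * ell + (Y.shape[1],))


# Hypothetical small instance (sizes chosen only for illustration).
rng = np.random.default_rng(0)
n, d, p, ell = 3, 2, 2, 3
h0 = rng.normal(size=n)
A = rng.normal(size=(n, d, n))
Omega = rng.normal(size=(p, n))

N = d ** ell  # smallest sample size allowed by the theorem
inputs = rng.normal(size=(N, ell, d))
Y = np.stack([rnn_output(h0, A, Omega, seq) for seq in inputs])

X, H_hat = recover_hankel(inputs, Y)
H_true = hankel_tensor(h0, A, Omega, ell)
rank_X = np.linalg.matrix_rank(X)
max_err = np.max(np.abs(H_hat - H_true))
print(rank_X, d ** ell, max_err)
```

In this noiseless setting the reported rank should equal $d^\ell$ and the error should be at the level of floating-point round-off, which is what the measure-zero argument predicts for Gaussian inputs.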

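B Lifting the simplifying assumption

We now show how all our results still hold when there does not exist an $L$ such that $\mathrm{rank}((\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle}) = n$. Recall that this simplifying assumption followed from assuming that the sets $\mathcal{P} = \mathcal{S} = [d]^L$ form a complete basis for the function $\tilde{f} : [d]^* \to \mathbb{R}^p$ defined by $\tilde{f}(i_1 i_2 \cdots i_k) = f(e_{i_1}, e_{i_2}, \dots, e_{i_k})$. We first show that there always exists an integer $L$ such that $\mathcal{P} = \mathcal{S} = \cup_{i \leq L} [d]^i$ forms a complete basis for $\tilde{f}$. Let $M = (\alpha^\star, \mathcal{A}^\star, \Omega^\star)$ be a linear 2-RNN with $n$ hidden units computing $f$ (i.e. such that $f_M = f$). It follows from Theorem 1 and from the discussion in Section 4 that there exists a vv-WFA computing $\tilde{f}$, and it is easy to check that $\mathrm{rank}(\tilde{f}) = n$. This implies $\mathrm{rank}((\mathcal{H}_{\tilde{f}})_{(1)}) = n$ by Theorem 2. Since $\mathcal{P} = \mathcal{S} = \cup_{i \leq \ell} [d]^i$ converges to $[d]^*$ as $\ell$ grows to infinity, there exists an $L$ such that the finite sub-block $\tilde{\mathcal{H}}_{\tilde{f}} \in \mathbb{R}^{\mathcal{P} \times \mathcal{S} \times p}$ of $\mathcal{H}_{\tilde{f}} \in \mathbb{R}^{[d]^* \times [d]^* \times p}$ satisfies $\mathrm{rank}((\tilde{\mathcal{H}}_{\tilde{f}})_{(1)}) = n$, i.e. such that $\mathcal{P} = \mathcal{S} = \cup_{i \leq L} [d]^i$ forms a complete basis for $\tilde{f}$.

Now consider the finite sub-blocks $\tilde{\mathcal{H}}^{+}_{\tilde{f}} \in \mathbb{R}^{\mathcal{P} \times [d] \times \mathcal{S} \times p}$ and $\tilde{\mathbf{H}}^{-}_{\tilde{f}} \in \mathbb{R}^{\mathcal{P} \times p}$ of $\mathcal{H}_{\tilde{f}}$ defined by

$$(\tilde{\mathcal{H}}^{+}_{\tilde{f}})_{u,i,v,:} = \tilde{f}(uiv) \quad \text{and} \quad (\tilde{\mathbf{H}}^{-}_{\tilde{f}})_{u,:} = \tilde{f}(u)$$

for any $u, v \in \mathcal{P} = \mathcal{S}$ and any $i \in [d]$. One can check that Theorem 3 holds by replacing, mutatis mutandis, $(\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle}$ by $(\tilde{\mathcal{H}}_{\tilde{f}})_{(1)}$, $(\mathcal{H}_f^{(2L+1)})_{\langle\!\langle L, 1, L+1 \rangle\!\rangle}$ by $\tilde{\mathcal{H}}^{+}_{\tilde{f}}$, $(\mathcal{H}_f^{(L)})_{\langle\!\langle L, 1 \rangle\!\rangle}$ by $\tilde{\mathbf{H}}^{-}_{\tilde{f}}$ and $(\mathcal{H}_f^{(L)})_{\langle\!\langle L+1 \rangle\!\rangle}$ by $\mathrm{vec}(\tilde{\mathbf{H}}^{-}_{\tilde{f}})$.

To conclude, it suffices to observe that both $\tilde{\mathcal{H}}^{+}_{\tilde{f}}$ and $\tilde{\mathbf{H}}^{-}_{\tilde{f}}$ can be constructed from the entries of the tensors $\mathcal{H}^{(\ell)}$ for $\ell \in [2L+1]$, which can be recovered (or estimated in the noisy setting) using the techniques described in Section 4 of the main paper.

We thus showed that linear 2-RNNs can be provably learned even when there does not exist an $L$ such that $\mathrm{rank}((\mathcal{H}_f^{(2L)})_{\langle\!\langle L, L+1 \rangle\!\rangle}) = n$. In this setting, one needs to estimate enough of the tensors $\mathcal{H}^{(\ell)}$ to reconstruct a complete sub-block $\tilde{\mathcal{H}}_{\tilde{f}}$ of the Hankel tensor $\mathcal{H}_{\tilde{f}}$ (along with the corresponding tensor $\tilde{\mathcal{H}}^{+}_{\tilde{f}}$ and matrix $\tilde{\mathbf{H}}^{-}_{\tilde{f}}$) and recover the linear 2-RNN by applying Theorem 3. In addition, one needs to have access to sufficiently large datasets $D_\ell$ for each $\ell \in [2L+1]$ rather than only the three datasets mentioned in Theorem 4.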
However, the data requirement remains the same in the case where we assume that each of the datasets $D_\ell$ is constructed from a unique training dataset $S = \{((x_1^{(n)}, x_2^{(n)}, \dots, x_T^{(n)}), (y_1^{(n)}, y_2^{(n)}, \dots, y_T^{(n)}))\}_{n=1}^{N}$ of input/output sequences.
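Whichever Hankel blocks are used, the final step is the factorization-based recovery of Theorem 3. The following is a minimal numerical sketch of that step under the simplifying assumption (a single length $L$ with a rank-$n$ flattening), not the authors' code: the helper names, the sizes, the convention that the output is $\Omega h_k$ with $\Omega \in \mathbb{R}^{p \times n}$, and the use of a truncated SVD to obtain the factorization $\mathbf{P}\mathbf{S}$ are all assumptions made for illustration (the theorem allows any rank-$n$ factorization). The Hankel tensors are computed exactly from a random linear 2-RNN, so the recovered model should compute the same function up to a change of basis.

```python
import numpy as np


def rnn_output(h0, A, Omega, xs):
    """Output of a linear 2-RNN (h0, A, Omega) after reading the sequence xs."""
    h = h0
    for x in xs:
        # h_t[k] = sum_{i,j} A[i, j, k] * h_{t-1}[i] * x_t[j]
        h = np.einsum("ijk,i,j->k", A, h, x)
    return Omega @ h


def hankel_tensor(h0, A, Omega, ell):
    """Hankel tensor of order ell + 1, filled by evaluating f on basis vectors."""
    d, p = A.shape[1], Omega.shape[0]
    basis = np.eye(d)
    H = np.zeros((d,) * ell + (p,))
    for idx in np.ndindex(*([d] * ell)):
        H[idx] = rnn_output(h0, A, Omega, [basis[i] for i in idx])
    return H


def spectral_recovery(H_L, H_2L, H_2L1, n, L):
    """Recover (alpha, A, Omega) from H^(L), H^(2L) and H^(2L+1)."""
    d, p = H_L.shape[0], H_L.shape[-1]
    m = d ** L
    # Rank-n factorization P S of the flattened H^(2L), here via truncated SVD.
    U, s, Vt = np.linalg.svd(H_2L.reshape(m, m * p), full_matrices=False)
    P, S = U[:, :n] * s[:n], Vt[:n]
    P_pinv, S_pinv_T = np.linalg.pinv(P), np.linalg.pinv(S).T
    alpha = S_pinv_T @ H_L.reshape(m * p)
    # Mode-1 product with P^+ and mode-3 product with (S^+)^T.
    A = np.einsum("au,ujv,cv->ajc", P_pinv, H_2L1.reshape(m, d, m * p), S_pinv_T)
    Omega = (P_pinv @ H_L.reshape(m, p)).T
    return alpha, A, Omega


# Hypothetical target model (sizes chosen only for illustration).
rng = np.random.default_rng(1)
n, d, p, L = 3, 2, 2, 2
h0 = rng.normal(size=n)
A = rng.normal(size=(n, d, n))
Omega = rng.normal(size=(p, n))

H = {ell: hankel_tensor(h0, A, Omega, ell) for ell in (L, 2 * L, 2 * L + 1)}
rank = np.linalg.matrix_rank(H[2 * L].reshape(d ** L, -1))
alpha_hat, A_hat, Omega_hat = spectral_recovery(H[L], H[2 * L], H[2 * L + 1], n, L)

# Compare both models on a real-valued sequence that is not one-hot encoded.
xs = rng.normal(size=(6, d))
y_true = rnn_output(h0, A, Omega, xs)
y_hat = rnn_output(alpha_hat, A_hat, Omega_hat, xs)
print(rank, y_true, y_hat)
```

The recovered parameters generally differ from the original ones entry by entry, since they are only determined up to the invertible matrix $\mathbf{M}$ from the proof; only the computed function is expected to match.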

A Brief Introduction to Markov Chains and Hidden Markov Models

A Brief Introduction to Markov Chains and Hidden Markov Models A Brief Introduction to Markov Chains and Hidden Markov Modes Aen B MacKenzie Notes for December 1, 3, &8, 2015 Discrete-Time Markov Chains You may reca that when we first introduced random processes,

More information

Cryptanalysis of PKP: A New Approach

Cryptanalysis of PKP: A New Approach Cryptanaysis of PKP: A New Approach Éiane Jaumes and Antoine Joux DCSSI 18, rue du Dr. Zamenhoff F-92131 Issy-es-Mx Cedex France eiane.jaumes@wanadoo.fr Antoine.Joux@ens.fr Abstract. Quite recenty, in

More information

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with?

Bayesian Learning. You hear a which which could equally be Thanks or Tanks, which would you go with? Bayesian Learning A powerfu and growing approach in machine earning We use it in our own decision making a the time You hear a which which coud equay be Thanks or Tanks, which woud you go with? Combine

More information

Statistical Learning Theory: A Primer

Statistical Learning Theory: A Primer Internationa Journa of Computer Vision 38(), 9 3, 2000 c 2000 uwer Academic Pubishers. Manufactured in The Netherands. Statistica Learning Theory: A Primer THEODOROS EVGENIOU, MASSIMILIANO PONTIL AND TOMASO

More information

Appendix of the Paper The Role of No-Arbitrage on Forecasting: Lessons from a Parametric Term Structure Model

Appendix of the Paper The Role of No-Arbitrage on Forecasting: Lessons from a Parametric Term Structure Model Appendix of the Paper The Roe of No-Arbitrage on Forecasting: Lessons from a Parametric Term Structure Mode Caio Ameida cameida@fgv.br José Vicente jose.vaentim@bcb.gov.br June 008 1 Introduction In this

More information

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms 5. oward Eicient arge-scae Perormance odeing o Integrated Circuits via uti-ode/uti-corner Sparse Regression Wangyang Zhang entor Graphics Corporation Ridder Park Drive San Jose, CA 953 wangyan@ece.cmu.edu

More information

Problem set 6 The Perron Frobenius theorem.

Problem set 6 The Perron Frobenius theorem. Probem set 6 The Perron Frobenius theorem. Math 22a4 Oct 2 204, Due Oct.28 In a future probem set I want to discuss some criteria which aow us to concude that that the ground state of a sef-adjoint operator

More information

SUPPLEMENTARY MATERIAL TO INNOVATED SCALABLE EFFICIENT ESTIMATION IN ULTRA-LARGE GAUSSIAN GRAPHICAL MODELS

SUPPLEMENTARY MATERIAL TO INNOVATED SCALABLE EFFICIENT ESTIMATION IN ULTRA-LARGE GAUSSIAN GRAPHICAL MODELS ISEE 1 SUPPLEMENTARY MATERIAL TO INNOVATED SCALABLE EFFICIENT ESTIMATION IN ULTRA-LARGE GAUSSIAN GRAPHICAL MODELS By Yingying Fan and Jinchi Lv University of Southern Caifornia This Suppementary Materia

More information

Partial permutation decoding for MacDonald codes

Partial permutation decoding for MacDonald codes Partia permutation decoding for MacDonad codes J.D. Key Department of Mathematics and Appied Mathematics University of the Western Cape 7535 Bevie, South Africa P. Seneviratne Department of Mathematics

More information

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents MARKOV CHAINS AND MARKOV DECISION THEORY ARINDRIMA DATTA Abstract. In this paper, we begin with a forma introduction to probabiity and expain the concept of random variabes and stochastic processes. After

More information

A. Distribution of the test statistic

A. Distribution of the test statistic A. Distribution of the test statistic In the sequentia test, we first compute the test statistic from a mini-batch of size m. If a decision cannot be made with this statistic, we keep increasing the mini-batch

More information

BP neural network-based sports performance prediction model applied research

BP neural network-based sports performance prediction model applied research Avaiabe onine www.jocpr.com Journa of Chemica and Pharmaceutica Research, 204, 6(7:93-936 Research Artice ISSN : 0975-7384 CODEN(USA : JCPRC5 BP neura networ-based sports performance prediction mode appied

More information

Lecture Note 3: Stationary Iterative Methods

Lecture Note 3: Stationary Iterative Methods MATH 5330: Computationa Methods of Linear Agebra Lecture Note 3: Stationary Iterative Methods Xianyi Zeng Department of Mathematica Sciences, UTEP Stationary Iterative Methods The Gaussian eimination (or

More information

A Fictitious Time Integration Method for a One-Dimensional Hyperbolic Boundary Value Problem

A Fictitious Time Integration Method for a One-Dimensional Hyperbolic Boundary Value Problem Journa o mathematics and computer science 14 (15) 87-96 A Fictitious ime Integration Method or a One-Dimensiona Hyperboic Boundary Vaue Probem Mir Saad Hashemi 1,*, Maryam Sariri 1 1 Department o Mathematics,

More information

Inductive Bias: How to generalize on novel data. CS Inductive Bias 1

Inductive Bias: How to generalize on novel data. CS Inductive Bias 1 Inductive Bias: How to generaize on nove data CS 478 - Inductive Bias 1 Overfitting Noise vs. Exceptions CS 478 - Inductive Bias 2 Non-Linear Tasks Linear Regression wi not generaize we to the task beow

More information

Interactive Fuzzy Programming for Two-level Nonlinear Integer Programming Problems through Genetic Algorithms

Interactive Fuzzy Programming for Two-level Nonlinear Integer Programming Problems through Genetic Algorithms Md. Abu Kaam Azad et a./asia Paciic Management Review (5) (), 7-77 Interactive Fuzzy Programming or Two-eve Noninear Integer Programming Probems through Genetic Agorithms Abstract Md. Abu Kaam Azad a,*,

More information

XSAT of linear CNF formulas

XSAT of linear CNF formulas XSAT of inear CN formuas Bernd R. Schuh Dr. Bernd Schuh, D-50968 Kön, Germany; bernd.schuh@netcoogne.de eywords: compexity, XSAT, exact inear formua, -reguarity, -uniformity, NPcompeteness Abstract. Open

More information

Stochastic Automata Networks (SAN) - Modelling. and Evaluation. Paulo Fernandes 1. Brigitte Plateau 2. May 29, 1997

Stochastic Automata Networks (SAN) - Modelling. and Evaluation. Paulo Fernandes 1. Brigitte Plateau 2. May 29, 1997 Stochastic utomata etworks (S) - Modeing and Evauation Pauo Fernandes rigitte Pateau 2 May 29, 997 Institut ationa Poytechnique de Grenobe { IPG Ecoe ationae Superieure d'informatique et de Mathematiques

More information

CS229 Lecture notes. Andrew Ng

CS229 Lecture notes. Andrew Ng CS229 Lecture notes Andrew Ng Part IX The EM agorithm In the previous set of notes, we taked about the EM agorithm as appied to fitting a mixture of Gaussians. In this set of notes, we give a broader view

More information

The Group Structure on a Smooth Tropical Cubic

The Group Structure on a Smooth Tropical Cubic The Group Structure on a Smooth Tropica Cubic Ethan Lake Apri 20, 2015 Abstract Just as in in cassica agebraic geometry, it is possibe to define a group aw on a smooth tropica cubic curve. In this note,

More information

Separation of Variables and a Spherical Shell with Surface Charge

Separation of Variables and a Spherical Shell with Surface Charge Separation of Variabes and a Spherica She with Surface Charge In cass we worked out the eectrostatic potentia due to a spherica she of radius R with a surface charge density σθ = σ cos θ. This cacuation

More information

arxiv: v1 [stat.ml] 17 Oct 2018

arxiv: v1 [stat.ml] 17 Oct 2018 Hierarchica Methods of Moments Matteo Ruffini Universitat Poitècnica de Cataunya Guiaume Rabusseau McGi University Borja Bae Amazon Research Cambridge arxiv:1810.07468v1 [stat.ml] 17 Oct 2018 Abstract

More information

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete Uniprocessor Feasibiity of Sporadic Tasks with Constrained Deadines is Strongy conp-compete Pontus Ekberg and Wang Yi Uppsaa University, Sweden Emai: {pontus.ekberg yi}@it.uu.se Abstract Deciding the feasibiity

More information

ORTHOGONAL MULTI-WAVELETS FROM MATRIX FACTORIZATION

ORTHOGONAL MULTI-WAVELETS FROM MATRIX FACTORIZATION J. Korean Math. Soc. 46 2009, No. 2, pp. 281 294 ORHOGONAL MLI-WAVELES FROM MARIX FACORIZAION Hongying Xiao Abstract. Accuracy of the scaing function is very crucia in waveet theory, or correspondingy,

More information

THE REACHABILITY CONES OF ESSENTIALLY NONNEGATIVE MATRICES

THE REACHABILITY CONES OF ESSENTIALLY NONNEGATIVE MATRICES THE REACHABILITY CONES OF ESSENTIALLY NONNEGATIVE MATRICES by Michae Neumann Department of Mathematics, University of Connecticut, Storrs, CT 06269 3009 and Ronad J. Stern Department of Mathematics, Concordia

More information

C. Fourier Sine Series Overview

C. Fourier Sine Series Overview 12 PHILIP D. LOEWEN C. Fourier Sine Series Overview Let some constant > be given. The symboic form of the FSS Eigenvaue probem combines an ordinary differentia equation (ODE) on the interva (, ) with a

More information

Dynamic Matching Markets and Voting Paths

Dynamic Matching Markets and Voting Paths Dynamic Matching Markets and Voting Paths David J. Abraham Teikepai Kavitha Abstract We consider a matching market, in which the aim is to maintain a popuar matching between a set o appicants and a set

More information

A Solution to the 4-bit Parity Problem with a Single Quaternary Neuron

A Solution to the 4-bit Parity Problem with a Single Quaternary Neuron Neura Information Processing - Letters and Reviews Vo. 5, No. 2, November 2004 LETTER A Soution to the 4-bit Parity Probem with a Singe Quaternary Neuron Tohru Nitta Nationa Institute of Advanced Industria

More information

Simulation of single bubble rising in liquid using front tracking method

Simulation of single bubble rising in liquid using front tracking method Advances in Fuid Mechanics VI 79 Simuation o singe bubbe rising in iquid using ront tracking method J. Hua & J. Lou Institute o High Perormance Computing, #01-01 The Capricorn, Singapore Abstract Front

More information

A NOTE ON QUASI-STATIONARY DISTRIBUTIONS OF BIRTH-DEATH PROCESSES AND THE SIS LOGISTIC EPIDEMIC

A NOTE ON QUASI-STATIONARY DISTRIBUTIONS OF BIRTH-DEATH PROCESSES AND THE SIS LOGISTIC EPIDEMIC (January 8, 2003) A NOTE ON QUASI-STATIONARY DISTRIBUTIONS OF BIRTH-DEATH PROCESSES AND THE SIS LOGISTIC EPIDEMIC DAMIAN CLANCY, University of Liverpoo PHILIP K. POLLETT, University of Queensand Abstract

More information

ASummaryofGaussianProcesses Coryn A.L. Bailer-Jones

ASummaryofGaussianProcesses Coryn A.L. Bailer-Jones ASummaryofGaussianProcesses Coryn A.L. Baier-Jones Cavendish Laboratory University of Cambridge caj@mrao.cam.ac.uk Introduction A genera prediction probem can be posed as foows. We consider that the variabe

More information

6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17. Solution 7

6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17. Solution 7 6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17 Soution 7 Probem 1: Generating Random Variabes Each part of this probem requires impementation in MATLAB. For the

More information

The EM Algorithm applied to determining new limit points of Mahler measures

The EM Algorithm applied to determining new limit points of Mahler measures Contro and Cybernetics vo. 39 (2010) No. 4 The EM Agorithm appied to determining new imit points of Maher measures by Souad E Otmani, Georges Rhin and Jean-Marc Sac-Épée Université Pau Veraine-Metz, LMAM,

More information

First-Order Corrections to Gutzwiller s Trace Formula for Systems with Discrete Symmetries

First-Order Corrections to Gutzwiller s Trace Formula for Systems with Discrete Symmetries c 26 Noninear Phenomena in Compex Systems First-Order Corrections to Gutzwier s Trace Formua for Systems with Discrete Symmetries Hoger Cartarius, Jörg Main, and Günter Wunner Institut für Theoretische

More information

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA)

FRST Multivariate Statistics. Multivariate Discriminant Analysis (MDA) 1 FRST 531 -- Mutivariate Statistics Mutivariate Discriminant Anaysis (MDA) Purpose: 1. To predict which group (Y) an observation beongs to based on the characteristics of p predictor (X) variabes, using

More information

Sardinas-Patterson like algorithms in coding theory

Sardinas-Patterson like algorithms in coding theory A.I.Cuza University of Iaşi Facuty of Computer Science Sardinas-Patterson ike agorithms in coding theory Author, Bogdan Paşaniuc Supervisor, Prof. dr. Ferucio Laurenţiu Ţipea Iaşi, September 2003 Contents

More information

A proposed nonparametric mixture density estimation using B-spline functions

A proposed nonparametric mixture density estimation using B-spline functions A proposed nonparametric mixture density estimation using B-spine functions Atizez Hadrich a,b, Mourad Zribi a, Afif Masmoudi b a Laboratoire d Informatique Signa et Image de a Côte d Opae (LISIC-EA 4491),

More information

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network

An Algorithm for Pruning Redundant Modules in Min-Max Modular Network An Agorithm for Pruning Redundant Modues in Min-Max Moduar Network Hui-Cheng Lian and Bao-Liang Lu Department of Computer Science and Engineering, Shanghai Jiao Tong University 1954 Hua Shan Rd., Shanghai

More information

Distributed average consensus: Beyond the realm of linearity

Distributed average consensus: Beyond the realm of linearity Distributed average consensus: Beyond the ream of inearity Usman A. Khan, Soummya Kar, and José M. F. Moura Department of Eectrica and Computer Engineering Carnegie Meon University 5 Forbes Ave, Pittsburgh,

More information

Integrating Factor Methods as Exponential Integrators

Integrating Factor Methods as Exponential Integrators Integrating Factor Methods as Exponentia Integrators Borisav V. Minchev Department of Mathematica Science, NTNU, 7491 Trondheim, Norway Borko.Minchev@ii.uib.no Abstract. Recenty a ot of effort has been

More information

Efficiently Generating Random Bits from Finite State Markov Chains

Efficiently Generating Random Bits from Finite State Markov Chains 1 Efficienty Generating Random Bits from Finite State Markov Chains Hongchao Zhou and Jehoshua Bruck, Feow, IEEE Abstract The probem of random number generation from an uncorreated random source (of unknown

More information

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c)

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c) A Simpe Efficient Agorithm of 3-D Singe-Source Locaization with Uniform Cross Array Bing Xue a * Guangyou Fang b Yicai Ji c Key Laboratory of Eectromagnetic Radiation Sensing Technoogy, Institute of Eectronics,

More information

Smoothness equivalence properties of univariate subdivision schemes and their projection analogues

Smoothness equivalence properties of univariate subdivision schemes and their projection analogues Numerische Mathematik manuscript No. (wi be inserted by the editor) Smoothness equivaence properties of univariate subdivision schemes and their projection anaogues Phiipp Grohs TU Graz Institute of Geometry

More information

Active Learning & Experimental Design

Active Learning & Experimental Design Active Learning & Experimenta Design Danie Ting Heaviy modified, of course, by Lye Ungar Origina Sides by Barbara Engehardt and Aex Shyr Lye Ungar, University of Pennsyvania Motivation u Data coection

More information

Theory of Generalized k-difference Operator and Its Application in Number Theory

Theory of Generalized k-difference Operator and Its Application in Number Theory Internationa Journa of Mathematica Anaysis Vo. 9, 2015, no. 19, 955-964 HIKARI Ltd, www.m-hiari.com http://dx.doi.org/10.12988/ijma.2015.5389 Theory of Generaized -Difference Operator and Its Appication

More information

Statistical Learning Theory: a Primer

Statistical Learning Theory: a Primer ??,??, 1 6 (??) c?? Kuwer Academic Pubishers, Boston. Manufactured in The Netherands. Statistica Learning Theory: a Primer THEODOROS EVGENIOU AND MASSIMILIANO PONTIL Center for Bioogica and Computationa

More information

Algorithms to solve massively under-defined systems of multivariate quadratic equations

Algorithms to solve massively under-defined systems of multivariate quadratic equations Agorithms to sove massivey under-defined systems of mutivariate quadratic equations Yasufumi Hashimoto Abstract It is we known that the probem to sove a set of randomy chosen mutivariate quadratic equations

More information

BALANCING REGULAR MATRIX PENCILS

BALANCING REGULAR MATRIX PENCILS BALANCING REGULAR MATRIX PENCILS DAMIEN LEMONNIER AND PAUL VAN DOOREN Abstract. In this paper we present a new diagona baancing technique for reguar matrix pencis λb A, which aims at reducing the sensitivity

More information

arxiv: v1 [cs.lg] 31 Oct 2017

arxiv: v1 [cs.lg] 31 Oct 2017 ACCELERATED SPARSE SUBSPACE CLUSTERING Abofaz Hashemi and Haris Vikao Department of Eectrica and Computer Engineering, University of Texas at Austin, Austin, TX, USA arxiv:7.26v [cs.lg] 3 Oct 27 ABSTRACT

More information

4 Separation of Variables

4 Separation of Variables 4 Separation of Variabes In this chapter we describe a cassica technique for constructing forma soutions to inear boundary vaue probems. The soution of three cassica (paraboic, hyperboic and eiptic) PDE

More information

PARSIMONIOUS VARIATIONAL-BAYES MIXTURE AGGREGATION WITH A POISSON PRIOR. Pierrick Bruneau, Marc Gelgon and Fabien Picarougne

PARSIMONIOUS VARIATIONAL-BAYES MIXTURE AGGREGATION WITH A POISSON PRIOR. Pierrick Bruneau, Marc Gelgon and Fabien Picarougne 17th European Signa Processing Conference (EUSIPCO 2009) Gasgow, Scotand, August 24-28, 2009 PARSIMONIOUS VARIATIONAL-BAYES MIXTURE AGGREGATION WITH A POISSON PRIOR Pierric Bruneau, Marc Gegon and Fabien

More information

Symbolic models for nonlinear control systems using approximate bisimulation

Symbolic models for nonlinear control systems using approximate bisimulation Symboic modes for noninear contro systems using approximate bisimuation Giordano Poa, Antoine Girard and Pauo Tabuada Abstract Contro systems are usuay modeed by differentia equations describing how physica

More information

Explicit overall risk minimization transductive bound

Explicit overall risk minimization transductive bound 1 Expicit overa risk minimization transductive bound Sergio Decherchi, Paoo Gastado, Sandro Ridea, Rodofo Zunino Dept. of Biophysica and Eectronic Engineering (DIBE), Genoa University Via Opera Pia 11a,

More information

A Novel Learning Method for Elman Neural Network Using Local Search

A Novel Learning Method for Elman Neural Network Using Local Search Neura Information Processing Letters and Reviews Vo. 11, No. 8, August 2007 LETTER A Nove Learning Method for Eman Neura Networ Using Loca Search Facuty of Engineering, Toyama University, Gofuu 3190 Toyama

More information

Rate-Distortion Theory of Finite Point Processes

Rate-Distortion Theory of Finite Point Processes Rate-Distortion Theory of Finite Point Processes Günther Koiander, Dominic Schuhmacher, and Franz Hawatsch, Feow, IEEE Abstract We study the compression of data in the case where the usefu information

More information

Turbo Codes. Coding and Communication Laboratory. Dept. of Electrical Engineering, National Chung Hsing University

Turbo Codes. Coding and Communication Laboratory. Dept. of Electrical Engineering, National Chung Hsing University Turbo Codes Coding and Communication Laboratory Dept. of Eectrica Engineering, Nationa Chung Hsing University Turbo codes 1 Chapter 12: Turbo Codes 1. Introduction 2. Turbo code encoder 3. Design of intereaver

More information

Some Measures for Asymmetry of Distributions

Some Measures for Asymmetry of Distributions Some Measures for Asymmetry of Distributions Georgi N. Boshnakov First version: 31 January 2006 Research Report No. 5, 2006, Probabiity and Statistics Group Schoo of Mathematics, The University of Manchester

More information

Convergence Property of the Iri-Imai Algorithm for Some Smooth Convex Programming Problems

Convergence Property of the Iri-Imai Algorithm for Some Smooth Convex Programming Problems Convergence Property of the Iri-Imai Agorithm for Some Smooth Convex Programming Probems S. Zhang Communicated by Z.Q. Luo Assistant Professor, Department of Econometrics, University of Groningen, Groningen,

More information

DIGITAL FILTER DESIGN OF IIR FILTERS USING REAL VALUED GENETIC ALGORITHM

DIGITAL FILTER DESIGN OF IIR FILTERS USING REAL VALUED GENETIC ALGORITHM DIGITAL FILTER DESIGN OF IIR FILTERS USING REAL VALUED GENETIC ALGORITHM MIKAEL NILSSON, MATTIAS DAHL AND INGVAR CLAESSON Bekinge Institute of Technoogy Department of Teecommunications and Signa Processing

More information
