A New Minimum Description Length
|
|
- Madeleine Mathews
- 6 years ago
- Views:
Transcription
1 A New Minimum Description Length Soosan Beheshti, Munther A. Dahleh Laboratory for Information an Decision Systems Massachusetts Institute of Technology Abstract The minimum escription length(mdl) metho is one of the pioneer methos of parametric orer estimation with a wie range of applications. We investigate the efinition of two-stage MDL for parametric linear moel sets an exhibit some rawbacks of the theory behin the existing MDL. We introuce a new escription length which is inspire by the Kolmogorov complexity principle. Introuction One shoul not increase, beyon what is necessary, the number of entities require to explain anything Occam s razor is a logical principle attribute to the meieval philosopher William of Occam. Applying the principal to the statement above, the main message is The simplest explanation is the best The principle states that one shoul not make more assumptions than the minimum neee. A computer scientific approach to this principle is manifeste in Kolmogorov complexity. Let y be a finite binary string an let U be a universal computer. Let l(y) enote the length of the string y. Let U(pg) enote the output of the computer U when presente with program pg. Then the Kolmogorov complexity K U (y) of a string y with respect to a universal computer U is efine as K U (y) = min l(pg) () pg : U(pg)=y The complexity of string y is calle the minimum escription length of y. For any other computer A we have K U (y) K A (y) + c A (2) where c A oes not epen on y. This inequality is known as universality of Kolmogorov complexity. Kolmogorov complexity is a moern notion of ranomness ealing with the quantity of information in iniviual obects; that is pointwise ranomness rather than average ranomness prouce by a ranom source. The two-stage MDL is one of the pioneer methos of computation of escription length which is suggeste base on this principle [9]. Here we aress the rawbacks of the metho of calculation of MDL an efine a new MDL base on the same Kolmogorov principle. We focus on the orer estimation problem for when the ata is generate by a linear moel with aitive noise. The new propose orer estimation metho is comparable to other well known orer estimation methos such as AIC [], BIC [0] an other forms of existing MDL methos [2]. 2 Problem Statement:Linear Moel Consier the class of parametric moel for which the output, y(n), is generate by y(n) = ȳ (n) + w(n) (3) where w(n) is aitive white Gaussian noise with zero mean an variance σ. Also, ȳ is the noiseless ata. Length N of the ata y N = [y(), y(n)] T, a member of ranom variable set Y N, is given. The noiseless ata is escribe with the basis family s i ȳ (n) = M s i (n)θ (i) = A SM (N)θS M (4) i= where s i (n) are boune numbers. Therefore, A SM (N) is a matrix of imension N M. The columns of A Sm (N), s i s, are inepenent an we have N s i 2 2 c (5) where c is a boune number an s i (n) 0 for all i M an n N. The value of M can be a part of the prior knowlege of the moel class. If it is unknown, the proper assumption of M = N is consiere. The parameter θs m = [θ (),, θ (M)] is a boune l -norm vector in R M.
2 A subset S m of Y N is of form x N = [A Sm (N)]θ Sm (6) where the columns of A Sm, a matrix of imension N m, are s i s which are in S m s i S m an θ Sm is a vector of length m for which θ Sm R m. (7) Inspire by the Kolmogorov complexity an the notion of minimal escription length for the string y, we want to search for the subspaces S m s which can provie the minimum escription length(dl) of the ata. Lets first follow principles of the existing two-stage MDL. In each subspace S m, of orer m, the escription length of y is escribe as the minimum coelength which can escribe y by an element of S m. For the coelength in this probabilistic setting the Shannon coing metho is use, therefore DL Sm (y) = min log f(y; g) (8) where the log is base on 2 an f(y ; g) is the probability istribution function(pf) of ranom variable Y when the mean is g an the aitive noise is w has the same characteristics which were efine for w[n] in (3). Note that in this scenario the probability istribution efine by each g in S m ( an therefore by a θ Sm, g = A Sm (N)θ S m ) is a Gaussian istribution with output of form x = g + w. (9) The least-square estimate of y in each subspace, which provies the output DL in subspace S m, is ŷ Sm = arg min g y 2. (0) The DL in each subspace is then efine as ( ) N ŷ DL(y; ŷ Sm ) = log 2πσ 2 Sm y 2 w + 2 log e. () which is the escription length of the noisy ata with an element of S m, ŷ Sm. The comparison of this escription length for ifferent subspaces always leas to the choice of the S m with largest possible imension, S N, for which the output error is zero. This is not a esire outcome, especially if we know that the unknown true number of parameters which can efine ȳ is less than a given M. To avoi this unwante outcome, two-stage MDL introuces a coelength which escribes elements of S m as well [9]. Here the assumption is that the length of the coe escribing any element of subspace S m is the same an is of orer m 2 log(n) (2) Therefore, the total coelength escribing y in subspace S m is efine as the coelength escribing elements of S m in (2) plus the escription length of the output given this estimate from () DL(y; S m ) m 2 log(n) + DL(y; ŷ S m ). (3) However, choosing the escription length for θ Sm s by coes of length m 2 log(n) seems to be an a-hoc metho. Partitioning the subspace of possible θ Sm s can be one with any other escritization per imension factor other than log(n). For this reason, it seems that the coelength for all θ Sm s can be of any form m log(aδ) when each imension has log Aδ elements. Another metho of achieving the escription length in (3) is given in [9]. Given a subspace S m, assume that M = m, then ȳ S m, (4) also assume that the ML estimator in this subspace, ŷ Sm approaches the true noiseless ata ȳ as N grows an N(ŷ Sm ȳ ) converges to a zero mean normal istribution. Then for any prefix coe on Y N, with coelength L(Y N ), the following inequality hols for all prefix coes except a set of prefix coes which will escribe later, E N L(Y N ) N H(f ȳ (Y N ))+( ɛ)m log(n) N, (5) where E(X) enotes the expecte value of ranom variable X an H(fȳ (Y N )) is the ifferential entropy of Y N. The inequality hols for all prefix coes except a set of prefix coes which are generate by the pfs which are generate by a subset of S m. The Lebesgue measure of this subset goes to zero as N grows 2 [9]. In [9] it is argue that the coelength in (3) is optimum since it can achieve the lower boun in the inequality in (5). However, in [4] it is shown that m log(n) in the inequality in (5) can be replace by a family of functions of N, log β(n) for which lim N ( ɛ/2) lim β(n) = (6) N (β(n)) ɛ ɛ/2 N m/2 = 0 (7) Therefore, with the same approach in [9] the coelength in (3) can be generalize to the form DL(y; S m ) m 2 β(n) + DL(y; ŷ S m ). (8) A coe maps the quantize version of elements of Y N to a set of binary coewors with length L(y N ). If no coewor is the prefix of any other, then it is uniquely eciable an is calle a prefix coe 2 Associate with any pf on Y N which is generate by ȳ an is nonzero over Y N, a prefix coelength is available which is proportional to log f(y; ȳ)
3 Hence, not onlylog(n) but any β(n) can be use to escribe the coelength. Another important issue is that in practical problems M is an upper boun for m, the true number of parameters which generate ȳ, an the challenge is to fin the true m using the observe ata. However, this coelength is provie with the assumption that ȳ is an element of a given S m. In practice the same coelength is use for all the subspaces. Even if ȳ is not an element of S m. For this subset is this a vali coelength? the answer to this question is not known. The only important fact known about this escription length is that it is a consistent criterion. So as the length of ata grows if m is smaller than M, the metho points to the correct m. We will iscuss this property of MDL more in the following sections. 2. New Description Length The comparison of the coelengths in () fails because of the following argument: Minimizing the escription length in (8) is the same as arg min log f(y; g) = arg max f(y; g) (9) which provies the ML estimate of ȳ in each subspace. As we iscusse previously, the ML estimation always points to a member of S N, which has the highest possible orer, as a perfect caniate. Therefore, comparison of the coelengths escribing y itself in each subspace is not a proper tool for comparison of the estimates ŷ Sm. Here y is not the string of ata, but y is the ata which is corrupte by an aitive noise. Therefore the coelength of the ML estimates has to be compare with the pf which is generate by the noiseless output, ȳ. To follow the Kolmogorov complexity is to compare the estimate of coelength base on the true, an unknown, pf. Therefore we consier the following escription length in each subspace: Definition The new escription length of ata in subset S m is efine as DL(y; S m ) DL(ŷ Sm ; ȳ ) (20) = log e ŷsm 2.(2) ȳ 2 Therefore, the new minimum escription length is obtaine for S m S m = arg min S m DL(y; S m ). (22) Calculation of this DL is provie in the following section. 2.2 Calculation of the New Description Length Calculation an comparison of the escription length efine in (2) for ifferent subspaces leas to comparison of the reconstruction error ŷ Sm ȳ 2. The available error in each subspace is ŷ Sm y 2. With this available ata we can valiate bouns on the escription length of ȳ S m probabilistically, where DL(ȳ S m ; ȳ ) (23) ȳ S m = E(ŷ Sm ). (24) Details of the valiation step is in [5]. For each subspace S m, the valiate upper an lower bouns are functions of orer of S m, m, Length of the ata, N, the noise variance, σw, 2 the ata s power an the valiation probability P. Next step is to provie probabilistic bouns on the esire DL, DL(ŷS m ; ȳ ). Probabilistic bouns are also provie in [5]. The bouns are functions of the confience probability P 2, Length of ata N, noise variance an the valiate upper an lower bouns on DL(ȳS m ; ȳ ) from step one. For large N, with P an P 2 approaching one, the upper an lower bouns on the esire escription length approach each other an provie a tight estimate for subspaces of low orer m << N [5]. Note that in [4] the calculation of another metho of orer estimation metho, minimum escription complexity (MDC), is provie. the close form criterion for MDC an the new MDL have the same structure for the consiere linear moel class for which the ata is generate by the structure given in (3). 3 Thresholing The existing information theoretic methos attempt to etermine the true parametric moel with m parameters. In most practical problems, m is not finite an we require to etect the optimum estimate for m, which represents the significant part of the noise less ata ȳ. Implementing the MDL metho in this situation, provies an estimate for m which is very sensitive to the variation in signal to noise ratio(snr) 3 an to the length of the output [7]. When the length of the true parameter is infinite, the consistent methos, such as MDL an BIC, point to a higher an higher orer as N an/or SNR grows. Some relate practical problems of these information theoretic methos are aresse in [] an [6]. When the true m is larger than the length of a very long ata, we propose implementing the new information theoretic metho of orer estimation. With this metho we can avoi pointing to higher an higher orers by using a threshol for the escription length. If a threshol ɛ is use for the minimum acceptable DL, 3 SNR= 0 log 0 Y N 2 2 Nσ
4 then we choose the smallest m for which the upperboun on DL is greater or equal to ɛ. An example of this approach is given in the simulation section. 3. MDL Thresholing Can thresholing be use for the two-stage MDL? In orer to make the escription length in (3) a vali coelength, which correspons to a prefix coe an satisfies the Kraft s inequality, it is suggeste to a a normalizing constant C(N) to the suggeste escription length DL Sm (y) = m log(n) + log f Sm (ŷ Sm ; y) + C (N). (25) In [9] it is argue that as N grows C (N)/N 0. However, note that as N grows the factor m log(n) N also goes to zero. For any fixe N, C (N) might be comparable with m log(n) N. Calculation of this normalizing factor is not trivial an is not available. Because of the structure of the DL, in general C (N) is a function of the noiseless output ȳ. Since C (N) is a fixe number in the comparison of ifferent subspaces an in calculation of the MDL this term is ignore. However, since C (N) changes for ifferent orer estimation settings, for example with the change of ȳ, implementation of a threshol is meaningless for this criterion. Here we prove that the use of threshol is meaningful for the new propose MDL. Assume that for any problem setting the escritization in output space Y is the same. We prove that the new escription length is a coelength of a prefix coe. A necessary an sufficient conition for a coe to be prefix is that it satisfies the Kraft s inequality. The new DL satisfies the Kraft s inequality by aing a normalize factor. The normalizing factor is not a function of y or ȳ, but a function of the orer of subset S m an escritization factor of Y. This proves the consistency of the coelengths with universality of Kolmogorov complexity in (2). Theorem The new escription length, efine in (2), satisfies the Kraft s inequality, i.e., correspons to a prefix coe, when a normalize factor C(N) is consiere where DL Sm (y) DL(ŷ Sm ; ȳ ) + C(N) = log f Sm (ŷ Sm ; ȳ + C(N). (26) ) C(N) = Ln2 Ln(δm 2πσw 2 (N m) ). (27) Note that although C(N) is a function of m, C(N)/N goes to zero much faster that the terms in the estimate of log f S m (ŷ Sm ;ȳ ) an it can be ignore for large enough N. Proof The escritize version of y is y an escritize version of ŷ Sm is ŷ S m. The inex is use for the escritize elements in Y N. Therefore, we have DL(ŷ S m (); ȳ ) (28) = log e ȳ ŷ Sm () 2 2 = Ln(2) Ln e ȳ ŷ S m () 2 2. To check the Kraft s inequality for each coe wor of length DL(ŷS m (); ȳ ) we have to show that D DL(ŷ Sm ();ȳ ) (29) where D is the size of alphabet resulte from escritizing the output space Y. Equivalently we can check for is ae such that ( ) LnD e DL(ŷ Sm ();ȳ ) (30) we know that (e ) LnD DL(ŷ Sm ();ȳ ) Note that δ m ( i e DL(ŷ Sm ();ȳ ) ) LnD e ȳ ŷ S m () 2 2 LnD Ln2 ( 2πσ ) m e ȳ ŷ Sm () 2 2σ w 2 (32) ( 2πσ ) m e ȳ ŷs m 2 2 y where δ is the precision per imension in the space Y, or equivalently in the space of the aitive noise W. On the other han, the error ȳs m ŷ Sm has a Gaussian istribution an we have ( 2πσ ) m e ȳ Sm ŷ Sm 2 2σ w 2 y =. (33) Therefore the error ȳ ŷ Sm also has a Gaussian istribution with same variance an a ifferent mean. Hence, we have ( 2πσ ) m e ȳ ŷsm 2 2σ w 2 y =. (34) (3)
5 Therefore e ȳ ŷsm 2 2σ y = ( 2πσ ) N m (35) 0 4 Therefore (e DL(ŷ Sm ();ȳ ) Ln(δm ) Ln2 2πσw 2 (N m) LnD ) Description Length 0 3 i (36) Hence, the normalizing factor is Ln2 Ln(δm 2πσw 2 (N m) ). 4 Linear Moels an Simulation Results Lets consier a linear time invariant(lti) system for which the matrix A Sm (N) in (3) is a Toeplitz matrix of input. The input of the system is a binary sequence of ±. The input is a sample of inepenent ientically istribute Bernoulli ranom process. Note that in this case the basis s i in (3) are asymptotically orthogonal. The input-output relationship is of form y(i) = i θ (k)x i k+ + w(i), (37) k= We use the microwave raio channel, chan0.mat, which is available at m Figure 2: Soli line is the escription length for SNR=0b, an N= : Probabilistic upperboun an lowerboun of the new escription length. Description Length m Figure 3: Soli line is the escription length for SNR=90b, an N= : probabilistic upperboun an lowerboun of the new escription length Figure : Real part of the first 60 taps of a microwave raio channel impulse response. Figure () shows the real part of the first 60 taps of the system impulse response. The simulation result for ata of length N = 300 an SNR=0b is shown in figure(2). Here the optimum impulse response length for ifferent methos are ˆm(AIC)=34 an ˆm(MDL)=32. The new propose criterion selects ˆm = 33. Figure(3) shows the upper an lower boun on the new DL for N=800, SNR=90b. The bouns on the DL are vali with probability 0.84 an valiation probability of In this case all the methos select an impulse response length which is larger than 30. With higher SNR an/or longer ata sample, all the methos choose a larger an larger length for the impulse response estimate. However, if we choose a threshol for the DL to be 0, the new criterion selects m = 37. With this threshol m 37 when SNR grows an/or the length of ata gets larger. Counting for the elay of the system, with the same threshol, the propose metho chooses the 0 taps of the impulse response estimate from 27 to 36 for optimum moelling of the system.. 5 Conclusion In this paper a new information theoretic metho of parametric estimation is introuce. By using the
6 available ata, we are able to probabilistically estimate tight bouns on the new criterion which is in form of a ata escription length. It is shown that the new escription length correspons to a prefix coe an is consistent with the universality of Kolmogorov complexity. References [] H. Akaike. A New Look at the Statistical Moel Ientification. IEEE Trans. on Automatic Control, vol. AC-9, pp , 974. [2] Y. Barron, J. Rissanen, Bin Yu. The Minimum Description Length Principle in Coing an Moeling. IEEE Trans. on Information theory, vol 44, pp , Oct [3] S. Beheshti an M.A. Dahleh. On moel quality evaluation of stable LTI systems. Proceeings of the 39th IEEE Conference on Decision an Control, pp , [4] S. Beheshti. Minimum Description Complexity. Thesis, MIT, September [5] S. Beheshti. an M.A. Dahleh. New Information Theoretic Approach to Orer Estimation Problem. 3th IFAC Symposium on System Ientification, August [6] W. Chen, K.M.Wong, an J. Reilly. Detection of the Number of Signals: A Preicte Eigen-Threshol Approach. IEEE Trans on Signal processing, vol.39, pp , May 99 [7] A.P. Liavas, P.A. Regalia, an J. Delmas. Blin Channel Approximation: Effective Channel Orer Estimation. IEEE Trans. on Signal Processing, vol.47, pp , 999. [8] L. Lung. System Ientification: Theory for the user. NJ: Prentice-Hall, 998. [9] J. Rissanen. Universal Coing, Information, Preiction, an Estimation. IEEE Trans. on Information Theory, vol.it30, pp , 984. [0] G. Schwarz. Estimating The Dimension of a Moel. The Annals of Statistics, vol.6, pp , 978. [] K.M.Wong, Q.T. Zhang, J. Reilly an P.C. Yip. On Information Theoretic Criteria for Determining the Number of Signals in High Resolution Array Processing. IEEE Trans. Acoust. Speech, Signal processing, vol.38, pp , November 990
A NEW INFORMATION THEORETIC APPROACH TO ORDER ESTIMATION PROBLEM. Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.
A EW IFORMATIO THEORETIC APPROACH TO ORDER ESTIMATIO PROBLEM Soosan Beheshti Munther A. Dahleh Massachusetts Institute of Technology, Cambridge, MA 0239, U.S.A. Abstract: We introduce a new method of model
More informationLTI Systems, Additive Noise, and Order Estimation
LTI Systems, Additive oise, and Order Estimation Soosan Beheshti, Munther A. Dahleh Laboratory for Information and Decision Systems Department of Electrical Engineering and Computer Science Massachusetts
More informationTime-of-Arrival Estimation in Non-Line-Of-Sight Environments
2 Conference on Information Sciences an Systems, The Johns Hopkins University, March 2, 2 Time-of-Arrival Estimation in Non-Line-Of-Sight Environments Sinan Gezici, Hisashi Kobayashi an H. Vincent Poor
More informationLeast-Squares Regression on Sparse Spaces
Least-Squares Regression on Sparse Spaces Yuri Grinberg, Mahi Milani Far, Joelle Pineau School of Computer Science McGill University Montreal, Canaa {ygrinb,mmilan1,jpineau}@cs.mcgill.ca 1 Introuction
More informationSurvey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013
Survey Sampling Kosuke Imai Department of Politics, Princeton University February 19, 2013 Survey sampling is one of the most commonly use ata collection methos for social scientists. We begin by escribing
More informationTopic 7: Convergence of Random Variables
Topic 7: Convergence of Ranom Variables Course 003, 2016 Page 0 The Inference Problem So far, our starting point has been a given probability space (S, F, P). We now look at how to generate information
More informationLecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012
CS-6 Theory Gems November 8, 0 Lecture Lecturer: Alesaner Mąry Scribes: Alhussein Fawzi, Dorina Thanou Introuction Toay, we will briefly iscuss an important technique in probability theory measure concentration
More informationu!i = a T u = 0. Then S satisfies
Deterministic Conitions for Subspace Ientifiability from Incomplete Sampling Daniel L Pimentel-Alarcón, Nigel Boston, Robert D Nowak University of Wisconsin-Maison Abstract Consier an r-imensional subspace
More information510 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 58, NO. 2, FEBRUARY X/$ IEEE
510 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 58, NO. 2, FEBRUARY 2010 Noisy Data and Impulse Response Estimation Soosan Beheshti, Senior Member, IEEE, and Munther A. Dahleh, Fellow, IEEE Abstract This
More informationRobust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k
A Proof of Lemma 2 B Proof of Lemma 3 Proof: Since the support of LL istributions is R, two such istributions are equivalent absolutely continuous with respect to each other an the ivergence is well-efine
More informationSeparation of Variables
Physics 342 Lecture 1 Separation of Variables Lecture 1 Physics 342 Quantum Mechanics I Monay, January 25th, 2010 There are three basic mathematical tools we nee, an then we can begin working on the physical
More informationSYNCHRONOUS SEQUENTIAL CIRCUITS
CHAPTER SYNCHRONOUS SEUENTIAL CIRCUITS Registers an counters, two very common synchronous sequential circuits, are introuce in this chapter. Register is a igital circuit for storing information. Contents
More informationLeaving Randomness to Nature: d-dimensional Product Codes through the lens of Generalized-LDPC codes
Leaving Ranomness to Nature: -Dimensional Prouct Coes through the lens of Generalize-LDPC coes Tavor Baharav, Kannan Ramchanran Dept. of Electrical Engineering an Computer Sciences, U.C. Berkeley {tavorb,
More informationTutorial on Maximum Likelyhood Estimation: Parametric Density Estimation
Tutorial on Maximum Likelyhoo Estimation: Parametric Density Estimation Suhir B Kylasa 03/13/2014 1 Motivation Suppose one wishes to etermine just how biase an unfair coin is. Call the probability of tossing
More informationComputing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions
Working Paper 2013:5 Department of Statistics Computing Exact Confience Coefficients of Simultaneous Confience Intervals for Multinomial Proportions an their Functions Shaobo Jin Working Paper 2013:5
More informationAn Analytical Expression of the Probability of Error for Relaying with Decode-and-forward
An Analytical Expression of the Probability of Error for Relaying with Decoe-an-forwar Alexanre Graell i Amat an Ingmar Lan Department of Electronics, Institut TELECOM-TELECOM Bretagne, Brest, France Email:
More informationTable of Common Derivatives By David Abraham
Prouct an Quotient Rules: Table of Common Derivatives By Davi Abraham [ f ( g( ] = [ f ( ] g( + f ( [ g( ] f ( = g( [ f ( ] g( g( f ( [ g( ] Trigonometric Functions: sin( = cos( cos( = sin( tan( = sec
More informationCapacity Analysis of MIMO Systems with Unknown Channel State Information
Capacity Analysis of MIMO Systems with Unknown Channel State Information Jun Zheng an Bhaskar D. Rao Dept. of Electrical an Computer Engineering University of California at San Diego e-mail: juzheng@ucs.eu,
More informationQubit channels that achieve capacity with two states
Qubit channels that achieve capacity with two states Dominic W. Berry Department of Physics, The University of Queenslan, Brisbane, Queenslan 4072, Australia Receive 22 December 2004; publishe 22 March
More informationLower bounds on Locality Sensitive Hashing
Lower bouns on Locality Sensitive Hashing Rajeev Motwani Assaf Naor Rina Panigrahy Abstract Given a metric space (X, X ), c 1, r > 0, an p, q [0, 1], a istribution over mappings H : X N is calle a (r,
More information1. Aufgabenblatt zur Vorlesung Probability Theory
24.10.17 1. Aufgabenblatt zur Vorlesung By (Ω, A, P ) we always enote the unerlying probability space, unless state otherwise. 1. Let r > 0, an efine f(x) = 1 [0, [ (x) exp( r x), x R. a) Show that p f
More informationLogarithmic spurious regressions
Logarithmic spurious regressions Robert M. e Jong Michigan State University February 5, 22 Abstract Spurious regressions, i.e. regressions in which an integrate process is regresse on another integrate
More informationIntroduction to the Vlasov-Poisson system
Introuction to the Vlasov-Poisson system Simone Calogero 1 The Vlasov equation Consier a particle with mass m > 0. Let x(t) R 3 enote the position of the particle at time t R an v(t) = ẋ(t) = x(t)/t its
More informationError Floors in LDPC Codes: Fast Simulation, Bounds and Hardware Emulation
Error Floors in LDPC Coes: Fast Simulation, Bouns an Harware Emulation Pamela Lee, Lara Dolecek, Zhengya Zhang, Venkat Anantharam, Borivoje Nikolic, an Martin J. Wainwright EECS Department University of
More informationConvergence of Random Walks
Chapter 16 Convergence of Ranom Walks This lecture examines the convergence of ranom walks to the Wiener process. This is very important both physically an statistically, an illustrates the utility of
More informationA PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks
A PAC-Bayesian Approach to Spectrally-Normalize Margin Bouns for Neural Networks Behnam Neyshabur, Srinah Bhojanapalli, Davi McAllester, Nathan Srebro Toyota Technological Institute at Chicago {bneyshabur,
More informationarxiv: v1 [cs.it] 21 Aug 2017
Performance Gains of Optimal Antenna Deployment for Massive MIMO ystems Erem Koyuncu Department of Electrical an Computer Engineering, University of Illinois at Chicago arxiv:708.06400v [cs.it] 2 Aug 207
More information'HVLJQ &RQVLGHUDWLRQ LQ 0DWHULDO 6HOHFWLRQ 'HVLJQ 6HQVLWLYLW\,1752'8&7,21
Large amping in a structural material may be either esirable or unesirable, epening on the engineering application at han. For example, amping is a esirable property to the esigner concerne with limiting
More informationThis module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics
This moule is part of the Memobust Hanbook on Methoology of Moern Business Statistics 26 March 2014 Metho: Balance Sampling for Multi-Way Stratification Contents General section... 3 1. Summary... 3 2.
More informationConcentration of Measure Inequalities for Compressive Toeplitz Matrices with Applications to Detection and System Identification
Concentration of Measure Inequalities for Compressive Toeplitz Matrices with Applications to Detection an System Ientification Borhan M Sananaji, Tyrone L Vincent, an Michael B Wakin Abstract In this paper,
More informationA Weak First Digit Law for a Class of Sequences
International Mathematical Forum, Vol. 11, 2016, no. 15, 67-702 HIKARI Lt, www.m-hikari.com http://x.oi.org/10.1288/imf.2016.6562 A Weak First Digit Law for a Class of Sequences M. A. Nyblom School of
More informationDiscrete Mathematics
Discrete Mathematics 309 (009) 86 869 Contents lists available at ScienceDirect Discrete Mathematics journal homepage: wwwelseviercom/locate/isc Profile vectors in the lattice of subspaces Dániel Gerbner
More informationOn the Behavior of Information Theoretic Criteria for Model Order Selection
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 8, AUGUST 2001 1689 On the Behavior of Information Theoretic Criteria for Model Order Selection Athanasios P. Liavas, Member, IEEE, and Phillip A. Regalia,
More informationA Course in Machine Learning
A Course in Machine Learning Hal Daumé III 12 EFFICIENT LEARNING So far, our focus has been on moels of learning an basic algorithms for those moels. We have not place much emphasis on how to learn quickly.
More informationSpace-time Linear Dispersion Using Coordinate Interleaving
Space-time Linear Dispersion Using Coorinate Interleaving Jinsong Wu an Steven D Blostein Department of Electrical an Computer Engineering Queen s University, Kingston, Ontario, Canaa, K7L3N6 Email: wujs@ieeeorg
More informationNecessary and Sufficient Conditions for Sketched Subspace Clustering
Necessary an Sufficient Conitions for Sketche Subspace Clustering Daniel Pimentel-Alarcón, Laura Balzano 2, Robert Nowak University of Wisconsin-Maison, 2 University of Michigan-Ann Arbor Abstract This
More informationState observers and recursive filters in classical feedback control theory
State observers an recursive filters in classical feeback control theory State-feeback control example: secon-orer system Consier the riven secon-orer system q q q u x q x q x x x x Here u coul represent
More informationRelative Entropy and Score Function: New Information Estimation Relationships through Arbitrary Additive Perturbation
Relative Entropy an Score Function: New Information Estimation Relationships through Arbitrary Aitive Perturbation Dongning Guo Department of Electrical Engineering & Computer Science Northwestern University
More informationEstimation of the Maximum Domination Value in Multi-Dimensional Data Sets
Proceeings of the 4th East-European Conference on Avances in Databases an Information Systems ADBIS) 200 Estimation of the Maximum Domination Value in Multi-Dimensional Data Sets Eleftherios Tiakas, Apostolos.
More informationLower Bounds for the Smoothed Number of Pareto optimal Solutions
Lower Bouns for the Smoothe Number of Pareto optimal Solutions Tobias Brunsch an Heiko Röglin Department of Computer Science, University of Bonn, Germany brunsch@cs.uni-bonn.e, heiko@roeglin.org Abstract.
More informationSome Examples. Uniform motion. Poisson processes on the real line
Some Examples Our immeiate goal is to see some examples of Lévy processes, an/or infinitely-ivisible laws on. Uniform motion Choose an fix a nonranom an efine X := for all (1) Then, {X } is a [nonranom]
More informationThe canonical controllers and regular interconnection
Systems & Control Letters ( www.elsevier.com/locate/sysconle The canonical controllers an regular interconnection A.A. Julius a,, J.C. Willems b, M.N. Belur c, H.L. Trentelman a Department of Applie Mathematics,
More informationA Review of Multiple Try MCMC algorithms for Signal Processing
A Review of Multiple Try MCMC algorithms for Signal Processing Luca Martino Image Processing Lab., Universitat e València (Spain) Universia Carlos III e Mari, Leganes (Spain) Abstract Many applications
More informationNOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy,
NOTES ON EULER-BOOLE SUMMATION JONATHAN M BORWEIN, NEIL J CALKIN, AND DANTE MANNA Abstract We stuy a connection between Euler-MacLaurin Summation an Boole Summation suggeste in an AMM note from 196, which
More informationParameter estimation: A new approach to weighting a priori information
Parameter estimation: A new approach to weighting a priori information J.L. Mea Department of Mathematics, Boise State University, Boise, ID 83725-555 E-mail: jmea@boisestate.eu Abstract. We propose a
More information. Using a multinomial model gives us the following equation for P d. , with respect to same length term sequences.
S 63 Lecture 8 2/2/26 Lecturer Lillian Lee Scribes Peter Babinski, Davi Lin Basic Language Moeling Approach I. Special ase of LM-base Approach a. Recap of Formulas an Terms b. Fixing θ? c. About that Multinomial
More informationOn combinatorial approaches to compressed sensing
On combinatorial approaches to compresse sensing Abolreza Abolhosseini Moghaam an Hayer Raha Department of Electrical an Computer Engineering, Michigan State University, East Lansing, MI, U.S. Emails:{abolhos,raha}@msu.eu
More informationSchrödinger s equation.
Physics 342 Lecture 5 Schröinger s Equation Lecture 5 Physics 342 Quantum Mechanics I Wenesay, February 3r, 2010 Toay we iscuss Schröinger s equation an show that it supports the basic interpretation of
More information7.1 Support Vector Machine
67577 Intro. to Machine Learning Fall semester, 006/7 Lecture 7: Support Vector Machines an Kernel Functions II Lecturer: Amnon Shashua Scribe: Amnon Shashua 7. Support Vector Machine We return now to
More informationLevy Process and Infinitely Divisible Law
Stat205B: Probability Theory (Spring 2003) Lecture: 26 Levy Process an Infinitely Divisible Law Lecturer: James W. Pitman Scribe: Bo Li boli@stat.berkeley.eu Levy Processes an Infinitely Divisible Law
More informationResearch Article When Inflation Causes No Increase in Claim Amounts
Probability an Statistics Volume 2009, Article ID 943926, 10 pages oi:10.1155/2009/943926 Research Article When Inflation Causes No Increase in Claim Amounts Vytaras Brazauskas, 1 Bruce L. Jones, 2 an
More informationWhy Bernstein Polynomials Are Better: Fuzzy-Inspired Justification
Why Bernstein Polynomials Are Better: Fuzzy-Inspire Justification Jaime Nava 1, Olga Kosheleva 2, an Vlaik Kreinovich 3 1,3 Department of Computer Science 2 Department of Teacher Eucation University of
More informationTHE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE
Journal of Soun an Vibration (1996) 191(3), 397 414 THE VAN KAMPEN EXPANSION FOR LINKED DUFFING LINEAR OSCILLATORS EXCITED BY COLORED NOISE E. M. WEINSTEIN Galaxy Scientific Corporation, 2500 English Creek
More informationA new proof of the sharpness of the phase transition for Bernoulli percolation on Z d
A new proof of the sharpness of the phase transition for Bernoulli percolation on Z Hugo Duminil-Copin an Vincent Tassion October 8, 205 Abstract We provie a new proof of the sharpness of the phase transition
More informationTEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS. Yannick DEVILLE
TEMPORAL AND TIME-FREQUENCY CORRELATION-BASED BLIND SOURCE SEPARATION METHODS Yannick DEVILLE Université Paul Sabatier Laboratoire Acoustique, Métrologie, Instrumentation Bât. 3RB2, 8 Route e Narbonne,
More informationarxiv: v4 [math.pr] 27 Jul 2016
The Asymptotic Distribution of the Determinant of a Ranom Correlation Matrix arxiv:309768v4 mathpr] 7 Jul 06 AM Hanea a, & GF Nane b a Centre of xcellence for Biosecurity Risk Analysis, University of Melbourne,
More informationOptimal CDMA Signatures: A Finite-Step Approach
Optimal CDMA Signatures: A Finite-Step Approach Joel A. Tropp Inst. for Comp. Engr. an Sci. (ICES) 1 University Station C000 Austin, TX 7871 jtropp@ices.utexas.eu Inerjit. S. Dhillon Dept. of Comp. Sci.
More informationMath 342 Partial Differential Equations «Viktor Grigoryan
Math 342 Partial Differential Equations «Viktor Grigoryan 6 Wave equation: solution In this lecture we will solve the wave equation on the entire real line x R. This correspons to a string of infinite
More informationHarmonic Modelling of Thyristor Bridges using a Simplified Time Domain Method
1 Harmonic Moelling of Thyristor Briges using a Simplifie Time Domain Metho P. W. Lehn, Senior Member IEEE, an G. Ebner Abstract The paper presents time omain methos for harmonic analysis of a 6-pulse
More informationMulti-edge Optimization of Low-Density Parity-Check Codes for Joint Source-Channel Coding
Multi-ege Optimization of Low-Density Parity-Check Coes for Joint Source-Channel Coing H. V. Beltrão Neto an W. Henkel Jacobs University Bremen Campus Ring 1 D-28759 Bremen, Germany Email: {h.beltrao,
More informationExponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity
Preprints of the 9th Worl Congress The International Feeration of Automatic Control Cape Town, South Africa. August -9, Exponential Tracking Control of Nonlinear Systems with Actuator Nonlinearity Zhengqiang
More informationLinear First-Order Equations
5 Linear First-Orer Equations Linear first-orer ifferential equations make up another important class of ifferential equations that commonly arise in applications an are relatively easy to solve (in theory)
More informationThermal conductivity of graded composites: Numerical simulations and an effective medium approximation
JOURNAL OF MATERIALS SCIENCE 34 (999)5497 5503 Thermal conuctivity of grae composites: Numerical simulations an an effective meium approximation P. M. HUI Department of Physics, The Chinese University
More informationAn Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback
Journal of Machine Learning Research 8 07) - Submitte /6; Publishe 5/7 An Optimal Algorithm for Banit an Zero-Orer Convex Optimization with wo-point Feeback Oha Shamir Department of Computer Science an
More information18 EVEN MORE CALCULUS
8 EVEN MORE CALCULUS Chapter 8 Even More Calculus Objectives After stuing this chapter you shoul be able to ifferentiate an integrate basic trigonometric functions; unerstan how to calculate rates of change;
More informationOn colour-blind distinguishing colour pallets in regular graphs
J Comb Optim (2014 28:348 357 DOI 10.1007/s10878-012-9556-x On colour-blin istinguishing colour pallets in regular graphs Jakub Przybyło Publishe online: 25 October 2012 The Author(s 2012. This article
More informationOn the Value of Partial Information for Learning from Examples
JOURNAL OF COMPLEXITY 13, 509 544 (1998) ARTICLE NO. CM970459 On the Value of Partial Information for Learning from Examples Joel Ratsaby* Department of Electrical Engineering, Technion, Haifa, 32000 Israel
More informationunder the null hypothesis, the sign test (with continuity correction) rejects H 0 when α n + n 2 2.
Assignment 13 Exercise 8.4 For the hypotheses consiere in Examples 8.12 an 8.13, the sign test is base on the statistic N + = #{i : Z i > 0}. Since 2 n(n + /n 1) N(0, 1) 2 uner the null hypothesis, the
More informationThe Entropy of Random Finite Sets
The Entropy of Ranom Finite Sets Mohamma Rezaeian an Ba-Ngu Vo Department of Electrical an Electronic Engineering, University of Melbourne, Victoria, 300, Australia rezaeian, b.vo@ee.unimelb.eu.au Abstract
More informationLeast Distortion of Fixed-Rate Vector Quantizers. High-Resolution Analysis of. Best Inertial Profile. Zador's Formula Z-1 Z-2
High-Resolution Analysis of Least Distortion of Fixe-Rate Vector Quantizers Begin with Bennett's Integral D 1 M 2/k Fin best inertial profile Zaor's Formula m(x) λ 2/k (x) f X(x) x Fin best point ensity
More informationOn Characterizing the Delay-Performance of Wireless Scheduling Algorithms
On Characterizing the Delay-Performance of Wireless Scheuling Algorithms Xiaojun Lin Center for Wireless Systems an Applications School of Electrical an Computer Engineering, Purue University West Lafayette,
More informationFunction Spaces. 1 Hilbert Spaces
Function Spaces A function space is a set of functions F that has some structure. Often a nonparametric regression function or classifier is chosen to lie in some function space, where the assume structure
More informationClosed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device
Close an Open Loop Optimal Control of Buffer an Energy of a Wireless Device V. S. Borkar School of Technology an Computer Science TIFR, umbai, Inia. borkar@tifr.res.in A. A. Kherani B. J. Prabhu INRIA
More informationOPTIMAL CONTROL PROBLEM FOR PROCESSES REPRESENTED BY STOCHASTIC SEQUENTIAL MACHINE
OPTIMA CONTRO PROBEM FOR PROCESSES REPRESENTED BY STOCHASTIC SEQUENTIA MACHINE Yaup H. HACI an Muhammet CANDAN Department of Mathematics, Canaale Onseiz Mart University, Canaale, Turey ABSTRACT In this
More informationImplicit Differentiation
Implicit Differentiation Thus far, the functions we have been concerne with have been efine explicitly. A function is efine explicitly if the output is given irectly in terms of the input. For instance,
More informationCascaded redundancy reduction
Network: Comput. Neural Syst. 9 (1998) 73 84. Printe in the UK PII: S0954-898X(98)88342-5 Cascae reunancy reuction Virginia R e Sa an Geoffrey E Hinton Department of Computer Science, University of Toronto,
More informationStopping-Set Enumerator Approximations for Finite-Length Protograph LDPC Codes
Stopping-Set Enumerator Approximations for Finite-Length Protograph LDPC Coes Kaiann Fu an Achilleas Anastasopoulos Electrical Engineering an Computer Science Dept. University of Michigan, Ann Arbor, MI
More informationCalculus of Variations
16.323 Lecture 5 Calculus of Variations Calculus of Variations Most books cover this material well, but Kirk Chapter 4 oes a particularly nice job. x(t) x* x*+ αδx (1) x*- αδx (1) αδx (1) αδx (1) t f t
More informationMulti-View Clustering via Canonical Correlation Analysis
Technical Report TTI-TR-2008-5 Multi-View Clustering via Canonical Correlation Analysis Kamalika Chauhuri UC San Diego Sham M. Kakae Toyota Technological Institute at Chicago ABSTRACT Clustering ata in
More informationMath Notes on differentials, the Chain Rule, gradients, directional derivative, and normal vectors
Math 18.02 Notes on ifferentials, the Chain Rule, graients, irectional erivative, an normal vectors Tangent plane an linear approximation We efine the partial erivatives of f( xy, ) as follows: f f( x+
More informationA Modification of the Jarque-Bera Test. for Normality
Int. J. Contemp. Math. Sciences, Vol. 8, 01, no. 17, 84-85 HIKARI Lt, www.m-hikari.com http://x.oi.org/10.1988/ijcms.01.9106 A Moification of the Jarque-Bera Test for Normality Moawa El-Fallah Ab El-Salam
More information6 General properties of an autonomous system of two first order ODE
6 General properties of an autonomous system of two first orer ODE Here we embark on stuying the autonomous system of two first orer ifferential equations of the form ẋ 1 = f 1 (, x 2 ), ẋ 2 = f 2 (, x
More informationALGEBRAIC AND ANALYTIC PROPERTIES OF ARITHMETIC FUNCTIONS
ALGEBRAIC AND ANALYTIC PROPERTIES OF ARITHMETIC FUNCTIONS MARK SCHACHNER Abstract. When consiere as an algebraic space, the set of arithmetic functions equippe with the operations of pointwise aition an
More informationQuantum mechanical approaches to the virial
Quantum mechanical approaches to the virial S.LeBohec Department of Physics an Astronomy, University of Utah, Salt Lae City, UT 84112, USA Date: June 30 th 2015 In this note, we approach the virial from
More informationOptimal Cooperative Spectrum Sensing in Cognitive Sensor Networks
Optimal Cooperative Spectrum Sensing in Cognitive Sensor Networks Hai Ngoc Pham, an Zhang, Paal E. Engelsta,,3, Tor Skeie,, Frank Eliassen, Department of Informatics, University of Oslo, Norway Simula
More informationThe Principle of Least Action
Chapter 7. The Principle of Least Action 7.1 Force Methos vs. Energy Methos We have so far stuie two istinct ways of analyzing physics problems: force methos, basically consisting of the application of
More informationCode_Aster. Detection of the singularities and computation of a card of size of elements
Titre : Détection es singularités et calcul une carte [...] Date : 0/0/0 Page : /6 Responsable : Josselin DLMAS Clé : R4.0.04 Révision : 9755 Detection of the singularities an computation of a car of size
More informationOn non-antipodal binary completely regular codes
Discrete Mathematics 308 (2008) 3508 3525 www.elsevier.com/locate/isc On non-antipoal binary completely regular coes J. Borges a, J. Rifà a, V.A. Zinoviev b a Department of Information an Communications
More informationBalancing Expected and Worst-Case Utility in Contracting Models with Asymmetric Information and Pooling
Balancing Expecte an Worst-Case Utility in Contracting Moels with Asymmetric Information an Pooling R.B.O. erkkamp & W. van en Heuvel & A.P.M. Wagelmans Econometric Institute Report EI2018-01 9th January
More informationOn the Surprising Behavior of Distance Metrics in High Dimensional Space
On the Surprising Behavior of Distance Metrics in High Dimensional Space Charu C. Aggarwal, Alexaner Hinneburg 2, an Daniel A. Keim 2 IBM T. J. Watson Research Center Yortown Heights, NY 0598, USA. charu@watson.ibm.com
More informationOptimal Signal Detection for False Track Discrimination
Optimal Signal Detection for False Track Discrimination Thomas Hanselmann Darko Mušicki Dept. of Electrical an Electronic Eng. Dept. of Electrical an Electronic Eng. The University of Melbourne The University
More informationInvariant Extended Kalman Filter: Theory and application to a velocity-aided estimation problem
Invariant Extene Kalman Filter: Theory an application to a velocity-aie estimation problem S. Bonnabel (Mines ParisTech) Joint work with P. Martin (Mines ParisTech) E. Salaun (Georgia Institute of Technology)
More informationHomework 2 Solutions EM, Mixture Models, PCA, Dualitys
Homewor Solutions EM, Mixture Moels, PCA, Dualitys CMU 0-75: Machine Learning Fall 05 http://www.cs.cmu.eu/~bapoczos/classes/ml075_05fall/ OUT: Oct 5, 05 DUE: Oct 9, 05, 0:0 AM An EM algorithm for a Mixture
More informationPolynomial Inclusion Functions
Polynomial Inclusion Functions E. e Weert, E. van Kampen, Q. P. Chu, an J. A. Muler Delft University of Technology, Faculty of Aerospace Engineering, Control an Simulation Division E.eWeert@TUDelft.nl
More informationNew Bounds for Distributed Storage Systems with Secure Repair
New Bouns for Distribute Storage Systems with Secure Repair Ravi Tanon 1 an Soheil Mohajer 1 Discovery Analytics Center & Department of Computer Science, Virginia Tech, Blacksburg, VA Department of Electrical
More information15.1 Upper bound via Sudakov minorization
ECE598: Information-theoretic methos in high-imensional statistics Spring 206 Lecture 5: Suakov, Maurey, an uality of metric entropy Lecturer: Yihong Wu Scribe: Aolin Xu, Mar 7, 206 [E. Mar 24] In this
More informationLevel Construction of Decision Trees in a Partition-based Framework for Classification
Level Construction of Decision Trees in a Partition-base Framework for Classification Y.Y. Yao, Y. Zhao an J.T. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canaa S4S
More informationQuantile function expansion using regularly varying functions
Quantile function expansion using regularly varying functions arxiv:705.09494v [math.st] 9 Aug 07 Thomas Fung a, an Eugene Seneta b a Department of Statistics, Macquarie University, NSW 09, Australia b
More informationA note on asymptotic formulae for one-dimensional network flow problems Carlos F. Daganzo and Karen R. Smilowitz
A note on asymptotic formulae for one-imensional network flow problems Carlos F. Daganzo an Karen R. Smilowitz (to appear in Annals of Operations Research) Abstract This note evelops asymptotic formulae
More informationSpring 2016 Network Science
Spring 206 Network Science Sample Problems for Quiz I Problem [The Application of An one-imensional Poisson Process] Suppose that the number of typographical errors in a new text is Poisson istribute with
More information