arxiv: v1 [cs.sy] 12 Dec 2016

Size: px
Start display at page:

Download "arxiv: v1 [cs.sy] 12 Dec 2016"

Transcription

1 Approximate Recursive Identification of Autoregressive Systems with Sewed Innovations arxiv: v1 [cs.sy] 1 Dec 016 Henri Nurminen Dept. of Automation Science and Engineering Tampere University of Technology Tampere, Finland henri.nurminen@tut.fi Abstract Tohid Ardeshiri Department of Engineering University of Cambridge Cambridge, UK ta417@cam.ac.u We propose a novel recursive system identification algorithm for linear autoregressive systems with sewed innovations. The algorithm is based on the variational Bayes approximation of the model with a multivariate normal prior for the model coefficients, multivariate sew-normally distributed innovations, and matrix-variatenormal inverse-wishart prior for the parameters of the innovation distribution. The proposed algorithm simultaneously estimates the model coefficients as well as the parameters of the innovation distribution, which are both allowed to be slowly time-varying. Through computer simulations, we compare the proposed method with a variational algorithm based on the normally-distributed innovations model, and show that modelling the sewness can provide improvement in identification accuracy. 1 Introduction Many systems produce datasets with sewed noise distribution. Sewness means asymmetry. Positive sewness, for example, intuitively means producing large positive deviations from the median value more frequently than large negative deviations. For instance, some financial data sets show negative sewness because large drops tend to be more frequent than large upsurges [1,, 3, 4]. Wireless networ based positioning often uses time delay measurement as a distance, but non-line-of-sight can produce large positive outliers, so the error distribution becomes positively sewed [5, 6]. One statistical model for sewed error distributions is the sew normal distribution and its multivariate generalisation [7, 8]. The posterior distribution of a normal prior and sew-normal measurement noise model is not analytically tractable. However, the distribution admits a hierarchical formulation whose favorable conjugacy properties enable efficient parameter estimation using the expectation maximisation (EM) algorithm [9, 10, 11] and approximate Bayesian time-series filtering and smoothing based on the variational Bayes (VB) approximation [1, 13]. This paper studies autoregressive (AR) models, where the measurement is modelled to be a linear function of n AR (the model order) previous measurements plus an independent random noise term referred to as the innovation. When the AR coefficients and/or the conditionally sew-normal innovation distribution s statistics are time-varying or they need to be identified online, recursive identification methods are used [14]. In this paper we propose a novel recursive system identification algorithm for AR models with sew-normally distributed measurement noise with unnown possibly slowly time-varying scale and sewness. The proposed approximation is based on a VB approximation. 30th Conference on Neural Information Processing Systems (NIPS 016), Barcelona, Spain.

2 Problem formulation Sew normal distribution is an asymmetric generalization of the normal distribution originally proposed by Azzalini [7]. Its multivariate version was later introduced by Azzalini and Dalla Valle [8]. The version that is used in this report is the canonical fundamental sew normal distribution (CFUSN) introduced by Arellano Valle and Genton [15]. However, we adopt a different parametrization following the guidelines of the canonical fundamental sew t-distribution s parametrization in [16] to obtain a suitable analytical tractability. The probability density function (PDF) of this sew normal distribution z SN(µ, R, ) is p(z) = nz N(z; µ π 1, Ω) F N( T Ω 1 (z µ + π 1); 0, I T Ω 1 ), (1) where 1 is a vector of ones, µ is a location parameter, Ω = R + T, and F N is the cumulative distribution function of the multivariate normal distribution. R R nz nz (symmetric positivedefinite, spd) and R nz nz are shape matrices that determine the scale and sewness, and the sign and structure of determine the direction of sewness as explained in [16]. Examples of the PDF in negatively sewed, symmetric, and positively sewed cases are given in Fig. 1. The moments of this multivariate sew normal distribution given the shape matrices are E[z] = µ, V[z] = R + π T. () Compared to the formulation of [16], we shift the distribution with /π1 so that the mean of the distribution does not depend on. This ensures that the proposed algorithm identifies as a measure of sewness, not as a measure of location. PDF negative sew normal positive sew z Figure 1: The PDFs of negatively-sewed, symmetric, and positively-sewed normal distributions. Each distribution has mean zero and variance one. We formulate the AR coefficient estimation problem as the linear state-space model with the measurement noise being sew-normally distributed conditional on the unnown slowly-varying noise parameters R and p(x 1 ) = N(x 1 ; x 1 0, P 1 0) (3a) x = x 1 + w 1, z = C x + e, w 1 iid N(0, Q 1 ) e iid SN(µ, R, ), where x R nar is the vector of AR coefficients, Q R nar nar (spd) is the process noise covariance matrix that is assumed nown and is thus an algorithm parameter, z R nz is the measurement, C R nz nar = [ z 1 z z nar ] is the measurement model matrix given by n AR previous measurements, and {w R nar } K =1 and {e R nz } K =1 are mutually independent process and measurement noise sequences. 3 Proposed algorithm 3.1 Measurement update Conditional on the parameters R and, the sew-normal random variable e R, SN(µ, R, ) has the hierarchical formulation [9] e u, R, N(µ + (u π 1), R ) (4a) u N + (0, I), (3b) (3c) (4b)

3 where N + is the multivariate normal distribution truncated into positive orthant. To obtain the necessary conjugacy properties, let us assign the matrix-variate-normal inverse-wishart (MVNIW) prior distribution to the joint random variable (R, ): p(r, ) = N( ; 1, R V 1 ) IW(R ; Ψ 1, ν 1 ), (5) where 1 R nz nz, V 1 R nz nz (spd), Ψ 1 R nz nz (spd), and ν 1 (n z, ) are parameters of the prior distribution. N(X; M, U V ) is the PDF of the matrix-variate normal distribution with mean M, and variance parameters U (among-row) and V (among-column) [17, Ch. ], and IW(X; Ψ, ν) is PDF of the inverse-wishart distribution with scale-matrix Ψ and ν degrees of freedom [17, Ch. 3]. The filtering posterior distribution p(x, u, R, z 1: ) of the model defined by (3) and (5) is not analytically tractable. Our solution is to use a variational Bayesian approximation, where we find the functions q x,u (x, u ) and q R, (R, ) such that the reversed Kullbac Leibler divergence (KLD) D KL ( qx,u (x, u ) q R, (R, ) p(x, u, R, z 1: ) ) (6) is minimised, where D KL (q p)= q(x) log( q(x) p(x) ) dx. In general there is no exact analytical solution for (q x,u, q R, ), but the iteration of log q x,u (x, u ) E qr,,[log p(z, x, u, R, z 1: 1 )] + c x,u (7a) log q R, (R, ) E qx,u [log p(z, x, u, R, z 1: 1 )] + c R, (7b) always reduces the KLD (6) and for many models gives a sequence that converges towards the optimal functions (q x,u, q R, ) [18, Chapter 10][19]. The expected values on the right hand sides of (7) are taen with respect to the current q x,u and q R,, and c x,u and c R, are constants with respect to the variables (x, u ), and (R, ), respectively. Thans to the chosen prior distribution structure (5), the update (7b) has a closed form solution that preserves the functional form of the prior, and the moments of the distribution required by other computations are also analytically tractable. The analytical solution of the update (7a) is a multivariate normal distribution truncated by multiple linear constraints. The mean and covariance matrix of this distribution can be approximated using the sequential truncation algorithm [0, 1, 13]. The distribution q x,u (x, u ) is then approximated by the unconstrained multivariate normal distribution with the obtained moments q x,u (x, u ) N ( [ x u ] ; ξ, Ξ ), (8) where ξ and Ξ are the mean and covariance matrix given by the sequential truncation algorithm. Normal marginal posterior approximation for x guarantees that we get a recursive algorithm. The approximative filtering posterior of (R, ) is the MVNIW distribution q R, (R, ) = N( ;, R V ) IW(R ; Ψ, ν ), (9) whose required moments are analytically tractable when ν >n z as shown in Appendix A. 3. Time update The marginal distribution of the AR coefficient vector x in the posterior approximation N ( [ x ) u ] ; ξ, Ξ qr, (R, ) is a normal distribution and the state transition (3b) is linear and Gaussian. Thus, the filter prediction becomes the standard Kalman filter prediction and the prediction distribution is normal. The dynamical model of the model parameters p(r, R 1, 1 ) is typically unnown and/or intractable. Therefore, we adopt the forgetting factor update, which provides the maximum-entropy solution given the KLD from the previous posterior [, 3]. Thus, the used prediction density given the MVNIW approximation of the previous posterior and the forgetting factor γ (0, 1] is ˆp(R, y 1: 1 ) N( ; 1 1, R 1 γ V 1 1) IW(R ; γψ 1 1, γ ν (1 γ) n z ). (10) where the term (1 γ) n z guarantees that the resulting inverse-wishart distribution is well-defined and has an expectation value. The details of the proposed recursive identification algorithm including the prediction equations implied by the time update (10) are given in Appendix B. 3

4 4 Simulated example We simulated 1000 Monte Carlo replications of the AR model with 5 AR coefficients with n z = dimensional sew-normally distributed innovations with parameters R=0.1 I and =[ 1 0 ]. Thus, the true distribution has high positive sewness. The true coefficients were simulated by generating the zeros of the characteristic polynomial from the uniform distribution unif( 1, 1). The number of AR coefficients was assumed nown. The initial prior covariance matrix for the AR coefficient vector was given by the 1st order stable spline ernel [P 1 0 ] i,j = max(i 1,j 1), and the process noise covariance was chosen as [Q 1 ] i,j = ( 1 γ 1) max(diag(p 1 1)) 0.5 max(i 1,j 1) to preserve the stable ernel form of the prior [4, 5]. The proposed method is compared with the Gaussian variational Bayes filter for slowly-drifting noise proposed by Agamennoni et al in [6]. The sew-normal based identification method was given the positive direction of the sewness by using the initial prior π 1 p(r 1, 1 ) = N( 1 ; I, R 1 I) IW(R 1 ; ν I, ν 1 0 ), (11) where ν 1 0 = That is, the variance is divided equally between the symmetric and sewed component in the sense that E[R1 1 ] 1 = E[ 1 ] π = 1 I. The normal distribution based method was given the initial prior p(r 1 ) = IW(R 1 ; (ν 1 0 3)I, ν 1 0 ). (1) The forgetting factor value used with both the methods was γ = 0.975, and the number of VB iterations was 10. Fig. shows the relative difference of the identification error ɛ = nar i=1 (x (x ) true ) (13) as a function of the fed number of measurements. The figure shows that the sew-normal based identification method gives a lower median error than the normal distribution based, and the relative differences increase as the number of measurements increase. Fig. shows that after measurements, the sew-normal based method is more accurate in about 95 % of the cases and gives at least 5 % lower identification error in most of the simulations. 100*(error sew -error normal )/error normal #measurements median 5% / 75% quant. 5% / 95% quant. Figure : The proposed algorithm is more accurate than the normal distribution based algorithm in 95 % of the simulations, and in most of the simulations the error (13) is reduced by more than 5 %. 5 Conclusions We proposed a novel recursive estimation algorithm for identifying the model coefficients and innovation distribution parameters of autoregressive models with sew-normally distributed innovations. Both model coefficients and innovation distribution parameters can be slowly time-varying. Our computer simulation showed that modelling sewness can improve the accuracy of identification. 4

5 Acnowledgments H. Nurminen receives funding from Tampere University of Technology Graduate School, Noia Technologies Oy, the Foundation of Noia Corporation, and Teniian edistämissäätiö. T. Ardeshiri receives funding from Swedish research council s (VR) project Scalable Kalman filters, and from Jaguar Land Rover (JLR), Whitley, UK. References [1] C. R. Harvey and A. Siddique, Autoregressive conditional sewness, The Journal of Financial and Quantitative Analysis, vol. 34, pp , December [] E. Jondeau and M. Rocinger, Conditional volatility, sewness, and urtosis: existence, persistence, and comovements, Journal of Economic Dynamics and Control, vol. 7, pp , 003. [3] P. Christofferssen, S. Heston, and K. Jacobs, Option valuation with conditional sewness, Journal of Econometrics, vol. 131, pp , 006. [4] G. Tsiotas, On generalised asymmetric stochastic volatility models, Computational Statistics and Data Analysis, vol. 56, pp , 01. [5] K. Kaemarungsi and P. Krishnamurthy, Analysis of WLAN s received signal strength indication for indoor location fingerprinting, Pervasive and Mobile Computing, vol. 8, no., pp , 01. Special Issue: Wide-Scale Vehicular Sensor Networs and Mobile Sensing. [6] H. Nurminen, T. Ardeshiri, R. Piché, and F. Gustafsson, A NLOS-robust TOA positioning filter based on a sew-t measurement noise model, in International Conference on Indoor Positioning and Indoor Navigation (IPIN), pp. 1 7, October 015. [7] A. Azzalini, A class of distributions which includes the normal ones, Scandinavian Journal of Statistics, vol. 1, no., pp , [8] A. Azzalini and A. Dalla Valle, The multivariate sew-normal distribution, Biometria, vol. 83, no. 4, pp , [9] T. I. Lin, Maximum lielihood estimation for multivariate sew normal mixture models, Journal of Multivariate Analysis, vol. 100, pp , 009. [10] S. Lee and G. J. McLachlan, Finite mixtures of multivariate sew t-distributions: some recent and new results, Statistics and Computing, vol. 4, no., pp , 014. [11] H. J. Ho, S. Pyne, and T. I. Lin, Maximum lielihood inference for mixtures of sew student-t-normal distributions through practical EM-type algorithms, Statistics and Computing, vol., pp , 01. [1] H. Nurminen, T. Ardeshiri, R. Piché, and F. Gustafsson, Robust inference for state-space models with sewed measurement noise, IEEE Signal Processing Letters, vol., pp , November 015. [13] H. Nurminen, T. Ardeshiri, R. Piché, and F. Gustafsson, Sew-t filter and smoother with improved covariance matrix approximation. Available online at August 016. [14] L. Ljung, Recursive identification algorithms, Circuits, Systems, and Signal Processing, vol. 1, no. 1, pp , 00. [15] R. B. Arellano-Valle and M. G. Genton, On fundamental sew distributions, Journal of Multivariate Analysis, no. 96, pp , 005. [16] S. X. Lee and G. J. McLachlan, Finite mixtures of canonical fundamental sew t-distributions the unification of the restricted and unrestricted sew t-mixture models, Statistics and Computing, no. 6, pp , 016. [17] A. K. Gupta and D. K. Nagar, Matrix variate distributions. Boca Raton, FL: Chapman & Hall/CRC, 000. [18] C. M. Bishop, Pattern Recognition and Machine Learning. Springer, 007. [19] D. G. Tzias, A. C. Lias, and N. P. Galatsanos, The variational approximation for Bayesian inference, IEEE Signal Processing Magazine, vol. 5, pp , Nov

6 [0] T. Perälä and S. Ali-Löytty, Kalman-type positioning filters with floor plan information, in 6th International Conference on Advances in Mobile Computing and Multimedia (MoMM), pp , 008. [1] D. J. Simon and D. L. Simon, Constrained Kalman filtering via density function truncation for turbofan engine health estimation, International Journal of Systems Science, vol. 41, no., pp , 010. [] M. Kárný and K. Dedecius, Approximate Bayesian recursive estimation: On approximation errors, tech. rep., ÚTIA AV ČR, January 01. [3] E. Özan, V. Šmídl, S. Saha, C. Lundquist, and F. Gustafsson, Marginalized adaptive particle filtering for nonlinear models with unnown time-varying noise parameters, Automatica, vol. 49, 013. [4] G. Pillonetto and G. De Nicolao, A new ernel-based approach for linear system identification, Automatica, vol. 46, pp , 010. [5] T. Chen, H. Ohlsson, and L. Ljung, On the estimation of transfer functions, regularizations and Gaussian processes revisited, Automatica, vol. 48, pp , 01. [6] G. Agamennoni, J. Nieto, and E. Nebot, Approximate inference in state-space models with heavy-tailed noise, IEEE Transactions on Signal Processing, vol. 60, pp , October 01. 6

7 Appendices A Variational solution of the measurement update Our variational solution uses this hierarchical formulation of the measurement noise model: z x, u, R, N(C x + (u π 1), R ), u N + (0, I), R N( 1, R V 1 ), R IW(Ψ 1, ν 1 ), (14a) (14b) (14c) (14d) where z R nz is the measurement, u R nz is the sewness variable vector, and 1 R nz nz, V 1 R nz nz (spd), Ψ 1 R nz nz (spd), and ν 1 > n z are the parameters of the joint prior distribution of and R. The prior of and R is implied by the previous filtering posterior and the time update step (filter prediction) that is explained in section 3.. The derivations for the variational solution (7) are given in Sections A.1 and A.. For brevity all constant values are denoted by c in the derivation. The logarithm of the full filtering distribution which is needed for the derivations is log p(z, x, u, R, z 1: 1 ) = 1 (z C x (u π 1))T R 1 1 (x x 1 ) T P 1 1 (x x 1 ) 1 ut u 1 Tr{( 1 )V 1 1 ( 1 ) T R 1 } ν (z C x (u π 1)) log det(r ) 1 Tr{Ψ 1R 1 } + c, u 0, (15) where x 1 and P 1 are the mean and covariance matrix of the current predictive distribution, and Tr{ } is the matrix trace. A.1 Derivations for q x,u Using equation (7a) we obtain log q x,u (x, u ) = 1 E q R, [ (z C x (u π 1))T R 1 (z C x (u π 1))] 1 (x x 1 ) T P 1 1 (x x 1 ) 1 ut u + c (16) = 1 (z C x (u π 1))T R 1 (z C x (u π 1)) 1 (u π 1)T (E qr, [ T R 1 ] T R 1 )(u π 1) 1 (x x 1 ) T P 1 1 (x x 1 ) 1 ut u + c u 0, (17) where (R, ) (E qr, [R 1 ] 1, E qr, [ ]) as well as the identity E qr, [R 1 ] = R 1 are derived in Section A.. The inequality u 0 denotes that each element of the vector u is required to be greater or equal than zero. Further, in Section A. it is proved that 7

8 E qr, [ T R 1 ] = n z V + T R 1, so Eq. (17) becomes log q x,u (x, u ) = 1 (z C x (u π 1)T V (u n z (u π 1))T R 1 π 1) (z C x (u π 1)) 1 (x x 1 ) T P 1 1 (x x 1 ) 1 ut u + c (18) = 1 (z + π 1 [ C ] [ x u ]) T R 1 (z + π 1 [ C ] [ x u ]) 1 ([ x u ] ξ 1 ) T Ξ 1 1 ([ x u ] ξ 1 ) + c, u 0, (19) where [ ] P 1 O Ξ 1 = O (I + n z V ) 1, (0) [ ] ξ 1 = x 1 n z π (I + n zv ) 1 V 1. (1) Hence, q x,u (x, u ) N(z + π 1; [ C ] [ x u ], R ) N([ x u ] ; ξ 1, Ξ 1 ) [u 0] () N([ x u ] ; ξ, Ξ ) [u 0], (3) where [ ] is the Iverson bracet, and ξ and Ξ are the outputs of the Kalman filter update C = [ C ], (4) K = Ξ 1 CT ( C Ξ 1 CT + R ) 1, (5) ξ = ξ 1 + K (z + π 1 C ξ 1 ), (6) Ξ = (I K C )Ξ 1. (7) To mae the algorithm recursive, we approximate q x,u with a multivariate normal distribution q x,u (x, u ) = N([ x u ] ; ξ, Ξ ) [u 0] (8) N([ x u ] ; ξ, Ξ ), (9) whose approximate mean and covariance matrix ξ and Ξ are obtained through approximate moment-matching. Our approach for approximating the moments is the sequential truncation algorithm [0, 1][13, Table I]. Let us denote the approximate distribution with q x,u (x, u ) N([ x u ] ; ξ, Ξ ). In Section A., certain moments of q x,u are required. They are approximated as x E qxu [x ] = [ξ ] 1:nx, (30) P V qxu [x ] = [Ξ ] 1:nx,1:n x, (31) u E qxu [u ] = [ξ ] nx+(1:n z), (3) U V qxu [u ] = [Ξ ] nx+(1:n z),n x+(1:n z), (33) Υ E qxu [x u T ] x u T = [Ξ ] 1:nx,n x+(1:n z), (34) where n x +(1 : n z ) denotes (n x +1) : (n x +n z ). 8

9 A. Derivations for q R, Using equation (7b) and the approximation (9) we obtain logq R, (R, ) = E qx,u [log N(z ; C x + (u ] π 1), R ) + log N( ; 1, R V 1 ) + log IW(R ; Ψ 1, ν 1 ) + c (35) = 1 log det(r ) 1 { ] } Tr E qx,u [(z C x (u π 1))(z C x (u π 1))T R 1 n z log det(r ) 1 { } Tr ( 1 )V 1 1 ( 1 ) T R 1 ν 1 = ν 1 + n z + 1 log det(r ) 1 Tr { Ψ 1 R 1 log det(r ) 1 Tr {( ( U + (u ( z (u ( z (u π 1)(u π 1)T C (Υ + x (u } + c (36) π 1)T) T π 1)T ) ) T π 1)T C (Υ + x (u π 1)T ) ) T ) } R 1 + (z C x )(z C x ) T + C P C T 1 {( ) } Tr ( 1 )V 1 1 ( 1 ) T + Ψ 1 R 1 + c (37) = ν 1 + n z + 1 log det(r ) 1 Tr {( ( U + (u ( z (u ( z (u π 1)(u π 1)T + V 1 π 1)T C (Υ + x (u π 1)T C (Υ + x (u 1) T 1) T ) T π 1)T ) + 1 V 1 π 1)T ) + 1 V (z C x )(z C x ) T + C P C T + 1 V 1 1 T 1 + Ψ 1 = n z log det(r ) 1 Tr{( )V 1 ( ) T R 1 } ν log det(r ) 1 Tr{Ψ R 1 } (39) where V = ( U + (u π 1)(u π 1)T + V 1) 1, (40) = ( (z C x )(u π 1)T C Υ + 1 V 1) V, (41) ν =ν 1 + 1, (4) Therefore, Ψ = 1 V 1 1 T 1 V 1 T + (z C x )(z C x ) T ) R 1 } (38) + C P C T + Ψ 1. (43) q R, (R, ) = N( ;, R V ) IW(R ; Ψ, ν ). (44) 9

10 The following moments are required for the derivations of Section A.1: E qr, [ ] =, (45) R E qr, [R 1 1 ] 1 = ν n z 1 Ψ. (46) Eq. (46) follows from the fact that R IW(Ψ, ν ) implies that R 1 is Wishart-distributed with shape matrix Ψ 1 and ν n z 1 degrees of freedom [17, Ch. 3.4]. Furthermore, and E qr, [R 1 ] = R 1 N( ;, R V ) IW(R ; Ψ, ν ) d dr (47) = R 1 IW(R ; Ψ, ν ) dr (48) = (ν n z 1)Ψ 1 (49) = R 1 (50) E qr, [ T R 1 ] = T R 1 N( ;, R V ) IW(R ; Ψ, ν ) d dr (51) = (Tr{R R 1 }V + T R 1 ) IW(R ; Ψ, ν ) dr (5) = n z V + (ν n z 1) T Ψ 1 (53) = n z V + T R 1, (54) where (5) follows from the matrix-variate normal identity E[X T AX] = Tr{UA T }V + M T AM for X N(M, U V ) [17, Ch..3]. 10

11 B Recursive Identification Algorithm for Linear Systems with Sewed Innovations 1: Inputs: x 1 0, P 1 0, 1 0, V 1 0, Ψ 1 0, ν 1 0, Q 1:K, C 1:K, z 1:K, γ : for = 1 to K do Initialize 3: x x 1 4: u u 1 5: 1 6: V V 1 7: Ψ Ψ 1 8: ν ν : repeat 10: 1 R ν n Ψ z 1 Update q x,u (x, u ) N([ x [ u ] ; ξ, Ξ ) x 1 ] 11: ξ 1 n z /π(inz +n zv ) 1 V 1 1: Ξ 1 blocdiag(p 1, (I + n z V ) 1 ) 13: C [ C ] 14: K Ξ 1 CT ( C Ξ 1 CT + R ) 1 15: ξ ξ 1 + K (z C ξ 1 + π 1) 16: Ξ Ξ 1 K C K T 17: [ξ, Ξ ] seq_trunc( ξ, Ξ, {n AR +1,..., n AR +n z }) See [13, Table I] 18: x [ξ ] 1:nAR 19: P [Ξ ] 1:nAR,1:n AR 0: ũ [ξ ] nar+(1:n z) π 1 1: U [Ξ ] nar+(1:n z),n AR+(1:n z) : Υ [Ξ ] 1:nAR,n AR+(1:n z) Update q R, (R, ) = N( ;, R V ) IW(R ; R, ν ) 3: V ( U + ũ ũ T + V 1 ) 1 1 4: ( (z C x )ũ T C Υ + 1 V 1) V 5: Ψ 1 V 1 1 T 1 V 1 T 6: +(z C x )(z C x ) T + C P C T + Ψ 1 7: until converged Predict 8: x +1 x 9: P +1 P + Q 30: +1 31: V +1 1 γ V 3: Ψ +1 γψ 33: ν +1 γ ν + (1 γ) n z 34: end for 35: Outputs: x and P for = 1,..., K 11

RECURSIVE OUTLIER-ROBUST FILTERING AND SMOOTHING FOR NONLINEAR SYSTEMS USING THE MULTIVARIATE STUDENT-T DISTRIBUTION

RECURSIVE OUTLIER-ROBUST FILTERING AND SMOOTHING FOR NONLINEAR SYSTEMS USING THE MULTIVARIATE STUDENT-T DISTRIBUTION 1 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, SEPT. 3 6, 1, SANTANDER, SPAIN RECURSIVE OUTLIER-ROBUST FILTERING AND SMOOTHING FOR NONLINEAR SYSTEMS USING THE MULTIVARIATE STUDENT-T

More information

NON-LINEAR NOISE ADAPTIVE KALMAN FILTERING VIA VARIATIONAL BAYES

NON-LINEAR NOISE ADAPTIVE KALMAN FILTERING VIA VARIATIONAL BAYES 2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING NON-LINEAR NOISE ADAPTIVE KALMAN FILTERING VIA VARIATIONAL BAYES Simo Särä Aalto University, 02150 Espoo, Finland Jouni Hartiainen

More information

Recursive Noise Adaptive Kalman Filtering by Variational Bayesian Approximations

Recursive Noise Adaptive Kalman Filtering by Variational Bayesian Approximations PREPRINT 1 Recursive Noise Adaptive Kalman Filtering by Variational Bayesian Approximations Simo Särä, Member, IEEE and Aapo Nummenmaa Abstract This article considers the application of variational Bayesian

More information

RAO-BLACKWELLIZED PARTICLE FILTER FOR MARKOV MODULATED NONLINEARDYNAMIC SYSTEMS

RAO-BLACKWELLIZED PARTICLE FILTER FOR MARKOV MODULATED NONLINEARDYNAMIC SYSTEMS RAO-BLACKWELLIZED PARTICLE FILTER FOR MARKOV MODULATED NONLINEARDYNAMIC SYSTEMS Saiat Saha and Gustaf Hendeby Linöping University Post Print N.B.: When citing this wor, cite the original article. 2014

More information

Latent Variable Models and EM algorithm

Latent Variable Models and EM algorithm Latent Variable Models and EM algorithm SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic 3.1 Clustering and Mixture Modelling K-means and hierarchical clustering are non-probabilistic

More information

This is a repository copy of A Novel Robust Rauch-Tung-Striebel Smoother Based on Slash and Generalized Hyperbolic Skew Student s T-Distributions.

This is a repository copy of A Novel Robust Rauch-Tung-Striebel Smoother Based on Slash and Generalized Hyperbolic Skew Student s T-Distributions. This is a repository copy of A Novel Robust Rauch-Tung-Striebel Smoother Based on Slash and Generalized Hyperbolic Skew Student s T-Distributions. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/130958/

More information

in a Rao-Blackwellised Unscented Kalman Filter

in a Rao-Blackwellised Unscented Kalman Filter A Rao-Blacwellised Unscented Kalman Filter Mar Briers QinetiQ Ltd. Malvern Technology Centre Malvern, UK. m.briers@signal.qinetiq.com Simon R. Masell QinetiQ Ltd. Malvern Technology Centre Malvern, UK.

More information

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Lecture 2: From Linear Regression to Kalman Filter and Beyond Lecture 2: From Linear Regression to Kalman Filter and Beyond January 18, 2017 Contents 1 Batch and Recursive Estimation 2 Towards Bayesian Filtering 3 Kalman Filter and Bayesian Filtering and Smoothing

More information

Lecture 4: Probabilistic Learning. Estimation Theory. Classification with Probability Distributions

Lecture 4: Probabilistic Learning. Estimation Theory. Classification with Probability Distributions DD2431 Autumn, 2014 1 2 3 Classification with Probability Distributions Estimation Theory Classification in the last lecture we assumed we new: P(y) Prior P(x y) Lielihood x2 x features y {ω 1,..., ω K

More information

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Lecture 2: From Linear Regression to Kalman Filter and Beyond Lecture 2: From Linear Regression to Kalman Filter and Beyond Department of Biomedical Engineering and Computational Science Aalto University January 26, 2012 Contents 1 Batch and Recursive Estimation

More information

The Unscented Particle Filter

The Unscented Particle Filter The Unscented Particle Filter Rudolph van der Merwe (OGI) Nando de Freitas (UC Bereley) Arnaud Doucet (Cambridge University) Eric Wan (OGI) Outline Optimal Estimation & Filtering Optimal Recursive Bayesian

More information

Linear Dynamical Systems

Linear Dynamical Systems Linear Dynamical Systems Sargur N. srihari@cedar.buffalo.edu Machine Learning Course: http://www.cedar.buffalo.edu/~srihari/cse574/index.html Two Models Described by Same Graph Latent variables Observations

More information

Evaluating the Consistency of Estimation

Evaluating the Consistency of Estimation Tampere University of Technology Evaluating the Consistency of Estimation Citation Ivanov, P., Ali-Löytty, S., & Piche, R. (201). Evaluating the Consistency of Estimation. In Proceedings of 201 International

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA Contents in latter part Linear Dynamical Systems What is different from HMM? Kalman filter Its strength and limitation Particle Filter

More information

Online tests of Kalman filter consistency

Online tests of Kalman filter consistency Tampere University of Technology Online tests of Kalman filter consistency Citation Piché, R. (216). Online tests of Kalman filter consistency. International Journal of Adaptive Control and Signal Processing,

More information

Large-scale Ordinal Collaborative Filtering

Large-scale Ordinal Collaborative Filtering Large-scale Ordinal Collaborative Filtering Ulrich Paquet, Blaise Thomson, and Ole Winther Microsoft Research Cambridge, University of Cambridge, Technical University of Denmark ulripa@microsoft.com,brmt2@cam.ac.uk,owi@imm.dtu.dk

More information

Bayesian Inference Course, WTCN, UCL, March 2013

Bayesian Inference Course, WTCN, UCL, March 2013 Bayesian Course, WTCN, UCL, March 2013 Shannon (1948) asked how much information is received when we observe a specific value of the variable x? If an unlikely event occurs then one would expect the information

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate

More information

Dynamic System Identification using HDMR-Bayesian Technique

Dynamic System Identification using HDMR-Bayesian Technique Dynamic System Identification using HDMR-Bayesian Technique *Shereena O A 1) and Dr. B N Rao 2) 1), 2) Department of Civil Engineering, IIT Madras, Chennai 600036, Tamil Nadu, India 1) ce14d020@smail.iitm.ac.in

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 11 Project

More information

Expectation Propagation Algorithm

Expectation Propagation Algorithm Expectation Propagation Algorithm 1 Shuang Wang School of Electrical and Computer Engineering University of Oklahoma, Tulsa, OK, 74135 Email: {shuangwang}@ou.edu This note contains three parts. First,

More information

Properties and approximations of some matrix variate probability density functions

Properties and approximations of some matrix variate probability density functions Technical report from Automatic Control at Linköpings universitet Properties and approximations of some matrix variate probability density functions Karl Granström, Umut Orguner Division of Automatic Control

More information

Bayesian Inference for the Multivariate Normal

Bayesian Inference for the Multivariate Normal Bayesian Inference for the Multivariate Normal Will Penny Wellcome Trust Centre for Neuroimaging, University College, London WC1N 3BG, UK. November 28, 2014 Abstract Bayesian inference for the multivariate

More information

CONSTRAINT KALMAN FILTER FOR INDOOR BLUETOOTH LOCALIZATION

CONSTRAINT KALMAN FILTER FOR INDOOR BLUETOOTH LOCALIZATION CONSTRAINT KALMAN FILTER FOR INDOOR BLUETOOTH LOCALIZATION Liang Chen, Heidi Kuusniemi, Yuwei Chen, Jingbin Liu, Ling Pei, Laura Ruotsalainen, and Ruizhi Chen NLS Finnish Geospatial Research Institute

More information

Simultaneous state and input estimation with partial information on the inputs

Simultaneous state and input estimation with partial information on the inputs Loughborough University Institutional Repository Simultaneous state and input estimation with partial information on the inputs This item was submitted to Loughborough University's Institutional Repository

More information

Curve Fitting Re-visited, Bishop1.2.5

Curve Fitting Re-visited, Bishop1.2.5 Curve Fitting Re-visited, Bishop1.2.5 Maximum Likelihood Bishop 1.2.5 Model Likelihood differentiation p(t x, w, β) = Maximum Likelihood N N ( t n y(x n, w), β 1). (1.61) n=1 As we did in the case of the

More information

F denotes cumulative density. denotes probability density function; (.)

F denotes cumulative density. denotes probability density function; (.) BAYESIAN ANALYSIS: FOREWORDS Notation. System means the real thing and a model is an assumed mathematical form for the system.. he probability model class M contains the set of the all admissible models

More information

A Bayesian Approach to Jointly Estimate Tire Radii and Vehicle Trajectory

A Bayesian Approach to Jointly Estimate Tire Radii and Vehicle Trajectory A Bayesian Approach to Jointly Estimate Tire Radii and Vehicle Trajectory Emre Özan, Christian Lundquist and Fredri Gustafsson Abstract High-precision estimation of vehicle tire radii is considered, based

More information

Gaussian Process Approximations of Stochastic Differential Equations

Gaussian Process Approximations of Stochastic Differential Equations Gaussian Process Approximations of Stochastic Differential Equations Cédric Archambeau Dan Cawford Manfred Opper John Shawe-Taylor May, 2006 1 Introduction Some of the most complex models routinely run

More information

Expectation Propagation in Dynamical Systems

Expectation Propagation in Dynamical Systems Expectation Propagation in Dynamical Systems Marc Peter Deisenroth Joint Work with Shakir Mohamed (UBC) August 10, 2012 Marc Deisenroth (TU Darmstadt) EP in Dynamical Systems 1 Motivation Figure : Complex

More information

A Systematic Approach for Kalman-type Filtering with non-gaussian Noises

A Systematic Approach for Kalman-type Filtering with non-gaussian Noises Tampere University of Technology A Systematic Approach for Kalman-type Filtering with non-gaussian Noises Citation Raitoharju, M., Piche, R., & Nurminen, H. (26). A Systematic Approach for Kalman-type

More information

Self Adaptive Particle Filter

Self Adaptive Particle Filter Self Adaptive Particle Filter Alvaro Soto Pontificia Universidad Catolica de Chile Department of Computer Science Vicuna Mackenna 4860 (143), Santiago 22, Chile asoto@ing.puc.cl Abstract The particle filter

More information

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision The Particle Filter Non-parametric implementation of Bayes filter Represents the belief (posterior) random state samples. by a set of This representation is approximate. Can represent distributions that

More information

A student's t filter for heavy tailed process and measurement noise

A student's t filter for heavy tailed process and measurement noise A student's t filter for heavy tailed process and measurement noise Michael Roth, Emre Ozan and Fredri Gustafsson Linöping University Post Print N.B.: When citing this wor, cite the original article. Original

More information

An Introduction to Expectation-Maximization

An Introduction to Expectation-Maximization An Introduction to Expectation-Maximization Dahua Lin Abstract This notes reviews the basics about the Expectation-Maximization EM) algorithm, a popular approach to perform model estimation of the generative

More information

Information geometry for bivariate distribution control

Information geometry for bivariate distribution control Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic

More information

Local Positioning with Parallelepiped Moving Grid

Local Positioning with Parallelepiped Moving Grid Local Positioning with Parallelepiped Moving Grid, WPNC06 16.3.2006, niilo.sirola@tut.fi p. 1/?? TA M P E R E U N I V E R S I T Y O F T E C H N O L O G Y M a t h e m a t i c s Local Positioning with Parallelepiped

More information

Monte Carlo Approximation of Monte Carlo Filters

Monte Carlo Approximation of Monte Carlo Filters Monte Carlo Approximation of Monte Carlo Filters Adam M. Johansen et al. Collaborators Include: Arnaud Doucet, Axel Finke, Anthony Lee, Nick Whiteley 7th January 2014 Context & Outline Filtering in State-Space

More information

The Expectation-Maximization Algorithm

The Expectation-Maximization Algorithm The Expectation-Maximization Algorithm Francisco S. Melo In these notes, we provide a brief overview of the formal aspects concerning -means, EM and their relation. We closely follow the presentation in

More information

Parameter identification using the skewed Kalman Filter

Parameter identification using the skewed Kalman Filter Parameter identification using the sewed Kalman Filter Katrin Runtemund, Gerhard Müller, U München Motivation and the investigated identification problem Problem: damped SDOF system excited by a sewed

More information

Exercises Tutorial at ICASSP 2016 Learning Nonlinear Dynamical Models Using Particle Filters

Exercises Tutorial at ICASSP 2016 Learning Nonlinear Dynamical Models Using Particle Filters Exercises Tutorial at ICASSP 216 Learning Nonlinear Dynamical Models Using Particle Filters Andreas Svensson, Johan Dahlin and Thomas B. Schön March 18, 216 Good luck! 1 [Bootstrap particle filter for

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS Parametric Distributions Basic building blocks: Need to determine given Representation: or? Recall Curve Fitting Binary Variables

More information

The Variational Gaussian Approximation Revisited

The Variational Gaussian Approximation Revisited The Variational Gaussian Approximation Revisited Manfred Opper Cédric Archambeau March 16, 2009 Abstract The variational approximation of posterior distributions by multivariate Gaussians has been much

More information

COM336: Neural Computing

COM336: Neural Computing COM336: Neural Computing http://www.dcs.shef.ac.uk/ sjr/com336/ Lecture 2: Density Estimation Steve Renals Department of Computer Science University of Sheffield Sheffield S1 4DP UK email: s.renals@dcs.shef.ac.uk

More information

Conditional Posterior Cramér-Rao Lower Bounds for Nonlinear Sequential Bayesian Estimation

Conditional Posterior Cramér-Rao Lower Bounds for Nonlinear Sequential Bayesian Estimation 1 Conditional Posterior Cramér-Rao Lower Bounds for Nonlinear Sequential Bayesian Estimation Long Zuo, Ruixin Niu, and Pramod K. Varshney Abstract Posterior Cramér-Rao lower bounds (PCRLBs) 1] for sequential

More information

On the Reduction of Gaussian inverse Wishart Mixtures

On the Reduction of Gaussian inverse Wishart Mixtures On the Reduction of Gaussian inverse Wishart Mixtures Karl Granström Division of Automatic Control Department of Electrical Engineering Linköping University, SE-58 8, Linköping, Sweden Email: karl@isy.liu.se

More information

Parameterized Joint Densities with Gaussian Mixture Marginals and their Potential Use in Nonlinear Robust Estimation

Parameterized Joint Densities with Gaussian Mixture Marginals and their Potential Use in Nonlinear Robust Estimation Proceedings of the 2006 IEEE International Conference on Control Applications Munich, Germany, October 4-6, 2006 WeA0. Parameterized Joint Densities with Gaussian Mixture Marginals and their Potential

More information

Variational Principal Components

Variational Principal Components Variational Principal Components Christopher M. Bishop Microsoft Research 7 J. J. Thomson Avenue, Cambridge, CB3 0FB, U.K. cmbishop@microsoft.com http://research.microsoft.com/ cmbishop In Proceedings

More information

Extension of the Sparse Grid Quadrature Filter

Extension of the Sparse Grid Quadrature Filter Extension of the Sparse Grid Quadrature Filter Yang Cheng Mississippi State University Mississippi State, MS 39762 Email: cheng@ae.msstate.edu Yang Tian Harbin Institute of Technology Harbin, Heilongjiang

More information

Computer Vision Group Prof. Daniel Cremers. 6. Mixture Models and Expectation-Maximization

Computer Vision Group Prof. Daniel Cremers. 6. Mixture Models and Expectation-Maximization Prof. Daniel Cremers 6. Mixture Models and Expectation-Maximization Motivation Often the introduction of latent (unobserved) random variables into a model can help to express complex (marginal) distributions

More information

(Multivariate) Gaussian (Normal) Probability Densities

(Multivariate) Gaussian (Normal) Probability Densities (Multivariate) Gaussian (Normal) Probability Densities Carl Edward Rasmussen, José Miguel Hernández-Lobato & Richard Turner April 20th, 2018 Rasmussen, Hernàndez-Lobato & Turner Gaussian Densities April

More information

Chris Bishop s PRML Ch. 8: Graphical Models

Chris Bishop s PRML Ch. 8: Graphical Models Chris Bishop s PRML Ch. 8: Graphical Models January 24, 2008 Introduction Visualize the structure of a probabilistic model Design and motivate new models Insights into the model s properties, in particular

More information

Lecture 1a: Basic Concepts and Recaps

Lecture 1a: Basic Concepts and Recaps Lecture 1a: Basic Concepts and Recaps Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London c.archambeau@cs.ucl.ac.uk Advanced

More information

Towards a Bayesian model for Cyber Security

Towards a Bayesian model for Cyber Security Towards a Bayesian model for Cyber Security Mark Briers (mbriers@turing.ac.uk) Joint work with Henry Clausen and Prof. Niall Adams (Imperial College London) 27 September 2017 The Alan Turing Institute

More information

Identifiability problems in some non-gaussian spatial random fields

Identifiability problems in some non-gaussian spatial random fields Chilean Journal of Statistics Vol. 3, No. 2, September 2012, 171 179 Probabilistic and Inferential Aspects of Skew-Symmetric Models Special Issue: IV International Workshop in honour of Adelchi Azzalini

More information

Auto-Encoding Variational Bayes

Auto-Encoding Variational Bayes Auto-Encoding Variational Bayes Diederik P Kingma, Max Welling June 18, 2018 Diederik P Kingma, Max Welling Auto-Encoding Variational Bayes June 18, 2018 1 / 39 Outline 1 Introduction 2 Variational Lower

More information

Extended Object and Group Tracking with Elliptic Random Hypersurface Models

Extended Object and Group Tracking with Elliptic Random Hypersurface Models Extended Object and Group Tracing with Elliptic Random Hypersurface Models Marcus Baum Benjamin Noac and Uwe D. Hanebec Intelligent Sensor-Actuator-Systems Laboratory ISAS Institute for Anthropomatics

More information

Outline lecture 6 2(35)

Outline lecture 6 2(35) Outline lecture 35 Lecture Expectation aximization E and clustering Thomas Schön Division of Automatic Control Linöping University Linöping Sweden. Email: schon@isy.liu.se Phone: 13-1373 Office: House

More information

Gaussian Mixture Filter in Hybrid Navigation

Gaussian Mixture Filter in Hybrid Navigation Digest of TISE Seminar 2007, editor Pertti Koivisto, pages 1-5. Gaussian Mixture Filter in Hybrid Navigation Simo Ali-Löytty Institute of Mathematics Tampere University of Technology simo.ali-loytty@tut.fi

More information

PATTERN RECOGNITION AND MACHINE LEARNING

PATTERN RECOGNITION AND MACHINE LEARNING PATTERN RECOGNITION AND MACHINE LEARNING Chapter 1. Introduction Shuai Huang April 21, 2014 Outline 1 What is Machine Learning? 2 Curve Fitting 3 Probability Theory 4 Model Selection 5 The curse of dimensionality

More information

Nonlinear Parameter Estimation for State-Space ARCH Models with Missing Observations

Nonlinear Parameter Estimation for State-Space ARCH Models with Missing Observations Nonlinear Parameter Estimation for State-Space ARCH Models with Missing Observations SEBASTIÁN OSSANDÓN Pontificia Universidad Católica de Valparaíso Instituto de Matemáticas Blanco Viel 596, Cerro Barón,

More information

Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US

Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US Gerdie Everaert 1, Lorenzo Pozzi 2, and Ruben Schoonackers 3 1 Ghent University & SHERPPA 2 Erasmus

More information

1 EM algorithm: updating the mixing proportions {π k } ik are the posterior probabilities at the qth iteration of EM.

1 EM algorithm: updating the mixing proportions {π k } ik are the posterior probabilities at the qth iteration of EM. Université du Sud Toulon - Var Master Informatique Probabilistic Learning and Data Analysis TD: Model-based clustering by Faicel CHAMROUKHI Solution The aim of this practical wor is to show how the Classification

More information

The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence

The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence Jordi Gali Luca Gambetti ONLINE APPENDIX The appendix describes the estimation of the time-varying coefficients VAR model. The model

More information

Machine Learning Lecture Notes

Machine Learning Lecture Notes Machine Learning Lecture Notes Predrag Radivojac January 25, 205 Basic Principles of Parameter Estimation In probabilistic modeling, we are typically presented with a set of observations and the objective

More information

PROBABILITY DISTRIBUTIONS. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

PROBABILITY DISTRIBUTIONS. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception PROBABILITY DISTRIBUTIONS Credits 2 These slides were sourced and/or modified from: Christopher Bishop, Microsoft UK Parametric Distributions 3 Basic building blocks: Need to determine given Representation:

More information

Recursive Least Squares for an Entropy Regularized MSE Cost Function

Recursive Least Squares for an Entropy Regularized MSE Cost Function Recursive Least Squares for an Entropy Regularized MSE Cost Function Deniz Erdogmus, Yadunandana N. Rao, Jose C. Principe Oscar Fontenla-Romero, Amparo Alonso-Betanzos Electrical Eng. Dept., University

More information

Novel spectrum sensing schemes for Cognitive Radio Networks

Novel spectrum sensing schemes for Cognitive Radio Networks Novel spectrum sensing schemes for Cognitive Radio Networks Cantabria University Santander, May, 2015 Supélec, SCEE Rennes, France 1 The Advanced Signal Processing Group http://gtas.unican.es The Advanced

More information

Stochastic Variational Inference

Stochastic Variational Inference Stochastic Variational Inference David M. Blei Princeton University (DRAFT: DO NOT CITE) December 8, 2011 We derive a stochastic optimization algorithm for mean field variational inference, which we call

More information

A Comparison of Particle Filters for Personal Positioning

A Comparison of Particle Filters for Personal Positioning VI Hotine-Marussi Symposium of Theoretical and Computational Geodesy May 9-June 6. A Comparison of Particle Filters for Personal Positioning D. Petrovich and R. Piché Institute of Mathematics Tampere University

More information

Sequential Monte Carlo Methods for Bayesian Computation

Sequential Monte Carlo Methods for Bayesian Computation Sequential Monte Carlo Methods for Bayesian Computation A. Doucet Kyoto Sept. 2012 A. Doucet (MLSS Sept. 2012) Sept. 2012 1 / 136 Motivating Example 1: Generic Bayesian Model Let X be a vector parameter

More information

A variational radial basis function approximation for diffusion processes

A variational radial basis function approximation for diffusion processes A variational radial basis function approximation for diffusion processes Michail D. Vrettas, Dan Cornford and Yuan Shen Aston University - Neural Computing Research Group Aston Triangle, Birmingham B4

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

A Gradient-Based Algorithm Competitive with Variational Bayesian EM for Mixture of Gaussians

A Gradient-Based Algorithm Competitive with Variational Bayesian EM for Mixture of Gaussians A Gradient-Based Algorithm Competitive with Variational Bayesian EM for Mixture of Gaussians Miael Kuusela, Tapani Raio, Antti Honela, and Juha Karhunen Abstract While variational Bayesian (VB) inference

More information

Expectation propagation for signal detection in flat-fading channels

Expectation propagation for signal detection in flat-fading channels Expectation propagation for signal detection in flat-fading channels Yuan Qi MIT Media Lab Cambridge, MA, 02139 USA yuanqi@media.mit.edu Thomas Minka CMU Statistics Department Pittsburgh, PA 15213 USA

More information

Kalman filtering and friends: Inference in time series models. Herke van Hoof slides mostly by Michael Rubinstein

Kalman filtering and friends: Inference in time series models. Herke van Hoof slides mostly by Michael Rubinstein Kalman filtering and friends: Inference in time series models Herke van Hoof slides mostly by Michael Rubinstein Problem overview Goal Estimate most probable state at time k using measurement up to time

More information

Tracking of Extended Objects and Group Targets using Random Matrices A New Approach

Tracking of Extended Objects and Group Targets using Random Matrices A New Approach Tracing of Extended Objects and Group Targets using Random Matrices A New Approach Michael Feldmann FGAN Research Institute for Communication, Information Processing and Ergonomics FKIE D-53343 Wachtberg,

More information

Variational Learning : From exponential families to multilinear systems

Variational Learning : From exponential families to multilinear systems Variational Learning : From exponential families to multilinear systems Ananth Ranganathan th February 005 Abstract This note aims to give a general overview of variational inference on graphical models.

More information

GWAS V: Gaussian processes

GWAS V: Gaussian processes GWAS V: Gaussian processes Dr. Oliver Stegle Christoh Lippert Prof. Dr. Karsten Borgwardt Max-Planck-Institutes Tübingen, Germany Tübingen Summer 2011 Oliver Stegle GWAS V: Gaussian processes Summer 2011

More information

DETECTING PROCESS STATE CHANGES BY NONLINEAR BLIND SOURCE SEPARATION. Alexandre Iline, Harri Valpola and Erkki Oja

DETECTING PROCESS STATE CHANGES BY NONLINEAR BLIND SOURCE SEPARATION. Alexandre Iline, Harri Valpola and Erkki Oja DETECTING PROCESS STATE CHANGES BY NONLINEAR BLIND SOURCE SEPARATION Alexandre Iline, Harri Valpola and Erkki Oja Laboratory of Computer and Information Science Helsinki University of Technology P.O.Box

More information

BAYESIAN ESTIMATION OF UNKNOWN PARAMETERS OVER NETWORKS

BAYESIAN ESTIMATION OF UNKNOWN PARAMETERS OVER NETWORKS BAYESIAN ESTIMATION OF UNKNOWN PARAMETERS OVER NETWORKS Petar M. Djurić Dept. of Electrical & Computer Engineering Stony Brook University Stony Brook, NY 11794, USA e-mail: petar.djuric@stonybrook.edu

More information

Approximate Inference Part 1 of 2

Approximate Inference Part 1 of 2 Approximate Inference Part 1 of 2 Tom Minka Microsoft Research, Cambridge, UK Machine Learning Summer School 2009 http://mlg.eng.cam.ac.uk/mlss09/ Bayesian paradigm Consistent use of probability theory

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Probabilistic Graphical Models Brown University CSCI 2950-P, Spring 2013 Prof. Erik Sudderth Lecture 13: Learning in Gaussian Graphical Models, Non-Gaussian Inference, Monte Carlo Methods Some figures

More information

The Kalman Filter ImPr Talk

The Kalman Filter ImPr Talk The Kalman Filter ImPr Talk Ged Ridgway Centre for Medical Image Computing November, 2006 Outline What is the Kalman Filter? State Space Models Kalman Filter Overview Bayesian Updating of Estimates Kalman

More information

Recent Advances in Bayesian Inference Techniques

Recent Advances in Bayesian Inference Techniques Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian

More information

Approximate Inference Part 1 of 2

Approximate Inference Part 1 of 2 Approximate Inference Part 1 of 2 Tom Minka Microsoft Research, Cambridge, UK Machine Learning Summer School 2009 http://mlg.eng.cam.ac.uk/mlss09/ 1 Bayesian paradigm Consistent use of probability theory

More information

Gaussian Mixture Distance for Information Retrieval

Gaussian Mixture Distance for Information Retrieval Gaussian Mixture Distance for Information Retrieval X.Q. Li and I. King fxqli, ingg@cse.cuh.edu.h Department of omputer Science & Engineering The hinese University of Hong Kong Shatin, New Territories,

More information

Human Pose Tracking I: Basics. David Fleet University of Toronto

Human Pose Tracking I: Basics. David Fleet University of Toronto Human Pose Tracking I: Basics David Fleet University of Toronto CIFAR Summer School, 2009 Looking at People Challenges: Complex pose / motion People have many degrees of freedom, comprising an articulated

More information

TSRT14: Sensor Fusion Lecture 8

TSRT14: Sensor Fusion Lecture 8 TSRT14: Sensor Fusion Lecture 8 Particle filter theory Marginalized particle filter Gustaf Hendeby gustaf.hendeby@liu.se TSRT14 Lecture 8 Gustaf Hendeby Spring 2018 1 / 25 Le 8: particle filter theory,

More information

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008 Gaussian processes Chuong B Do (updated by Honglak Lee) November 22, 2008 Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern:

More information

Towards inference for skewed alpha stable Levy processes

Towards inference for skewed alpha stable Levy processes Towards inference for skewed alpha stable Levy processes Simon Godsill and Tatjana Lemke Signal Processing and Communications Lab. University of Cambridge www-sigproc.eng.cam.ac.uk/~sjg Overview Motivation

More information

Sensor Fusion: Particle Filter

Sensor Fusion: Particle Filter Sensor Fusion: Particle Filter By: Gordana Stojceska stojcesk@in.tum.de Outline Motivation Applications Fundamentals Tracking People Advantages and disadvantages Summary June 05 JASS '05, St.Petersburg,

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 218 Outlines Overview Introduction Linear Algebra Probability Linear Regression 1

More information

Online Algorithms for Sum-Product

Online Algorithms for Sum-Product Online Algorithms for Sum-Product Networks with Continuous Variables Priyank Jaini Ph.D. Seminar Consistent/Robust Tensor Decomposition And Spectral Learning Offline Bayesian Learning ADF, EP, SGD, oem

More information

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M. TIME SERIES ANALYSIS Forecasting and Control Fifth Edition GEORGE E. P. BOX GWILYM M. JENKINS GREGORY C. REINSEL GRETA M. LJUNG Wiley CONTENTS PREFACE TO THE FIFTH EDITION PREFACE TO THE FOURTH EDITION

More information

Bayesian Inference via Approximation of Log-likelihood for Priors in Exponential Family

Bayesian Inference via Approximation of Log-likelihood for Priors in Exponential Family 1 Bayesian Inference via Approximation of Log-lielihood for Priors in Exponential Family Tohid Ardeshiri, Umut Orguner, and Fredri Gustafsson arxiv:1510.015v1 [cs.lg 5 Oct 015 Abstract In this paper, a

More information

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability

More information

ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering

ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering ECE276A: Sensing & Estimation in Robotics Lecture 10: Gaussian Mixture and Particle Filtering Lecturer: Nikolay Atanasov: natanasov@ucsd.edu Teaching Assistants: Siwei Guo: s9guo@eng.ucsd.edu Anwesan Pal:

More information

CLOSE-TO-CLEAN REGULARIZATION RELATES

CLOSE-TO-CLEAN REGULARIZATION RELATES Worshop trac - ICLR 016 CLOSE-TO-CLEAN REGULARIZATION RELATES VIRTUAL ADVERSARIAL TRAINING, LADDER NETWORKS AND OTHERS Mudassar Abbas, Jyri Kivinen, Tapani Raio Department of Computer Science, School of

More information

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January

More information