On a Parallel Implementation of the One-Sided Block Jacobi SVD Algorithm
|
|
- Timothy Small
- 5 years ago
- Views:
Transcription
1 Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons On a Parallel Implementaton of the One-Sded Block Jacob SVD Algorthm Gabrel Okša 1, Martn Bečka, 1 Marán Vateršc 2 1 Insttute of Mathematcs Slovak Academy of Scences Bratslava, Slovaka 2 Insttute of Scentfc Computng Unversty of Salzburg Salzburg, Austra Harrachov 2007, Czech Republc, August 19-25, 2007 Gabrel Okša: Jacob SVD 1/16
2 Outlne Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons 1 formulaton 2 One-Sded Block-Jacob Algorthm 3 Acceleratng 4 Parallelzaton 5 Conclusons Gabrel Okša: Jacob SVD 2/16
3 Our task Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Compute n parallel the Sngular Value Decomposton (SVD) of a complex matrx A of the sze m n, m n: ( ) Σ A = U V H., 0 where U(m m) and V (n n) are orthogonal and Σ = dag(σ ) wth σ 1 σ 2 σ n. Numercally stable way of computaton: one- or two-sded block-jacob methods; large degree of parallelsm. Target archtecture: dstrbuted memory machnes (parallel supercomputers and clusters) wth Message Passng Interface (MPI). Gabrel Okša: Jacob SVD 3/16
4 Man structure of Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons A R m n, m n, ts block-column parttonng A = [A 1, A 2,..., A r ], n = (r 1)n 0 + n r, n r n 0. The OSBJA can be wrtten as an teratve process: A (0) = A, V (0) = I n, A (k+1) = A (k) U (k), V (k+1) = V (k) U (k), k 0, (1) where the n n OG matrx U (k) (block rotaton): I U (k) U (k) U (k) = I. (2) U (k) U (k) A (k) U (k) n (1) mutually orthogonalzes the columns between A (k) and A (k). I Gabrel Okša: Jacob SVD 4/16
5 Pvot Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons The orthogonal matrx Û (k) = ( U (k) U (k) U (k) U (k) of order n + n s called the pvot submatrx of U (k) at step k. At each step k of, the pvot par (, ) s chosen accordng to a gven pvot F : {0, 1,...} P r = {(l, m) : 1 l < m r}. Well-known strateges: cyclc row, cyclc column. ) Gabrel Okša: Jacob SVD 5/16
6 3 parts of one (seral) step Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Part 1: The pvot par (, ) s gven, compute the symmetrc, postve sem-defnte cross-product matrx ( (k)t A A (k) A (k)t A (k) )  (k) = [A (k) A (k) ] T [A (k) A (k) ] = A (k)t A (k) A (k)t Complexty: m (n + n )(n + n 1)/2 flops. When A (k). dagonal blocks of Â(k) are dagonal, ths reduces to m (n n + n + n ) flops. Part 2: Â(k) s dagonalzed by ts egenvalue decomposton: Û (k)t  (k) Û (k) (k) = ˆΛ, Complexty: on average 8(n + n ) 3 flops (Har 2005). Gabrel Okša: Jacob SVD 6/16
7 3 parts of one (seral) step (cont.) Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Part 3: Û(k) defnes, after ts parttonng, U (k) n (2), whch s used for updates of A (k) and V (k) n (1). Complexty: 2m(n + n ) 2 flops. Overall complexty: One step of requres: N flop (k) m(n n + n + n ) + 8(n + n ) 3 + 2m(n + n ) 2 flops. = 64n (9n n 0)n (f m = n = n 0 r) If we can choose the block wdth small enough, all computatons of Part 2 can be performed n cache snce Â(k) s of order only n + n. The most complex computaton s n Part 3 (matrx products). How to accelerate t? Gabrel Okša: Jacob SVD 7/16
8 Matrx preprocessng Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Step 1: QR factorzaton wth column pvotng followed by the LQ factorzaton of the R-factor: AP = Q 1 R and R = LQ2 T. The OSBJA s then appled to the L-factor. Advantages: 1 The off-dagonal norm of the L-factor s concentrated near the man dagonal and the Jacob method usng specal orderngs can be much faster and accurate (Har 2005). 2 Rght sngular vectors can be computed a posteror: LV = UΣ, where UΣ s the fnal stage of the Jacob method appled to L (Drmač 1999). V s not updated durng teratons! Then the SVD of A: A = Q 1 LQ2 T PT = (Q 1 U)Σ(PQ 2 V ) T (postprocessng). Complexty: 2mn n 3 flops + work n the Jacob teratons. Gabrel Okša: Jacob SVD 8/16
9 Step 2: Specal ntalzaton of the recurson of three matrces. We have: L = [L 1, L 2,..., L r ]. Compute: 1. for = 1 : r 2. Â (0) = L T L ; 3. Â (0) = Q (0) 4. (A (0) = L Q (0) 5. end; Γ (0) Q (0)T ; (spectral decomposton) ); (not performed, ust the connecton) Intalze three block matrces wth blocks of small orders n, n : B (0) = [B (0) 1, B(0) 2,..., B(0) r ] = [L 1, L 2,..., L r ], Q (0) = dag(q (0) 11, Q(0) 22,..., Q(0) rr ), Γ (0) = dag(γ (0) 1, Γ(0) 2,..., Γ(0) r ) (stored as vector). Complexty: for n = n 0 r about (n/4 + 8n 0 )n 0 n flops (assumng 4 sweeps n step 3).
10 Recurson k k + 1 Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons 1. Compute the cross-product matrx: ( ) Q (k) T ( (k)t B B (k) Â (k) = = ( Γ (k) Ã (k)t Q (k) Ã (k) Γ (k) ) B (k)t B (k), where Ã(k) B (k)t B (k)t Q (k)t B (k) B (k) ) ( (B (k)t Q (k) Q (k) B (k) )Q (k). 2. Compute the egendecomposton: Â(k) = Û(k) ˆΛ(k) Û (k)t. Copy the egenvalues nto approprate postons of Γ (k+1). 3. Compute the CS decomposton: ( ) ( V (k) (k) C Û (k) = V (k) ˆV (k) ˆT (k) Ŵ (k)t. S (k) S (k) C (k) ) ( W (k) W (k) ) T Gabrel Okša: Jacob SVD 10/16 )
11 Recurson k k + 1 (cont.) Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons 4. Compute new matrces B and Q: (B (k+1), B (k+1) ) = (B (k) Q (k+1) (Q (k) V (k) ), B (k) (Q (k) V (k) = W (k)t, Q (k+1) = W (k)t. ) ˆT (k), Man dea: the large dmenson m s totally elmnated from recurson. Matrx multplcatons are of the form XY, where X s of order n n or n n, and Y s square of order n or n. Fnal update of B and B : specal structure of ˆT (k) (rotatons of columns of length n). Overall complexty (Har 2005): N flop (k) 82n (3n n 0)n. Gabrel Okša: Jacob SVD 11/16
12 Stoppng crteron Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Let:  (k) = A (k)t A (k) = à (k) 11 à (k) 1r D k = (dag(â(k) )) 1/2 = dag( A (k) e 1,..., A (k) e n ) = (Γ (k) ) 1/2.. à (k)t 1r à (k) rr  (k) S = D 1 k  (k) D 1 k (scaled matrx). Two measures of convergence:, 2 α k off(â(k) S ) = 2 Â(k) S I F, r r ω k off(â(k) ) = Ã(k) t 2 F. =1 t=+1 Computaton of α k : n 3 /2 flops. Gabrel Okša: Jacob SVD 12/16
13 Stoppng crteron (cont.) Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Stoppng crteron: α k n 2 ɛ. At the end of block sweep t + 1: (t+1)n 1 ω(t+1)n 2 = ω2 tn k=tn Ã(k) (k),(k) 2 F, N = r(r 1)/2. Easy update of ω 2, but a severe cancellaton may occur. Assumng the quadratc convergence of ω for t t 0, montor: ν tn (t+1)n 1 Ã(k) (k),(k) 2 F = ω tn + O(ωtN 2 ), t t 0. k=tn When ν tn n γ ((t+1)n) 1 ɛ wth γ ((t+1)n) 1 = max[γ ((t+1)n) ], compute α (t+1)n. Gabrel Okša: Jacob SVD 13/16
14 Parallel mplementaton Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Data layout: p processors, two block columns per processor: r = 2p. Preprocessng: The QRFCP and the LQF va the ScaLAPACK s routne PDGEQPF and PDGELQF, respectvely. Each processor computes ts local cross-product matrx and two spectral decompostons of ts dagonal blocks Â(0) ll. Recurson: 1 Choose some parallel block orderng (e.g., round-robn). 2 Each processor stores two block columns and matrx blocks B, B, Q, Q, and two vectors γ, γ. In the worst case, ths amount of data must be transferred between processors at the begnnng of each parallel teraton step (= r/2 subtasks) accordng to the orderng. 3 All computatons are local (LAPACK), no global communcaton. Gabrel Okša: Jacob SVD 14/16
15 Parallel mplementaton (cont.) Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Stoppng crteron: Its mplementaton requres the global communcaton. Update of ω 2 and ν: Local computaton of the squared Frobenus norm of each nullfed matrx block n each processor, then the global sum of local squares n PE 1, and, fnally, the broadcast of an updated value to all processors: routnes MPI_ALLREDUCE, MPI_ALLGATHER and MPI_BCAST from the ScaLAPACK. Computaton of α: Scalng of the columns and rows of B by the values stored n vector γ, the square of Frobenus norm, the broadcast of new α: routnes MPI_ALLGATHERV, MPI_ALLREDUCE, MPI_ALLGATHER, MPI_BCAST. Gabrel Okša: Jacob SVD 15/16
16 Conclusons Jacob SVD Gabrel Okša formulaton One-Sded Block-Jacob Algorthm Acceleratng Parallelzaton Conclusons Matrx preprocessng: Concentraton of the Frobenus norm towards man dagonal leads to a substantal reducton of the number Jacob sweeps needed for the convergence. Workng wth matrx blocks: Enables to accelerate the computaton even n the seral case usng a fast cache memory. Three-matrx recurson: Elmnates the large dmenson m, all updates can be made n a fast cache memory. Parallelzaton: Addtonal speed-up possble, especally for large n (number of columns). Gabrel Okša: Jacob SVD 16/16
Parallel One-Sided Block Jacobi SVD Algorithm: I. Analysis and Design
Parallel One-Sded Block Jacob SVD Algorthm: I. Analyss and Desgn Gabrel Okša a Marán Vateršc a Mathematcal Insttute, Department of Informatcs, Slovak Academy of Scences, Bratslava, Slovak Republc Techncal
More informationGeneralization of the Dynamic Ordering for the One-Sided Block Jacobi SVD Algorithm: I. Analysis and Design
Generalzaton of the Dynamc Orderng for the One-Sded Block Jacob SVD Algorthm: I. Analyss and Desgn Martn Bečka a Marán Vateršc a Mathematcal Insttute, Department of Informatcs, Slovak Academy of Scences,
More informationSingular Value Decomposition: Theory and Applications
Sngular Value Decomposton: Theory and Applcatons Danel Khashab Sprng 2015 Last Update: March 2, 2015 1 Introducton A = UDV where columns of U and V are orthonormal and matrx D s dagonal wth postve real
More informationA New Refinement of Jacobi Method for Solution of Linear System Equations AX=b
Int J Contemp Math Scences, Vol 3, 28, no 17, 819-827 A New Refnement of Jacob Method for Soluton of Lnear System Equatons AX=b F Naem Dafchah Department of Mathematcs, Faculty of Scences Unversty of Gulan,
More informationSome Comments on Accelerating Convergence of Iterative Sequences Using Direct Inversion of the Iterative Subspace (DIIS)
Some Comments on Acceleratng Convergence of Iteratve Sequences Usng Drect Inverson of the Iteratve Subspace (DIIS) C. Davd Sherrll School of Chemstry and Bochemstry Georga Insttute of Technology May 1998
More informationORTHOGONALIZATION WITH A NON-STANDARD INNER PRODUCT WITH THE APPLICATION TO PRECONDITIONING
ORTHOGONALIZATION WITH A NON-STANDARD INNER PRODUCT WITH THE APPLICATION TO PRECONDITIONING Mroslav Rozložník jont work wth Jří Kopal, Alcja Smoktunowcz and Mroslav Tůma Insttute of Computer Scence, Czech
More informationLecture 3. Ax x i a i. i i
18.409 The Behavor of Algorthms n Practce 2/14/2 Lecturer: Dan Spelman Lecture 3 Scrbe: Arvnd Sankar 1 Largest sngular value In order to bound the condton number, we need an upper bound on the largest
More informationSTAT 309: MATHEMATICAL COMPUTATIONS I FALL 2018 LECTURE 17. a ij x (k) b i. a ij x (k+1) (D + L)x (k+1) = b Ux (k)
STAT 309: MATHEMATICAL COMPUTATIONS I FALL 08 LECTURE 7. sor method remnder: n coordnatewse form, Jacob method s = [ b a x (k) a and Gauss Sedel method s = [ b a = = remnder: n matrx form, Jacob method
More informationPerron Vectors of an Irreducible Nonnegative Interval Matrix
Perron Vectors of an Irreducble Nonnegatve Interval Matrx Jr Rohn August 4 2005 Abstract As s well known an rreducble nonnegatve matrx possesses a unquely determned Perron vector. As the man result of
More informationCS 770G - Parallel Algorithms in Scientific Computing
References CS 770G - Parallel Algorthms n Scentfc Computng Parallel Sortng Introducton to Parallel Computng Kumar, Grama, Gupta, Karyps, Benjamn Cummngs. A porton of the notes comes from Prof. J. Demmel
More informationSTAT 309: MATHEMATICAL COMPUTATIONS I FALL 2018 LECTURE 16
STAT 39: MATHEMATICAL COMPUTATIONS I FALL 218 LECTURE 16 1 why teratve methods f we have a lnear system Ax = b where A s very, very large but s ether sparse or structured (eg, banded, Toepltz, banded plus
More informationU.C. Berkeley CS294: Beyond Worst-Case Analysis Luca Trevisan September 5, 2017
U.C. Berkeley CS94: Beyond Worst-Case Analyss Handout 4s Luca Trevsan September 5, 07 Summary of Lecture 4 In whch we ntroduce semdefnte programmng and apply t to Max Cut. Semdefnte Programmng Recall that
More informationChapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems
Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons
More informationErrors for Linear Systems
Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch
More informationYong Joon Ryang. 1. Introduction Consider the multicommodity transportation problem with convex quadratic cost function. 1 2 (x x0 ) T Q(x x 0 )
Kangweon-Kyungk Math. Jour. 4 1996), No. 1, pp. 7 16 AN ITERATIVE ROW-ACTION METHOD FOR MULTICOMMODITY TRANSPORTATION PROBLEMS Yong Joon Ryang Abstract. The optmzaton problems wth quadratc constrants often
More informationSalmon: Lectures on partial differential equations. Consider the general linear, second-order PDE in the form. ,x 2
Salmon: Lectures on partal dfferental equatons 5. Classfcaton of second-order equatons There are general methods for classfyng hgher-order partal dfferental equatons. One s very general (applyng even to
More informationHongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k)
ISSN 1749-3889 (prnt), 1749-3897 (onlne) Internatonal Journal of Nonlnear Scence Vol.17(2014) No.2,pp.188-192 Modfed Block Jacob-Davdson Method for Solvng Large Sparse Egenproblems Hongy Mao, College of
More informationMEM 255 Introduction to Control Systems Review: Basics of Linear Algebra
MEM 255 Introducton to Control Systems Revew: Bascs of Lnear Algebra Harry G. Kwatny Department of Mechancal Engneerng & Mechancs Drexel Unversty Outlne Vectors Matrces MATLAB Advanced Topcs Vectors A
More information5 The Rational Canonical Form
5 The Ratonal Canoncal Form Here p s a monc rreducble factor of the mnmum polynomal m T and s not necessarly of degree one Let F p denote the feld constructed earler n the course, consstng of all matrces
More informationCHALMERS, GÖTEBORGS UNIVERSITET. SOLUTIONS to RE-EXAM for ARTIFICIAL NEURAL NETWORKS. COURSE CODES: FFR 135, FIM 720 GU, PhD
CHALMERS, GÖTEBORGS UNIVERSITET SOLUTIONS to RE-EXAM for ARTIFICIAL NEURAL NETWORKS COURSE CODES: FFR 35, FIM 72 GU, PhD Tme: Place: Teachers: Allowed materal: Not allowed: January 2, 28, at 8 3 2 3 SB
More informationKey words. QR factorization, rank revealing, column pivoting, minimize communication
COMMUNICATION AVOIDING RANK REVEALING QR FACTORIZATION WITH COLUMN PIVOTING JAMES W. DEMMEL, LAURA GRIGORI, MING GU, AND HUA XIANG Abstract. In ths paper we ntroduce CARRQR, a communcaton avodng rank revealng
More informationEfficient, General Point Cloud Registration with Kernel Feature Maps
Effcent, General Pont Cloud Regstraton wth Kernel Feature Maps Hanchen Xong, Sandor Szedmak, Justus Pater Insttute of Computer Scence Unversty of Innsbruck 30 May 2013 Hanchen Xong (Un.Innsbruck) 3D Regstraton
More informationNorms, Condition Numbers, Eigenvalues and Eigenvectors
Norms, Condton Numbers, Egenvalues and Egenvectors 1 Norms A norm s a measure of the sze of a matrx or a vector For vectors the common norms are: N a 2 = ( x 2 1/2 the Eucldean Norm (1a b 1 = =1 N x (1b
More informationTime-Varying Systems and Computations Lecture 6
Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy
More informationInexact Newton Methods for Inverse Eigenvalue Problems
Inexact Newton Methods for Inverse Egenvalue Problems Zheng-jan Ba Abstract In ths paper, we survey some of the latest development n usng nexact Newton-lke methods for solvng nverse egenvalue problems.
More informationVector Norms. Chapter 7 Iterative Techniques in Matrix Algebra. Cauchy-Bunyakovsky-Schwarz Inequality for Sums. Distances. Convergence.
Vector Norms Chapter 7 Iteratve Technques n Matrx Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematcs Unversty of Calforna, Berkeley Math 128B Numercal Analyss Defnton A vector norm
More informationA Parallel Multisplitting Solution of the Least Squares Problem
NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS Numer. Lnear Algebra Appl., 5, 11 31 (1998) A Parallel Multsplttng Soluton of the Least Squares Problem R. A. Renaut Department of Mathematcs, Arzona State Unversty,
More informationBootstrap AMG for Markov Chain Computations
Bootstrap AMG for Markov Chan Computatons Karsten Kahl Bergsche Unverstät Wuppertal May 27, 200 Outlne Markov Chans Subspace Egenvalue Approxmaton Ingredents of Least Squares Interpolaton Egensolver Bootstrap
More informationρ some λ THE INVERSE POWER METHOD (or INVERSE ITERATION) , for , or (more usually) to
THE INVERSE POWER METHOD (or INVERSE ITERATION) -- applcaton of the Power method to A some fxed constant ρ (whch s called a shft), x λ ρ If the egenpars of A are { ( λ, x ) } ( ), or (more usually) to,
More informationMATH Homework #2
MATH609-601 Homework #2 September 27, 2012 1. Problems Ths contans a set of possble solutons to all problems of HW-2. Be vglant snce typos are possble (and nevtable). (1) Problem 1 (20 pts) For a matrx
More information1 Matrix representations of canonical matrices
1 Matrx representatons of canoncal matrces 2-d rotaton around the orgn: ( ) cos θ sn θ R 0 = sn θ cos θ 3-d rotaton around the x-axs: R x = 1 0 0 0 cos θ sn θ 0 sn θ cos θ 3-d rotaton around the y-axs:
More informationCME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 13
CME 30: NUMERICAL LINEAR ALGEBRA FALL 005/06 LECTURE 13 GENE H GOLUB 1 Iteratve Methods Very large problems (naturally sparse, from applcatons): teratve methods Structured matrces (even sometmes dense,
More informationOn the Interval Zoro Symmetric Single-step Procedure for Simultaneous Finding of Polynomial Zeros
Appled Mathematcal Scences, Vol. 5, 2011, no. 75, 3693-3706 On the Interval Zoro Symmetrc Sngle-step Procedure for Smultaneous Fndng of Polynomal Zeros S. F. M. Rusl, M. Mons, M. A. Hassan and W. J. Leong
More informationParallel Implementation of the Givens Rotations in the Neural Network Learning Algorithm
Parallel Implementaton of the Gvens Rotatons n the Neural Network Learnng Algorthm Jaros law Blsk 1(B, Bartosz Kowalczyk 1, and Jacek M. Żurada2 1 Insttute of Computatonal Intellgence, Czȩstochowa Unversty
More informationMath 217 Fall 2013 Homework 2 Solutions
Math 17 Fall 013 Homework Solutons Due Thursday Sept. 6, 013 5pm Ths homework conssts of 6 problems of 5 ponts each. The total s 30. You need to fully justfy your answer prove that your functon ndeed has
More informationDFT with Planewaves pseudopotential accuracy (LDA, PBE) Fast time to solution 1 step in minutes (not hours!!!) to be useful for MD
LLNL-PRES-673679 Ths work was performed under the auspces of the U.S. Department of Energy by under contract DE-AC52-07NA27344. Lawrence Lvermore Natonal Securty, LLC Sequoa, IBM BGQ, 1,572,864 cores O(N)
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationExplicit constructions of all separable two-qubits density matrices and related problems for three-qubits systems
Explct constructons of all separable two-qubts densty matrces and related problems for three-qubts systems Y. en-ryeh and. Mann Physcs Department, Technon-Israel Insttute of Technology, Hafa 2000, Israel
More informationThe lower and upper bounds on Perron root of nonnegative irreducible matrices
Journal of Computatonal Appled Mathematcs 217 (2008) 259 267 wwwelsevercom/locate/cam The lower upper bounds on Perron root of nonnegatve rreducble matrces Guang-Xn Huang a,, Feng Yn b,keguo a a College
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More information2.3 Nilpotent endomorphisms
s a block dagonal matrx, wth A Mat dm U (C) In fact, we can assume that B = B 1 B k, wth B an ordered bass of U, and that A = [f U ] B, where f U : U U s the restrcton of f to U 40 23 Nlpotent endomorphsms
More informationPreconditioned Parallel Block Jacobi SVD Algorithm
Parallel Numerics 5, 15-24 M. Vajteršic, R. Trobec, P. Zinterhof, A. Uhl (Eds.) Chapter 2: Matrix Algebra ISBN 961-633-67-8 Preconditioned Parallel Block Jacobi SVD Algorithm Gabriel Okša 1, Marián Vajteršic
More informationRelaxation Methods for Iterative Solution to Linear Systems of Equations
Relaxaton Methods for Iteratve Soluton to Lnear Systems of Equatons Gerald Recktenwald Portland State Unversty Mechancal Engneerng Department gerry@pdx.edu Overvew Techncal topcs Basc Concepts Statonary
More informationDeriving the X-Z Identity from Auxiliary Space Method
Dervng the X-Z Identty from Auxlary Space Method Long Chen Department of Mathematcs, Unversty of Calforna at Irvne, Irvne, CA 92697 chenlong@math.uc.edu 1 Iteratve Methods In ths paper we dscuss teratve
More informationOutline and Reading. Dynamic Programming. Dynamic Programming revealed. Computing Fibonacci. The General Dynamic Programming Technique
Outlne and Readng Dynamc Programmng The General Technque ( 5.3.2) -1 Knapsac Problem ( 5.3.3) Matrx Chan-Product ( 5.3.1) Dynamc Programmng verson 1.4 1 Dynamc Programmng verson 1.4 2 Dynamc Programmng
More informationThe Order Relation and Trace Inequalities for. Hermitian Operators
Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence
More informationMarkov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement
Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs
More informationDynamic Systems on Graphs
Prepared by F.L. Lews Updated: Saturday, February 06, 200 Dynamc Systems on Graphs Control Graphs and Consensus A network s a set of nodes that collaborates to acheve what each cannot acheve alone. A network,
More informationform, and they present results of tests comparng the new algorthms wth other methods. Recently, Olschowka & Neumaer [7] ntroduced another dea for choo
Scalng and structural condton numbers Arnold Neumaer Insttut fur Mathematk, Unverstat Wen Strudlhofgasse 4, A-1090 Wen, Austra emal: neum@cma.unve.ac.at revsed, August 1996 Abstract. We ntroduce structural
More informationLecture 4: Constant Time SVD Approximation
Spectral Algorthms and Representatons eb. 17, Mar. 3 and 8, 005 Lecture 4: Constant Tme SVD Approxmaton Lecturer: Santosh Vempala Scrbe: Jangzhuo Chen Ths topc conssts of three lectures 0/17, 03/03, 03/08),
More informationFormulas for the Determinant
page 224 224 CHAPTER 3 Determnants e t te t e 2t 38 A = e t 2te t e 2t e t te t 2e 2t 39 If 123 A = 345, 456 compute the matrx product A adj(a) What can you conclude about det(a)? For Problems 40 43, use
More informationReport on Image warping
Report on Image warpng Xuan Ne, Dec. 20, 2004 Ths document summarzed the algorthms of our mage warpng soluton for further study, and there s a detaled descrpton about the mplementaton of these algorthms.
More informationA property of the elementary symmetric functions
Calcolo manuscrpt No. (wll be nserted by the edtor) A property of the elementary symmetrc functons A. Esnberg, G. Fedele Dp. Elettronca Informatca e Sstemstca, Unverstà degl Stud della Calabra, 87036,
More informationProgramming Project 1: Molecular Geometry and Rotational Constants
Programmng Project 1: Molecular Geometry and Rotatonal Constants Center for Computatonal Chemstry Unversty of Georga Athens, Georga 30602 Summer 2012 1 Introducton Ths programmng project s desgned to provde
More information1 GSW Iterative Techniques for y = Ax
1 for y = A I m gong to cheat here. here are a lot of teratve technques that can be used to solve the general case of a set of smultaneous equatons (wrtten n the matr form as y = A), but ths chapter sn
More informationThe Synchronous 8th-Order Differential Attack on 12 Rounds of the Block Cipher HyRAL
The Synchronous 8th-Order Dfferental Attack on 12 Rounds of the Block Cpher HyRAL Yasutaka Igarash, Sej Fukushma, and Tomohro Hachno Kagoshma Unversty, Kagoshma, Japan Emal: {garash, fukushma, hachno}@eee.kagoshma-u.ac.jp
More informationGeneralized Linear Methods
Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set
More informationA Parallel Symmetric Block-Tridiagonal Divide-and-Conquer Algorithm
A Parallel Symmetrc Block-rdagonal Dvde-and-Conquer Algorthm Yhua Ba * and Robert C. Ward * echncal Report U-CS-05-57 Unversty of ennessee December 005 Abstract We present a parallel mplementaton of the
More informationSlobodan Lakić. Communicated by R. Van Keer
Serdca Math. J. 21 (1995), 335-344 AN ITERATIVE METHOD FOR THE MATRIX PRINCIPAL n-th ROOT Slobodan Lakć Councated by R. Van Keer In ths paper we gve an teratve ethod to copute the prncpal n-th root and
More informationLecture 3: Dual problems and Kernels
Lecture 3: Dual problems and Kernels C4B Machne Learnng Hlary 211 A. Zsserman Prmal and dual forms Lnear separablty revsted Feature mappng Kernels for SVMs Kernel trck requrements radal bass functons SVM
More information4DVAR, according to the name, is a four-dimensional variational method.
4D-Varatonal Data Assmlaton (4D-Var) 4DVAR, accordng to the name, s a four-dmensonal varatonal method. 4D-Var s actually a drect generalzaton of 3D-Var to handle observatons that are dstrbuted n tme. The
More informationCalculation of time complexity (3%)
Problem 1. (30%) Calculaton of tme complexty (3%) Gven n ctes, usng exhaust search to see every result takes O(n!). Calculaton of tme needed to solve the problem (2%) 40 ctes:40! dfferent tours 40 add
More informationDISCRIMINANTS AND RAMIFIED PRIMES. 1. Introduction A prime number p is said to be ramified in a number field K if the prime ideal factorization
DISCRIMINANTS AND RAMIFIED PRIMES KEITH CONRAD 1. Introducton A prme number p s sad to be ramfed n a number feld K f the prme deal factorzaton (1.1) (p) = po K = p e 1 1 peg g has some e greater than 1.
More informationQuantum Mechanics I - Session 4
Quantum Mechancs I - Sesson 4 Aprl 3, 05 Contents Operators Change of Bass 4 3 Egenvectors and Egenvalues 5 3. Denton....................................... 5 3. Rotaton n D....................................
More informationOverall Problem. Parallel Program Design Patterns and Strategies. Contents. 1. Patterns for Functional Decomposition
Parallel Program Desgn Patterns and Strateges 1 ontents 1. Patterns for Functonal Decomposton. Patterns for Dstrbutng Tasks to Processors Mke Baley mjb@cs.oregonstate.edu 3. Patterns for Data Decomposton
More informationFall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede
Fall 0 Analyss of Expermental easurements B. Esensten/rev. S. Errede We now reformulate the lnear Least Squares ethod n more general terms, sutable for (eventually extendng to the non-lnear case, and also
More informationPh 219a/CS 219a. Exercises Due: Wednesday 23 October 2013
1 Ph 219a/CS 219a Exercses Due: Wednesday 23 October 2013 1.1 How far apart are two quantum states? Consder two quantum states descrbed by densty operators ρ and ρ n an N-dmensonal Hlbert space, and consder
More informationInterpolative Separable Density Fitting Decomposition for Accelerating Hybrid Density Functional Calculations with Applications to Defects in Silicon
Cte Ths: J. Chem. Theory Comput. 07,, 540-54 pubs.acs.org/jctc Interpolatve Separable Densty Fttng Decomposton for Acceleratng Hybrd Densty Functonal Calculatons wth Applcatons to Defects n Slcon We Hu,*,
More informationInner Product. Euclidean Space. Orthonormal Basis. Orthogonal
Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,
More informationDistributed and Stochastic Machine Learning on Big Data
Dstrbuted and Stochastc Machne Learnng on Bg Data Department of Computer Scence and Engneerng Hong Kong Unversty of Scence and Technology Hong Kong Introducton Synchronous ADMM Asynchronous ADMM Stochastc
More informationNumerical Properties of the LLL Algorithm
Numercal Propertes of the LLL Algorthm Frankln T. Luk a and Sanzheng Qao b a Department of Mathematcs, Hong Kong Baptst Unversty, Kowloon Tong, Hong Kong b Dept. of Computng and Software, McMaster Unv.,
More informationNumerische Mathematik
Numer Math (5) Dgtal Obect Identfer (DOI) 7/s-5-595-4 Numersche Mathematk Fast and stable QR egenvalue algorthms for generalzed companon matrces and secular equatons Daro A Bn, Luca Gemgnan, Vctor Y Pan
More informationDynamic Programming. Preview. Dynamic Programming. Dynamic Programming. Dynamic Programming (Example: Fibonacci Sequence)
/24/27 Prevew Fbonacc Sequence Longest Common Subsequence Dynamc programmng s a method for solvng complex problems by breakng them down nto smpler sub-problems. It s applcable to problems exhbtng the propertes
More informationSL n (F ) Equals its Own Derived Group
Internatonal Journal of Algebra, Vol. 2, 2008, no. 12, 585-594 SL n (F ) Equals ts Own Derved Group Jorge Macel BMCC-The Cty Unversty of New York, CUNY 199 Chambers street, New York, NY 10007, USA macel@cms.nyu.edu
More informationA FORMULA FOR COMPUTING INTEGER POWERS FOR ONE TYPE OF TRIDIAGONAL MATRIX
Hacettepe Journal of Mathematcs and Statstcs Volume 393 0 35 33 FORMUL FOR COMPUTING INTEGER POWERS FOR ONE TYPE OF TRIDIGONL MTRIX H Kıyak I Gürses F Yılmaz and D Bozkurt Receved :08 :009 : ccepted 5
More informationAppendix B. The Finite Difference Scheme
140 APPENDIXES Appendx B. The Fnte Dfference Scheme In ths appendx we present numercal technques whch are used to approxmate solutons of system 3.1 3.3. A comprehensve treatment of theoretcal and mplementaton
More informationREDUCTION MODULO p. We will prove the reduction modulo p theorem in the general form as given by exercise 4.12, p. 143, of [1].
REDUCTION MODULO p. IAN KIMING We wll prove the reducton modulo p theorem n the general form as gven by exercse 4.12, p. 143, of [1]. We consder an ellptc curve E defned over Q and gven by a Weerstraß
More informationPRECONDITIONING IN THE PARALLEL BLOCK-JACOBI SVD ALGORITHM
Proceedings of ALGORITMY 25 pp. 22 211 PRECONDITIONING IN THE PARALLEL BLOCK-JACOBI SVD ALGORITHM GABRIEL OKŠA AND MARIÁN VAJTERŠIC Abstract. One way, how to speed up the computation of the singular value
More informationSupport Vector Machines. Vibhav Gogate The University of Texas at dallas
Support Vector Machnes Vbhav Gogate he Unversty of exas at dallas What We have Learned So Far? 1. Decson rees. Naïve Bayes 3. Lnear Regresson 4. Logstc Regresson 5. Perceptron 6. Neural networks 7. K-Nearest
More informationIntroduction to Simulation - Lecture 5. QR Factorization. Jacob White. Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy
Introducton to Smulaton - Lecture 5 QR Factorzaton Jacob Whte hanks to Deepak Ramaswamy, Mchal Rewensk, and Karen Veroy Sngular Example LU Factorzaton Fals Strut Jont Load force he resultng nodal matrx
More informationMatrix Approximation via Sampling, Subspace Embedding. 1 Solving Linear Systems Using SVD
Matrx Approxmaton va Samplng, Subspace Embeddng Lecturer: Anup Rao Scrbe: Rashth Sharma, Peng Zhang 0/01/016 1 Solvng Lnear Systems Usng SVD Two applcatons of SVD have been covered so far. Today we loo
More informationModified Mass Matrices and Positivity Preservation for Hyperbolic and Parabolic PDEs
COMMUNICATIONS IN NUMERICAL METHODS IN ENGINEERING Commun. Numer. Meth. Engng 2000; 00:6 Prepared usng cnmauth.cls [Verson: 2000/03/22 v.0] Modfed Mass Matrces and Postvty Preservaton for Hyperbolc and
More informationTwo-level Parallel Computation for Approximate Inverse with AISM Method
Vol 49 No 6 2164 2168 (June 2008) AISM 2 1 2 3 2 AISM SGI Orgn 2400 MPI PE16 PE 13672 Two-level Parallel Computaton for Approxmate Inverse wth AISM Method Lnje Zhang, 1 Kentaro Morya 2 and Takash Nodera
More informationTHE EFFECT OF TORSIONAL RIGIDITY BETWEEN ELEMENTS ON FREE VIBRATIONS OF A TELESCOPIC HYDRAULIC CYLINDER SUBJECTED TO EULER S LOAD
Journal of Appled Mathematcs and Computatonal Mechancs 7, 6(3), 7- www.amcm.pcz.pl p-issn 99-9965 DOI:.75/jamcm.7.3. e-issn 353-588 THE EFFECT OF TORSIONAL RIGIDITY BETWEEN ELEMENTS ON FREE VIBRATIONS
More informationSpin-rotation coupling of the angularly accelerated rigid body
Spn-rotaton couplng of the angularly accelerated rgd body Loua Hassan Elzen Basher Khartoum, Sudan. Postal code:11123 E-mal: louaelzen@gmal.com November 1, 2017 All Rghts Reserved. Abstract Ths paper s
More informationThere are two approaches to Hensel lftng. Lnear lftng starts wth polynomals f = f (0) and teratvely constructs polynomals f () such that ()f () f (?)
On Bvarate Hensel Lftng and ts Parallelzaton Laurent Bernardn Insttut fur Wssenschaftlches Rechnen ETH Zurch bernardn@nf.ethz.ch Abstract We present a new parallel algorthm for performng lnear Hensel lftng
More informationComputing Correlated Equilibria in Multi-Player Games
Computng Correlated Equlbra n Mult-Player Games Chrstos H. Papadmtrou Presented by Zhanxang Huang December 7th, 2005 1 The Author Dr. Chrstos H. Papadmtrou CS professor at UC Berkley (taught at Harvard,
More informationLinear Algebra and its Applications
Lnear Algebra and ts Applcatons 4 (00) 5 56 Contents lsts avalable at ScenceDrect Lnear Algebra and ts Applcatons journal homepage: wwwelsevercom/locate/laa Notes on Hlbert and Cauchy matrces Mroslav Fedler
More informationMultilayer Perceptron (MLP)
Multlayer Perceptron (MLP) Seungjn Cho Department of Computer Scence and Engneerng Pohang Unversty of Scence and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjn@postech.ac.kr 1 / 20 Outlne
More informationWeek 11: Chapter 11. The Vector Product. The Vector Product Defined. The Vector Product and Torque. More About the Vector Product
The Vector Product Week 11: Chapter 11 Angular Momentum There are nstances where the product of two vectors s another vector Earler we saw where the product of two vectors was a scalar Ths was called the
More informationConvexity preserving interpolation by splines of arbitrary degree
Computer Scence Journal of Moldova, vol.18, no.1(52), 2010 Convexty preservng nterpolaton by splnes of arbtrary degree Igor Verlan Abstract In the present paper an algorthm of C 2 nterpolaton of dscrete
More informationA. Further Background on Gauss Quadrature
A. Further Background on Gauss Quadrature Gauss quadrature for matrx nverse forms wth applcatons We present below a more detaled summary of materal on Gauss quadrature to make the paper self-contaned.
More information= = = (a) Use the MATLAB command rref to solve the system. (b) Let A be the coefficient matrix and B be the right-hand side of the system.
Chapter Matlab Exercses Chapter Matlab Exercses. Consder the lnear system of Example n Secton.. x x x y z y y z (a) Use the MATLAB command rref to solve the system. (b) Let A be the coeffcent matrx and
More informationExample: (13320, 22140) =? Solution #1: The divisors of are 1, 2, 3, 4, 5, 6, 9, 10, 12, 15, 18, 20, 27, 30, 36, 41,
The greatest common dvsor of two ntegers a and b (not both zero) s the largest nteger whch s a common factor of both a and b. We denote ths number by gcd(a, b), or smply (a, b) when there s no confuson
More informationON THE GENERAL ALGEBRAIC INVERSE EIGENVALUE PROBLEMS. 1. Introduction
Journal of Computatonal Mathematcs, Vol.22, No.4, 2004, 567 580. ON THE GENERAL ALGEBRAIC INVERSE EIGENVALUE PROBLEMS Yu-ha Zhang (Department of Mathematcs, Shong Unversty, Jnan 250100, Chna) (ICMSEC,
More informationA Riemannian Limited-Memory BFGS Algorithm for Computing the Matrix Geometric Mean
A Remannan Lmted-Memory BFGS Algorthm for Computng the Matrx Geometrc Mean Xnru Yuan 1, Wen Huang 2, P.-A. Absl 2 and Kyle A. Gallvan 1 1 Florda State Unversty 2 Unversté Catholque de Louvan July 3, 2016
More informationAn efficient algorithm for multivariate Maclaurin Newton transformation
Annales UMCS Informatca AI VIII, 2 2008) 5 14 DOI: 10.2478/v10065-008-0020-6 An effcent algorthm for multvarate Maclaurn Newton transformaton Joanna Kapusta Insttute of Mathematcs and Computer Scence,
More informationSpeeding up Computation of Scalar Multiplication in Elliptic Curve Cryptosystem
H.K. Pathak et. al. / (IJCSE) Internatonal Journal on Computer Scence and Engneerng Speedng up Computaton of Scalar Multplcaton n Ellptc Curve Cryptosystem H. K. Pathak Manju Sangh S.o.S n Computer scence
More informationMatrix Multiplication on Two Interconnected Processors
Matrx Multplcaton on Two Interconnected Processors Brett A. Becker School of Computer Scence and Informatcs Unversty College Dubln (UCD), Belfeld, Dubln 4, Ireland brett.becker@ucd.e Alexey Lastovetsky
More informationCSC321 Tutorial 9: Review of Boltzmann machines and simulated annealing
CSC321 Tutoral 9: Revew of Boltzmann machnes and smulated annealng (Sldes based on Lecture 16-18 and selected readngs) Yue L Emal: yuel@cs.toronto.edu Wed 11-12 March 19 Fr 10-11 March 21 Outlne Boltzmann
More information