Director of Research School of Mathematics
|
|
- Melanie Gibson
- 5 years ago
- Views:
Transcription
1 Multiprecision Research Matters Algorithms February Nick25, Higham 2009 School of Mathematics The University Nick Higham of Manchester Director of Research School of nickhigham.wordpress.com Waseda Research Institute for Science and Engineering Institute for Mathematical Science Kickoff Meeting, Waseda University, Tokyo, March 6, / 6
2 Outline Multiprecision arithmetic: floating point arithmetic supporting multiple, possibly arbitrary, precisions. Applications of & support for low precision. Applications of & support for high precision. How to exploit different precisions to achieve faster algs with higher accuracy. Download this talk from Nick Higham Multiprecision Algorithms 2 / 27
3 Floating Point Number System Floating point number system F R: y = ±m β e t, 0 m β t 1. Base β (β = 2 in practice), precision t, exponent range e min e e max. Assume normalized: m β t 1. Nick Higham Multiprecision Algorithms 3 / 27
4 Floating Point Number System Floating point number system F R: y = ±m β e t, 0 m β t 1. Base β (β = 2 in practice), precision t, exponent range e min e e max. Assume normalized: m β t 1. Floating point numbers are not equally spaced. If β = 2, t = 3, e min = 1, and e max = 3: Nick Higham Multiprecision Algorithms 3 / 27
5 IEEE Standard and 2008 Revision Type Size Range u = 2 t half 16 bits 10 ± single 32 bits 10 ± double 64 bits 10 ± quadruple 128 bits 10 ± Arithmetic ops (+,,, /, ) performed as if first calculated to infinite precision, then rounded. Default: round to nearest, round to even in case of tie. Half precision is a storage format only. Nick Higham Multiprecision Algorithms 4 / 27
6 Rounding For x R, fl(x) is an element of F nearest to x, and the transformation x fl(x) is called rounding (to nearest). Theorem If x R lies in the range of F then fl(x) = x(1 + δ), δ u. u := 1 2 β1 t is the unit roundoff, or machine precision. The machine epsilon, ɛ M = β 1 t is the spacing between 1 and the next larger floating point number (eps in MATLAB) Nick Higham Multiprecision Algorithms 5 / 27
7 Precision versus Accuracy fl(abc) = ab(1 + δ 1 ) c(1 + δ 2 ) δ i u, = abc(1 + δ 1 )(1 + δ 2 ) abc(1 + δ 1 + δ 2 ). Precision = u. Accuracy 2u. Nick Higham Multiprecision Algorithms 6 / 27
8 Precision versus Accuracy fl(abc) = ab(1 + δ 1 ) c(1 + δ 2 ) δ i u, = abc(1 + δ 1 )(1 + δ 2 ) abc(1 + δ 1 + δ 2 ). Precision = u. Accuracy 2u. Accuracy is not limited by precision Nick Higham Multiprecision Algorithms 6 / 27
9
10 NVIDIA Tesla P100 (2016) The Tesla P100 is the world s first accelerator built for deep learning, and has native hardware ISA support for FP16 arithmetic, delivering over 21 TeraFLOPS of FP16 processing power. Nick Higham Multiprecision Algorithms 8 / 27
11 AMD Radeon Instinct MI25 GPU (2017) 24.6 TFLOPS FP16 or 12.3 TFLOPS FP32 peak GPU compute performance on a single board... Up to 82 GFLOPS/watt FP16 or 41 GFLOPS/watt FP32 peak GPU compute performance Nick Higham Multiprecision Algorithms 9 / 27
12 Machine Learning For machine learning as well as for certain image processing and signal processing applications, more data at lower precision actually yields better results with certain algorithms than a smaller amount of more precise data. We find that very low precision is sufficient not just for running trained networks but also for training them. Courbariaux, Benji & David (2015) We re solving the wrong problem (Scheinberg, 2016), so don t need an accurate solution. Low precision provides regularization. See Jorge Nocedal s plenary talk Stochastic Gradient Methods for Machine Learning at SIAM CSE Nick Higham Multiprecision Algorithms 10 / 27
13 Climate Modelling T. Palmer, More reliable forecasts with less precise computations: a fast-track route to cloud-resolved weather and climate simulators?, Phil. Trans. R. Soc. A, 2014: Is there merit in representing variables at sufficiently high wavenumbers using half or even quarter precision floating-point numbers? T. Palmer, Build imprecise supercomputers, Nature, Nick Higham Multiprecision Algorithms 11 / 27
14 Error Analysis in Low Precision For an inner product x T y of n-vectors the standard error bound is fl(x T y) x T y nu x T y + O(u 2 ). In half precision, u , so nu = 1 for n = Nick Higham Multiprecision Algorithms 12 / 27
15 Error Analysis in Low Precision For an inner product x T y of n-vectors the standard error bound is fl(x T y) x T y nu x T y + O(u 2 ). In half precision, u , so nu = 1 for n = Most existing rounding error analysis guarantees no accuracy, and maybe not even a correct exponent, for half precision! Nick Higham Multiprecision Algorithms 12 / 27
16 Error Analysis in Low Precision For an inner product x T y of n-vectors the standard error bound is fl(x T y) x T y nu x T y + O(u 2 ). In half precision, u , so nu = 1 for n = Most existing rounding error analysis guarantees no accuracy, and maybe not even a correct exponent, for half precision! Is standard error analysis especially pessimistic in these applications? Try a statistical approach? Nick Higham Multiprecision Algorithms 12 / 27
17 Need for Higher Precision He and Ding, Using Accurate Arithmetics to Improve Numerical Reproducibility and Stability in Parallel Applications, Bailey, Barrio & Borwein, High-Precision Computation: Mathematical Physics & Dynamics, Khanna, High-Precision Numerical Simulations on a CUDA GPU: Kerr Black Hole Tails, Beliakov and Matiyasevich, A Parallel Algorithm for Calculation of Determinants and Minors Using Arbitrary Precision Arithmetic, Ma and Saunders, Solving Multiscale Linear Programs Using the Simplex Method in Quadruple Precision, Nick Higham Multiprecision Algorithms 13 / 27
18 Going to Higher Precision If we have quadruple or higher precision, how can we modify existing algorithms to exploit it? Nick Higham Multiprecision Algorithms 14 / 27
19 Going to Higher Precision If we have quadruple or higher precision, how can we modify existing algorithms to exploit it? To what extent are existing algs precision-independent? Newton-type algs: just decrease tol? How little higher precision can we get away with? Gradually increase precision through the iterations? For Krylov methods # iterations can depend on precision, so lower precision might not give fastest computation! Nick Higham Multiprecision Algorithms 14 / 27
20 Availability of Multiprecision in Software Maple, Mathematica, PARI/GP, Sage. MATLAB: Symbolic Math Toolbox, Multiprecision Computing Toolbox (Advanpix). Julia: BigFloat. Mpmath and SymPy for Python. GNU MP Library. GNU MPFR Library. (Quad only): some C, Fortran compilers. Gone, but not forgotten: Numerical Turing: Hull et al., Nick Higham Multiprecision Algorithms 15 / 27
21 Cost of Quadruple Precision How Fast is Quadruple Precision Arithmetic? Compare MATLAB double precision, Symbolic Math Toolbox, VPA arithmetic, digits(34), Multiprecision Computing Toolbox (Advanpix), mp.digits(34). Optimized for quad. Ratios of times Intel Broadwell-E Core 6 cores mp/double vpa/double vpa/mp LU, n = , eig, nonsymm, n = , eig, symm, n = , Nick Higham Multiprecision Algorithms 16 / 27
22 Matrix Functions (Inverse) scaling and squaring-type algorithms for e A, log A, cos A, A t use Padé approximants. Padé degree and algorithm parameters chosen to achieve double precision accuracy, u = Change u and the algorithm logic needs changing! Open questions, even for scalar elementary functions? Nick Higham Multiprecision Algorithms 17 / 27
23 Log Tables Henry Briggs ( ): Arithmetica Logarithmica (1624) Logarithms to base 10 of 1 20,000 and 90, ,000 to 14 decimal places. Briggs must be viewed as one of the great figures in numerical analysis. Herman H. Goldstine, A History of Numerical Analysis (1977) Name Year Range Decimal places R. de Prony , Edward Sang , Nick Higham Multiprecision Algorithms 18 / 27
24 The Scalar Logarithm: Comm. ACM J. R. Herndon (1961). Algorithm 48: Logarithm of a complex number. Nick Higham Multiprecision Algorithms 19 / 27
25 The Scalar Logarithm: Comm. ACM J. R. Herndon (1961). Algorithm 48: Logarithm of a complex number. A. P. Relph (1962). Certification of Algorithm 48: Logarithm of a complex number. Nick Higham Multiprecision Algorithms 19 / 27
26 The Scalar Logarithm: Comm. ACM J. R. Herndon (1961). Algorithm 48: Logarithm of a complex number. A. P. Relph (1962). Certification of Algorithm 48: Logarithm of a complex number. M. L. Johnson and W. Sangren, W. (1962). Remark on Algorithm 48: Logarithm of a complex number. Nick Higham Multiprecision Algorithms 19 / 27
27 The Scalar Logarithm: Comm. ACM J. R. Herndon (1961). Algorithm 48: Logarithm of a complex number. A. P. Relph (1962). Certification of Algorithm 48: Logarithm of a complex number. M. L. Johnson and W. Sangren, W. (1962). Remark on Algorithm 48: Logarithm of a complex number. D. S. Collens (1964). Remark on remarks on Algorithm 48: Logarithm of a complex number. Nick Higham Multiprecision Algorithms 19 / 27
28 The Scalar Logarithm: Comm. ACM J. R. Herndon (1961). Algorithm 48: Logarithm of a complex number. A. P. Relph (1962). Certification of Algorithm 48: Logarithm of a complex number. M. L. Johnson and W. Sangren, W. (1962). Remark on Algorithm 48: Logarithm of a complex number. D. S. Collens (1964). Remark on remarks on Algorithm 48: Logarithm of a complex number. D. S. Collens (1964). Algorithm 243: Logarithm of a complex number: Rewrite of Algorithm 48. Nick Higham Multiprecision Algorithms 19 / 27
29 Principal Logarithm and pth Root Let A C n n have no eigenvalues on R. Principal log X = log A denotes unique X such that e X = A. π < Im ( λ(x) ) < π. Nick Higham Multiprecision Algorithms 20 / 27
30 The Average Eye First order character of optical system characterized by transference matrix [ ] S δ T = R 5 5, 0 1 where S R 4 4 is symplectic: S T JS = J = [ 0 I2 I 2 0 ]. Average m 1 m i=1 T i is not a transference matrix. Harris (2005) proposes the average exp(m 1 m i=1 log T i). Nick Higham Multiprecision Algorithms 21 / 27
31 Inverse Scaling and Squaring log A = log ( A 1/2s ) 2 s = 2 s log ( A 1/2s ) Nick Higham Multiprecision Algorithms 22 / 27
32 Inverse Scaling and Squaring log A = log ( A 1/2s ) 2 s = 2 s log ( A 1/2s ) Algorithm Basic inverse scaling and squaring 1: s 0 3: while A I 2 1 do 4: A A 1/2 5: s s + 1 6: end while 7: Find Padé approximant r km to log(1 + x) 8: return 2 s r km (A I) Nick Higham Multiprecision Algorithms 22 / 27
33 Inverse Scaling and Squaring log A = log ( A 1/2s ) 2 s = 2 s log ( A 1/2s ) Algorithm Schur Padé inverse scaling and squaring 1: s 0 2: Compute the Schur decomposition A := QTQ 3: while T I 2 1 do 4: T T 1/2 5: s s + 1 6: end while 7: Find Padé approximant r km to log(1 + x) 8: return 2 s Q r km (T I)Q What degree for the Padé approximants? Should we take more square roots once T I 2 < 1? Nick Higham Multiprecision Algorithms 22 / 27
34 Bounding the Error Kenney & Laub (1989) derived absolute f err bound log(i + X) r km (X) log(1 X ) r km ( X ). Al-Mohy, H & Relton (2012, 2013) derive b err bound. Strategy requires symbolic and high prec. comp. (done off-line) and obtains parameters dependent on u. Fasi & H (2018) use a relative f err bound based on log(i + X) X, since X 1 is possible. Strategy: Use f err bound to minimize s subject to f err bounded by u (arbitrary) for suitable m. Nick Higham Multiprecision Algorithms 23 / 27
35 Sharper Error Bound for Nonnormal Matrices Al-Mohy & H (2009) note that c i A i i=l c i A i = c i ( A i 1/i) i i=l i=l i=l c i β i, where β = max i l A i 1/i. Fasi & H (2018) refine Kenney & Laub s bound using α p (X) = max ( X p 1/p, X p+1 1/(p+1)), in place of A. Note that ρ(a) α p (A) A. Even sharper bounds derived by Nadukandi & H (2018). Nick Higham Multiprecision Algorithms 24 / 27
36 Relative Errors: 64 Digits κ log (A)u logm mct logm agm logm tayl logm tfree tayl logm pade logm tfree pade Nick Higham Multiprecision Algorithms 25 / 27
37 Relative Errors: 256 Digits Nick Higham Multiprecision Algorithms 26 / 27
38 Conclusions Both low and high precision floating-point arithmetic becoming more prevalent, in hardware and software. Need better understanding of behaviour of algs in low precision arithmetic. Matrix functions at high precision need algs with (at least) different logic. In SIAM PP18, MS54 (Thurs, 4:50) I ll show that using three precisions in iter ref brings major benefits and permits faster and more accurate solution of Ax = b. Nick Higham Multiprecision Algorithms 27 / 27
39 References I D. H. Bailey, R. Barrio, and J. M. Borwein. High-precision computation: Mathematical physics and dynamics. Appl. Math. Comput., 218(20): , G. Beliakov and Y. Matiyasevich. A parallel algorithm for calculation of determinants and minors using arbitrary precision arithmetic. BIT, 56(1):33 50, Nick Higham Multiprecision Algorithms 1 / 7
40 References II E. Carson and N. J. Higham. Accelerating the solution of linear systems by iterative refinement in three precisions. MIMS EPrint , Manchester Institute for Mathematical Sciences, The University of Manchester, UK, July pp. Revised January To appear in SIAM J. Sci. Comput. E. Carson and N. J. Higham. A new analysis of iterative refinement and its application to accurate solution of ill-conditioned sparse linear systems. SIAM J. Sci. Comput., 39(6):A2834 A2856, Nick Higham Multiprecision Algorithms 2 / 7
41 References III M. Courbariaux, Y. Bengio, and J.-P. David. Training deep neural networks with low precision multiplications, ArXiv preprint v5. M. Fasi and N. J. Higham. Multiprecision algorithms for computing the matrix logarithm. MIMS EPrint , Manchester Institute for Mathematical Sciences, The University of Manchester, UK, May pp. Revised December To appear in SIAM J. Matrix Anal. Appl. Nick Higham Multiprecision Algorithms 3 / 7
42 References IV W. F. Harris. The average eye. Opthal. Physiol. Opt., 24: , Y. He and C. H. Q. Ding. Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications. J. Supercomputing, 18(3): , N. J. Higham. Accuracy and Stability of Numerical Algorithms. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, second edition, ISBN xxx+680 pp. Nick Higham Multiprecision Algorithms 4 / 7
43 References V G. Khanna. High-precision numerical simulations on a CUDA GPU: Kerr black hole tails. J. Sci. Comput., 56(2): , D. Ma and M. Saunders. Solving multiscale linear programs using the simplex method in quadruple precision. In M. Al-Baali, L. Grandinetti, and A. Purnama, editors, Numerical Analysis and Optimization, number 134 in Springer Proceedings in Mathematics and Statistics, pages Springer-Verlag, Berlin, Nick Higham Multiprecision Algorithms 5 / 7
44 References VI P. Nadukandi and N. J. Higham. Computing the wave-kernel matrix functions. MIMS EPrint , Manchester Institute for Mathematical Sciences, The University of Manchester, UK, Feb pp. A. Roldao-Lopes, A. Shahzad, G. A. Constantinides, and E. C. Kerrigan. More flops or more precision? Accuracy parameterizable linear equation solvers for model predictive control. In 17th IEEE Symposium on Field Programmable Custom Computing Machines, pages , Apr Nick Higham Multiprecision Algorithms 6 / 7
45 References VII K. Scheinberg. Evolution of randomness in optimization methods for supervised machine learning. SIAG/OPT Views and News, 24(1):1 8, Nick Higham Multiprecision Algorithms 7 / 7
Research CharlieMatters
Research CharlieMatters Van Loan and the Matrix Exponential February 25, 2009 Nick Nick Higham Director Schoolof ofresearch Mathematics The School University of Mathematics of Manchester http://www.maths.manchester.ac.uk/~higham
More informationError Bounds for Iterative Refinement in Three Precisions
Error Bounds for Iterative Refinement in Three Precisions Erin C. Carson, New York University Nicholas J. Higham, University of Manchester SIAM Annual Meeting Portland, Oregon July 13, 018 Hardware Support
More informationImproved Inverse Scaling and Squaring Algorithms for the Matrix Logarithm. Al-Mohy, Awad H. and Higham, Nicholas J. MIMS EPrint: 2011.
Improved Inverse Scaling and Squaring Algorithms for the Matrix Logarithm Al-Mohy, Awad H. and Higham, Nicholas J. 2011 MIMS EPrint: 2011.83 Manchester Institute for Mathematical Sciences School of Mathematics
More informationA more accurate Briggs method for the logarithm
Numer Algor (2012) 59:393 402 DOI 10.1007/s11075-011-9496-z ORIGINAL PAPER A more accurate Briggs method for the logarithm Awad H. Al-Mohy Received: 25 May 2011 / Accepted: 15 August 2011 / Published online:
More informationRehabilitating Research Correlations, Matters Avoiding Inversion, and Extracting Roots
Rehabilitating Research Correlations, Matters Avoiding Inversion, and Extracting Roots February 25, 2009 Nick Nick Higham Higham Director School of of Research Mathematics The University of Manchester
More informationTesting matrix function algorithms using identities. Deadman, Edvin and Higham, Nicholas J. MIMS EPrint:
Testing matrix function algorithms using identities Deadman, Edvin and Higham, Nicholas J. 2014 MIMS EPrint: 2014.13 Manchester Institute for Mathematical Sciences School of Mathematics The University
More informationConvergence of Rump s Method for Inverting Arbitrarily Ill-Conditioned Matrices
published in J. Comput. Appl. Math., 205(1):533 544, 2007 Convergence of Rump s Method for Inverting Arbitrarily Ill-Conditioned Matrices Shin ichi Oishi a,b Kunio Tanabe a Takeshi Ogita b,a Siegfried
More informationMATH20602 Numerical Analysis 1
M\cr NA Manchester Numerical Analysis MATH20602 Numerical Analysis 1 Martin Lotz School of Mathematics The University of Manchester Manchester, January 27, 2014 Outline General Course Information Introduction
More informationAccurate polynomial evaluation in floating point arithmetic
in floating point arithmetic Université de Perpignan Via Domitia Laboratoire LP2A Équipe de recherche en Informatique DALI MIMS Seminar, February, 10 th 2006 General motivation Provide numerical algorithms
More informationComputing the Action of the Matrix Exponential
Computing the Action of the Matrix Exponential Nick Higham School of Mathematics The University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Joint work with Awad H. Al-Mohy 16th ILAS
More informationECS 231 Computer Arithmetic 1 / 27
ECS 231 Computer Arithmetic 1 / 27 Outline 1 Floating-point numbers and representations 2 Floating-point arithmetic 3 Floating-point error analysis 4 Further reading 2 / 27 Outline 1 Floating-point numbers
More informationExact Determinant of Integer Matrices
Takeshi Ogita Department of Mathematical Sciences, Tokyo Woman s Christian University Tokyo 167-8585, Japan, ogita@lab.twcu.ac.jp Abstract. This paper is concerned with the numerical computation of the
More informationFLOATING POINT ARITHMETHIC - ERROR ANALYSIS
FLOATING POINT ARITHMETHIC - ERROR ANALYSIS Brief review of floating point arithmetic Model of floating point arithmetic Notation, backward and forward errors 3-1 Roundoff errors and floating-point arithmetic
More informationElements of Floating-point Arithmetic
Elements of Floating-point Arithmetic Sanzheng Qiao Department of Computing and Software McMaster University July, 2012 Outline 1 Floating-point Numbers Representations IEEE Floating-point Standards Underflow
More informationAlgorithms for Solving the Polynomial Eigenvalue Problem
Algorithms for Solving the Polynomial Eigenvalue Problem Nick Higham School of Mathematics The University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Joint work with D. Steven Mackey
More informationFLOATING POINT ARITHMETHIC - ERROR ANALYSIS
FLOATING POINT ARITHMETHIC - ERROR ANALYSIS Brief review of floating point arithmetic Model of floating point arithmetic Notation, backward and forward errors Roundoff errors and floating-point arithmetic
More informationThe numerical stability of barycentric Lagrange interpolation
IMA Journal of Numerical Analysis (2004) 24, 547 556 The numerical stability of barycentric Lagrange interpolation NICHOLAS J. HIGHAM Department of Mathematics, University of Manchester, Manchester M13
More informationFunctions of Matrices and
Functions of Matrices and Nearest Research Correlation MattersMatrices February 25, 2009 Nick Higham School Nick Higham of Mathematics The Director University of Research of Manchester Nicholas.J.Higham@manchester.ac.uk
More informationGPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic
GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago
More informationLAPACK-Style Codes for Pivoted Cholesky and QR Updating. Hammarling, Sven and Higham, Nicholas J. and Lucas, Craig. MIMS EPrint: 2006.
LAPACK-Style Codes for Pivoted Cholesky and QR Updating Hammarling, Sven and Higham, Nicholas J. and Lucas, Craig 2007 MIMS EPrint: 2006.385 Manchester Institute for Mathematical Sciences School of Mathematics
More informationACM 106a: Lecture 1 Agenda
1 ACM 106a: Lecture 1 Agenda Introduction to numerical linear algebra Common problems First examples Inexact computation What is this course about? 2 Typical numerical linear algebra problems Systems of
More informationAccelerating linear algebra computations with hybrid GPU-multicore systems.
Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)
More informationSolving the Polynomial Eigenvalue Problem by Linearization
Solving the Polynomial Eigenvalue Problem by Linearization Nick Higham School of Mathematics The University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Joint work with Ren-Cang Li,
More informationIntroduction to Scientific Computing Languages
1 / 21 Introduction to Scientific Computing Languages Prof. Paolo Bientinesi pauldj@aices.rwth-aachen.de Numerical Representation 2 / 21 Numbers 123 = (first 40 digits) 29 4.241379310344827586206896551724137931034...
More informationComputing the pth Roots of a Matrix. with Repeated Eigenvalues
Applied Mathematical Sciences, Vol. 5, 2011, no. 53, 2645-2661 Computing the pth Roots of a Matrix with Repeated Eigenvalues Amir Sadeghi 1, Ahmad Izani Md. Ismail and Azhana Ahmad School of Mathematical
More informationMATH20602 Numerical Analysis 1
M\cr NA Manchester Numerical Analysis MATH20602 Numerical Analysis 1 Martin Lotz School of Mathematics The University of Manchester Manchester, February 1, 2016 Outline General Course Information Introduction
More informationThe Complex Step Approximation to the Fréchet Derivative of a Matrix Function. Al-Mohy, Awad H. and Higham, Nicholas J. MIMS EPrint: 2009.
The Complex Step Approximation to the Fréchet Derivative of a Matrix Function Al-Mohy, Awad H. and Higham, Nicholas J. 2009 MIMS EPrint: 2009.31 Manchester Institute for Mathematical Sciences School of
More informationChapter 1 Error Analysis
Chapter 1 Error Analysis Several sources of errors are important for numerical data processing: Experimental uncertainty: Input data from an experiment have a limited precision. Instead of the vector of
More informationReliable real arithmetic. one-dimensional dynamical systems
Reliable real arithmetic and one-dimensional dynamical systems Keith Briggs, BT Research Keith.Briggs@bt.com http://keithbriggs.info QMUL Mathematics Department 2007 March 27 1600 typeset 2007 March 26
More informationMath 411 Preliminaries
Math 411 Preliminaries Provide a list of preliminary vocabulary and concepts Preliminary Basic Netwon s method, Taylor series expansion (for single and multiple variables), Eigenvalue, Eigenvector, Vector
More informationElements of Floating-point Arithmetic
Elements of Floating-point Arithmetic Sanzheng Qiao Department of Computing and Software McMaster University July, 2012 Outline 1 Floating-point Numbers Representations IEEE Floating-point Standards Underflow
More informationFloating Point Number Systems. Simon Fraser University Surrey Campus MACM 316 Spring 2005 Instructor: Ha Le
Floating Point Number Systems Simon Fraser University Surrey Campus MACM 316 Spring 2005 Instructor: Ha Le 1 Overview Real number system Examples Absolute and relative errors Floating point numbers Roundoff
More informationCompute the behavior of reality even if it is impossible to observe the processes (for example a black hole in astrophysics).
1 Introduction Read sections 1.1, 1.2.1 1.2.4, 1.2.6, 1.3.8, 1.3.9, 1.4. Review questions 1.1 1.6, 1.12 1.21, 1.37. The subject of Scientific Computing is to simulate the reality. Simulation is the representation
More informationAlgorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method
Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method Ilya B. Labutin A.A. Trofimuk Institute of Petroleum Geology and Geophysics SB RAS, 3, acad. Koptyug Ave., Novosibirsk
More informationAn Algorithm for. Nick Higham. Françoise Tisseur. Director of Research School of Mathematics.
An Algorithm for the Research Complete Matters Solution of Quadratic February Eigenvalue 25, 2009 Problems Nick Higham Françoise Tisseur Director of Research School of Mathematics The School University
More informationSampling exactly from the normal distribution
1 Sampling exactly from the normal distribution Charles Karney charles.karney@sri.com SRI International AofA 2017, Princeton, June 20, 2017 Background: In my day job, I ve needed normal (Gaussian) deviates
More informationApplied Mathematics 205. Unit 0: Overview of Scientific Computing. Lecturer: Dr. David Knezevic
Applied Mathematics 205 Unit 0: Overview of Scientific Computing Lecturer: Dr. David Knezevic Scientific Computing Computation is now recognized as the third pillar of science (along with theory and experiment)
More informationTR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems
TR-0-07 A Comparison of the Performance of ::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems Ang Li, Omkar Deshmukh, Radu Serban, Dan Negrut May, 0 Abstract ::GPU is a
More informationImproved Damped Quasi-Newton Methods for Unconstrained Optimization
Improved Damped Quasi-Newton Methods for Unconstrained Optimization Mehiddin Al-Baali and Lucio Grandinetti August 2015 Abstract Recently, Al-Baali (2014) has extended the damped-technique in the modified
More informationChapter 1 Mathematical Preliminaries and Error Analysis
Numerical Analysis (Math 3313) 2019-2018 Chapter 1 Mathematical Preliminaries and Error Analysis Intended learning outcomes: Upon successful completion of this chapter, a student will be able to (1) list
More informationComputing least squares condition numbers on hybrid multicore/gpu systems
Computing least squares condition numbers on hybrid multicore/gpu systems M. Baboulin and J. Dongarra and R. Lacroix Abstract This paper presents an efficient computation for least squares conditioning
More informationImplementation of float float operators on graphic hardware
Implementation of float float operators on graphic hardware Guillaume Da Graça David Defour DALI, Université de Perpignan Outline Why GPU? Floating point arithmetic on GPU Float float format Problem &
More informationScaling, Sensitivity and Stability in Numerical Solution of the Quadratic Eigenvalue Problem
Scaling, Sensitivity and Stability in Numerical Solution of the Quadratic Eigenvalue Problem Nick Higham School of Mathematics The University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/
More informationIntroduction, basic but important concepts
Introduction, basic but important concepts Felix Kubler 1 1 DBF, University of Zurich and Swiss Finance Institute October 7, 2017 Felix Kubler Comp.Econ. Gerzensee, Ch1 October 7, 2017 1 / 31 Economics
More informationComputing the matrix cosine
Numerical Algorithms 34: 13 26, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. Computing the matrix cosine Nicholas J. Higham and Matthew I. Smith Department of Mathematics, University
More informationLAPACK-Style Codes for Pivoted Cholesky and QR Updating
LAPACK-Style Codes for Pivoted Cholesky and QR Updating Sven Hammarling 1, Nicholas J. Higham 2, and Craig Lucas 3 1 NAG Ltd.,Wilkinson House, Jordan Hill Road, Oxford, OX2 8DR, England, sven@nag.co.uk,
More informationAccurate Solution of Triangular Linear System
SCAN 2008 Sep. 29 - Oct. 3, 2008, at El Paso, TX, USA Accurate Solution of Triangular Linear System Philippe Langlois DALI at University of Perpignan France Nicolas Louvet INRIA and LIP at ENS Lyon France
More informationarxiv:cs/ v1 [cs.ms] 13 Aug 2004
tsnnls: A solver for large sparse least squares problems with non-negative variables Jason Cantarella Department of Mathematics, University of Georgia, Athens, GA 30602 Michael Piatek arxiv:cs/0408029v
More informationLecture 7. Floating point arithmetic and stability
Lecture 7 Floating point arithmetic and stability 2.5 Machine representation of numbers Scientific notation: 23 }{{} }{{} } 3.14159265 {{} }{{} 10 sign mantissa base exponent (significand) s m β e A floating
More informationAdaptive and Efficient Algorithm for 2D Orientation Problem
Japan J. Indust. Appl. Math., 26 (2009), 215 231 Area 2 Adaptive and Efficient Algorithm for 2D Orientation Problem Katsuhisa Ozaki, Takeshi Ogita, Siegfried M. Rump and Shin ichi Oishi Research Institute
More informationCHAPTER 11. A Revision. 1. The Computers and Numbers therein
CHAPTER A Revision. The Computers and Numbers therein Traditional computer science begins with a finite alphabet. By stringing elements of the alphabet one after another, one obtains strings. A set of
More informationThe Scaling and Squaring Method for the Matrix Exponential Revisited. Nicholas J. Higham
The Scaling and Squaring Method for the Matrix Exponential Revisited Nicholas J. Higham 2005 MIMS EPrint: 2006.394 Manchester Institute for Mathematical Sciences School of Mathematics The University of
More informationOn-the-fly backward error estimate for matrix exponential approximation by Taylor algorithm
On-the-fly backward error estimate for matrix exponential approximation by Taylor algorithm M. Caliari a,, F. Zivcovich b a Department of Computer Science, University of Verona, Italy b Department of Mathematics,
More informationComputation of the error functions erf and erfc in arbitrary precision with correct rounding
Computation of the error functions erf and erfc in arbitrary precision with correct rounding Sylvain Chevillard Arenaire, LIP, ENS-Lyon, France Sylvain.Chevillard@ens-lyon.fr Nathalie Revol INRIA, Arenaire,
More informationHybrid Analog-Digital Solution of Nonlinear Partial Differential Equations
Hybrid Analog-Digital Solution of Nonlinear Partial Differential Equations Yipeng Huang, Ning Guo, Kyle Mandli, Mingoo Seok, Yannis Tsividis, Simha Sethumadhavan Columbia University Hybrid Analog-Digital
More informationOptimizing the Representation of Intervals
Optimizing the Representation of Intervals Javier D. Bruguera University of Santiago de Compostela, Spain Numerical Sofware: Design, Analysis and Verification Santander, Spain, July 4-6 2012 Contents 1
More informationAccelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers
UT College of Engineering Tutorial Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers Stan Tomov 1, George Bosilca 1, and Cédric
More informationParallel Polynomial Evaluation
Parallel Polynomial Evaluation Jan Verschelde joint work with Genady Yoffe University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science http://www.math.uic.edu/ jan jan@math.uic.edu
More informationClassification of Hand-Written Digits Using Scattering Convolutional Network
Mid-year Progress Report Classification of Hand-Written Digits Using Scattering Convolutional Network Dongmian Zou Advisor: Professor Radu Balan Co-Advisor: Dr. Maneesh Singh (SRI) Background Overview
More informationIntroduction to Scientific Computing Languages
1 / 19 Introduction to Scientific Computing Languages Prof. Paolo Bientinesi pauldj@aices.rwth-aachen.de Numerical Representation 2 / 19 Numbers 123 = (first 40 digits) 29 4.241379310344827586206896551724137931034...
More informationTEACHING NUMERICAL LINEAR ALGEBRA AT THE UNDERGRADUATE LEVEL by Biswa Nath Datta Department of Mathematical Sciences Northern Illinois University
TEACHING NUMERICAL LINEAR ALGEBRA AT THE UNDERGRADUATE LEVEL by Biswa Nath Datta Department of Mathematical Sciences Northern Illinois University DeKalb, IL 60115 E-mail: dattab@math.niu.edu What is Numerical
More informationJim Lambers MAT 610 Summer Session Lecture 2 Notes
Jim Lambers MAT 610 Summer Session 2009-10 Lecture 2 Notes These notes correspond to Sections 2.2-2.4 in the text. Vector Norms Given vectors x and y of length one, which are simply scalars x and y, the
More informationAccurate Multiple-Precision Gauss-Legendre Quadrature
Accurate Multiple-Precision Gauss-Legendre Quadrature Laurent Fousse Université Henri-Poincaré Nancy 1 laurent@komite.net Abstract Numerical integration is an operation that is frequently available in
More informationSolving Triangular Systems More Accurately and Efficiently
17th IMACS World Congress, July 11-15, 2005, Paris, France Scientific Computation, Applied Mathematics and Simulation Solving Triangular Systems More Accurately and Efficiently Philippe LANGLOIS and Nicolas
More informationAutomatic Verified Numerical Computations for Linear Systems
Automatic Verified Numerical Computations for Linear Systems Katsuhisa Ozaki (Shibaura Institute of Technology) joint work with Takeshi Ogita (Tokyo Woman s Christian University) Shin ichi Oishi (Waseda
More informationHomework 2 Foundations of Computational Math 1 Fall 2018
Homework 2 Foundations of Computational Math 1 Fall 2018 Note that Problems 2 and 8 have coding in them. Problem 2 is a simple task while Problem 8 is very involved (and has in fact been given as a programming
More informationUtilizing Quadruple-Precision Floating Point Arithmetic Operation for the Krylov Subspace Methods. SIAM Conference on Applied 1
Utilizing Quadruple-Precision Floating Point Arithmetic Operation for the Krylov Subspace Methods SIAM Conference on Applied 1 Outline Krylov Subspace methods Results of Accurate Computing Tools for Accurate
More informationOn the computation of the reciprocal of floating point expansions using an adapted Newton-Raphson iteration
On the computation of the reciprocal of floating point expansions using an adapted Newton-Raphson iteration Mioara Joldes, Valentina Popescu, Jean-Michel Muller ASAP 2014 1 / 10 Motivation Numerical problems
More informationNotes for Chapter 1 of. Scientific Computing with Case Studies
Notes for Chapter 1 of Scientific Computing with Case Studies Dianne P. O Leary SIAM Press, 2008 Mathematical modeling Computer arithmetic Errors 1999-2008 Dianne P. O'Leary 1 Arithmetic and Error What
More informationSPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics
SPARSE SOLVERS FOR THE POISSON EQUATION Margreet Nool CWI, Multiscale Dynamics November 9, 2015 OUTLINE OF THIS TALK 1 FISHPACK, LAPACK, PARDISO 2 SYSTEM OVERVIEW OF CARTESIUS 3 POISSON EQUATION 4 SOLVERS
More informationHow to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC Hardware Peter Düben European Weather Centre (ECMWF) Peter Düben Page 2 The European Weather Centre (ECMWF) www.ecmwf.int Independent, intergovernmental
More informationComputing logarithms and other special functions
Computing logarithms and other special functions Mike Giles University of Oxford Mathematical Institute Napier 400 NAIS Symposium April 2, 2014 Mike Giles (Oxford) Computing special functions April 2,
More informationA Backward Stable Hyperbolic QR Factorization Method for Solving Indefinite Least Squares Problem
A Backward Stable Hyperbolic QR Factorization Method for Solving Indefinite Least Suares Problem Hongguo Xu Dedicated to Professor Erxiong Jiang on the occasion of his 7th birthday. Abstract We present
More informationSPECIAL PROJECT PROGRESS REPORT
SPECIAL PROJECT PROGRESS REPORT Progress Reports should be 2 to 10 pages in length, depending on importance of the project. All the following mandatory information needs to be provided. Reporting year
More informationPreface to Second Edition... vii. Preface to First Edition...
Contents Preface to Second Edition..................................... vii Preface to First Edition....................................... ix Part I Linear Algebra 1 Basic Vector/Matrix Structure and
More informationSquare root of a symmetric matrix
Square root of a symmetric matrix Mario Berljafa Stefan Güttel June 2015 Contents 1 Introduction 1 2 Running rkfit 1 3 Evaluating the rational approximant 2 4 Some different choices for the initial poles
More informationFunctions of a Matrix: Theory and Computation
Functions of a Matrix: Theory and Computation Nick Higham School of Mathematics The University of Manchester higham@ma.man.ac.uk http://www.ma.man.ac.uk/~higham/ Landscapes in Mathematical Science, University
More informationNumerical Algorithms. IE 496 Lecture 20
Numerical Algorithms IE 496 Lecture 20 Reading for This Lecture Primary Miller and Boxer, Pages 124-128 Forsythe and Mohler, Sections 1 and 2 Numerical Algorithms Numerical Analysis So far, we have looked
More informationVerification methods for linear systems using ufp estimation with rounding-to-nearest
NOLTA, IEICE Paper Verification methods for linear systems using ufp estimation with rounding-to-nearest Yusuke Morikura 1a), Katsuhisa Ozaki 2,4, and Shin ichi Oishi 3,4 1 Graduate School of Fundamental
More informationSOLUTION of linear systems of equations of the form:
Proceedings of the Federated Conference on Computer Science and Information Systems pp. Mixed precision iterative refinement techniques for the WZ factorization Beata Bylina Jarosław Bylina Institute of
More informationBounds for Variable Degree Rational L Approximations to the Matrix Cosine
Bounds for Variable Degree Rational L Approximations to the Matrix Cosine Ch. Tsitouras a, V.N. Katsikis b a TEI of Sterea Hellas, GR34400, Psahna, Greece b TEI of Piraeus, GR12244, Athens, Greece Abstract
More informationCME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 5. Ax = b.
CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 5 GENE H GOLUB Suppose we want to solve We actually have an approximation ξ such that 1 Perturbation Theory Ax = b x = ξ + e The question is, how
More informationTaylor s Theorem for Matrix Functions with Applications to Condition Number Estimation. Deadman, Edvin and Relton, Samuel. MIMS EPrint: 2015.
Taylor s Theorem for Matrix Functions with Applications to Condition Number Estimation Deadman, Edvin and Relton, Samuel 215 MIMS EPrint: 215.27 Manchester Institute for Mathematical Sciences School of
More informationResearch Matters. February 25, The Nonlinear Eigenvalue Problem. Nick Higham. Part III. Director of Research School of Mathematics
Research Matters February 25, 2009 The Nonlinear Eigenvalue Problem Nick Higham Part III Director of Research School of Mathematics Françoise Tisseur School of Mathematics The University of Manchester
More informationNumerical Methods in Matrix Computations
Ake Bjorck Numerical Methods in Matrix Computations Springer Contents 1 Direct Methods for Linear Systems 1 1.1 Elements of Matrix Theory 1 1.1.1 Matrix Algebra 2 1.1.2 Vector Spaces 6 1.1.3 Submatrices
More informationWhat Every Programmer Should Know About Floating-Point Arithmetic DRAFT. Last updated: November 3, Abstract
What Every Programmer Should Know About Floating-Point Arithmetic Last updated: November 3, 2014 Abstract The article provides simple answers to the common recurring questions of novice programmers about
More informationNumerical Linear Algebra
Schedule Prerequisite Preliminaries Errors and Algorithms Numerical Linear Algebra Kim, Hyun-Min Department of Mathematics, Pusan National University E-mail:hyunmin@pusan.ac.kr Phone: 510-1060, 2596, 010-3833-8200
More informationFitting an artificial frequency response
Fitting an artificial frequency response Mario Berljafa Stefan Güttel February 215 Contents 1 Introduction 1 2 Problem setup 1 3 Testing RKFIT 2 4 The rkfun class 3 5 Some other choices for the initial
More information1 Number Systems and Errors 1
Contents 1 Number Systems and Errors 1 1.1 Introduction................................ 1 1.2 Number Representation and Base of Numbers............. 1 1.2.1 Normalized Floating-point Representation...........
More informationCompressing exterior Helmholtz problems
Compressing exterior Helmholtz problems Stefan Güttel October 2016 Contents 1 Introduction 1 2 A two-layer waveguide example 2 3 Other examples 6 4 References 6 1 Introduction Finite difference (FD) methods
More informationThe restarted QR-algorithm for eigenvalue computation of structured matrices
Journal of Computational and Applied Mathematics 149 (2002) 415 422 www.elsevier.com/locate/cam The restarted QR-algorithm for eigenvalue computation of structured matrices Daniela Calvetti a; 1, Sun-Mi
More informationSparse Solutions of Systems of Equations and Sparse Modelling of Signals and Images
Sparse Solutions of Systems of Equations and Sparse Modelling of Signals and Images Alfredo Nava-Tudela ant@umd.edu John J. Benedetto Department of Mathematics jjb@umd.edu Abstract In this project we are
More informationNumerical Methods - Numerical Linear Algebra
Numerical Methods - Numerical Linear Algebra Y. K. Goh Universiti Tunku Abdul Rahman 2013 Y. K. Goh (UTAR) Numerical Methods - Numerical Linear Algebra I 2013 1 / 62 Outline 1 Motivation 2 Solving Linear
More informationCS 542G: Conditioning, BLAS, LU Factorization
CS 542G: Conditioning, BLAS, LU Factorization Robert Bridson September 22, 2008 1 Why some RBF Kernel Functions Fail We derived some sensible RBF kernel functions, like φ(r) = r 2 log r, from basic principles
More informationNumerical Methods - Preliminaries
Numerical Methods - Preliminaries Y. K. Goh Universiti Tunku Abdul Rahman 2013 Y. K. Goh (UTAR) Numerical Methods - Preliminaries 2013 1 / 58 Table of Contents 1 Introduction to Numerical Methods Numerical
More informationMath 200 A and B: Linear Algebra Spring Term 2007 Course Description
Math 200 A and B: Linear Algebra Spring Term 2007 Course Description February 25, 2007 Instructor: John Schmitt Warner 311, Ext. 5952 jschmitt@middlebury.edu Office Hours: Monday, Wednesday 11am-12pm,
More informationWhat is it we are looking for in these algorithms? We want algorithms that are
Fundamentals. Preliminaries The first question we want to answer is: What is computational mathematics? One possible definition is: The study of algorithms for the solution of computational problems in
More informationIntroduction to Applied Linear Algebra with MATLAB
Sigam Series in Applied Mathematics Volume 7 Rizwan Butt Introduction to Applied Linear Algebra with MATLAB Heldermann Verlag Contents Number Systems and Errors 1 1.1 Introduction 1 1.2 Number Representation
More information1 Backward and Forward Error
Math 515 Fall, 2008 Brief Notes on Conditioning, Stability and Finite Precision Arithmetic Most books on numerical analysis, numerical linear algebra, and matrix computations have a lot of material covering
More informationInteger Factorisation on the AP1000
Integer Factorisation on the AP000 Craig Eldershaw Mathematics Department University of Queensland St Lucia Queensland 07 cs9@student.uq.edu.au Richard P. Brent Computer Sciences Laboratory Australian
More information