How to Write Fast Numerical Code
|
|
- Egbert Carpenter
- 5 years ago
- Views:
Transcription
1 How to Write Fast Numerical Code Lecture: Discrete Fourier transform, fast Fourier transforms Instructor: Markus Püschel TA: Georg Ofenbeck & Daniele Spampinato Rest of Semester Today Lecture Project meetings Project presentations 1 minutes each random order random speaker 2
2 Linear Transforms Overview: Transforms and algorithms Discrete Fourier transform Fast Fourier transforms After that: Optimized implementation and autotuning (FFTW) Automatic program synthesis (Spiral) 3 FFT References Complexity: Bürgisser, Clausen, Shokrollahi, Algebraic Complexity Theory, Springer, 1997 History: Heideman, Johnson, Burrus: Gauss and the History of the Fast Fourier Transform, Arch. Hist. Sc. 34(3) 1985 FFTs: Cooley and Tukey, An algorithm for the machine calculation of complex Fourier series, Math. of Computation, vol. 19, pp , 1965 Nussbaumer, Fast Fourier Transform and Convolution Algorithms, 2nd ed., Springer, 1982 van Loan, Computational Frameworks for the Fast Fourier Transform, SIAM, 1992 Tolimieri, An, Lu, Algorithms for Discrete Fourier Transforms and Convolution, Springer, 2nd edition, 1997 Franchetti, Püschel, Voronenko, Chellappa and Moura, Discrete Fourier Transform on Multicore, IEEE Signal Processing Magazine, special issue on ``Signal Processing on Platforms with Multiple Cores'', Vol. 26, No. 6, pp. 9-12, 29 4
3 Linear Transforms Very important class of functions: signal processing, scientific computing, Mathematically: Change of basis = Multiplication by a fixed matrix T 1 y y B 1. A = y = T x y n 1 Output T T = [t k;`] k;`<n x x x = B x n 1 Input 1 C A Equivalent definition: Summation form n 1 X y k = t k;`x`; `= k < n 5 Smallest Relevant Example: DFT, Size 2 Transform (matrix): T = " # Computation: y = " # 1 1 x 1 1 or y = x + x 1 y 1 = x x 1 As graph (direct acyclic graph or DAG): y x y 1 x 1 called a butterfly 211_2_1_archive.html 6
4 Transforms: Examples A few dozen transforms are relevant Some examples DFT n = [e 2k`¼i=n ] k;`<n 8 < cos 2¼k` RDFT n = [r k`] k;`<n ; r k` = n ; k bn 2 c : sin 2¼k` n ; k > bn 2 c DHT = h cos(2k`¼=n) + sin(2k`¼=n) i k;`<n " # WHTn=2 WHT WHT n = n=2 ; WHT WHT n=2 WHT 2 = DFT 2 n=2 IMDCT n = h cos((2k + 1)(2` n)¼=4n) i k<2n; `<n h i DCT-2 n = cos(k(2` + 1)¼=2n) k;`<n DCT-3 n = DCT-2 T n (transpose) universal tool MPEG JPEG DCT-4 n = h cos((2k + 1)(2` + 1)¼=4n) i k;`<n 7 Blackboard Discrete Fourier transform (DFT) Transform algorithms Fast Fourier transform, size 4 8
5 Linear Transforms: DFT 1 y y B 1. A = y = T x y n 1 Output T x x x = B x n 1 Input 1 C A Example: T = DFT n = [e 2k`¼i=n ] k;`<n = [! n k` ] k;`<n ;! n = e 2¼i=n 9 Algorithms: Example FFT, n = 4 Fast Fourier transform (FFT) i 1 i i 1 i 5 x = x i Representation using matrix algebra Data flow graph stride 2 stride 1 1
6 Cooley-Tukey FFT (Recursive, General-Radix) Blackboard Kronecker products Stride permutations 11 Example FFT, n = 16 (Recursive, Radix 4) stride 4 stride 1 12
7 Recursive Cooley-Tukey FFT radix DFT km = (DFT k I m )Tm km (I k DFT m )L km k DFT km = L km m (I k DFT m )T km m (DFT k I m ) decimation-in-time decimation-in-frequency For powers of two n = 2 t sufficient together with base case " # 1 1 DFT 2 = 1 1 Cost: (complex adds, complex mults) = (n log 2 (n), n log 2 (n)/2) independent of recursion (real adds, real mults) (2n log 2 (n), 3n log 2 (n)) = 5n log 2 (n) flops depends on recursion: best is at least radix-8 13 Recursive vs. Iterative FFT Recursive, radix-k Cooley-Tukey FFT DFT km = (DFT k I m )Tm km (I k DFT m )L km k DFT km = L km m (I k DFT m )T km m (DFT k I m ) Iterative, radix 2, decimation-in-time/decimation-in-frequency 1 ty DFT 2 t (I 2 j 1 DFT 2 I 2 t j) (I 2 j 1 T 2t j+1 2 t j ) A R 2 t j=1 1 ty DFT 2 t = R 2 (I 2 t j T 2 2j j 1 ) (I 2 t j DFT 2 I 2 j 1) A j=1 14
8 Radix 2, recursive Radix 2, iterative Recursive vs. Iterative Iterative FFT computes in stages of butterflies = log 2 (n) passes through the data Recursive FFT reduces passes through data = better locality Same computation graph but different topological sorting Rough analogy: MMM Triple loop Blocked DFT Iterative FFT Recursive FFT 16
9 The FFT Is Very Malleable 17 Iterative FFT, Radix 2 18
10 Pease FFT, Radix 2 19 Stockham FFT, Radix 2 2
11 Six-Step FFT 21 Multi-Core FFT 22
12 Transform Algorithms DFT n! P k=2;2m > ³ ³ ³ DFT2m I k=2 1 i C 2m rdft 2m (i=k) RDFT k I m ; k even; 11 RDFT n RDFT RDFT 2m n! (P DHT > n k=2;m I RDFT 2) rdft 2m (i=k) RDFT 1 k B 2m DHT DHT I k=2 1 rdft i D 2m 2m (i=k) RDFT CC B k rdht n DHT 2m (i=k) DHT I k m C A ; 2m rdht 2m (i=k) DHT Ã!! k rdft 2n (u) rdht 2n (u)! L2n rdft m I k 2m ((i + u)=k) rdft i à 2k (u) rdht 2m ((i + u)=k) rdht 2k (u) Im ; DCT-4 n! S ndct-2 n diag k<n (1=(2 cos((2k + 1)¼=4n))) ÃÃ" #! Ã" #!! 1 1 IMDCT 2m! (J m I m I m J m) I 1 m I 1 m J 2m DCT-4 2m ty WHT 2 k! (I 2 k 1 + +k i 1 WHT 2 k i I k 2 i+1 + +k t ); k = k k t i=1 DFT 2! F 2 k even; RDFT-3 n! (Q > k=2;m I 2) (I k i rdft 2m )(i + 1=2)=k)) (RDFT-3 k I m) ; k even; DCT-2 n! P k=2;2m > ³ DCT-22m K2 2m ³ I k=2 1 N 2m RDFT-3 > 2m Bn(L n=2 k=2 I 2)(I m RDFT k )Q m=2;k ; DCT-3 n! DCT-2 > n ; DCT-4 n! Q > ³ k=2;2m Ik=2 N 2m RDFT-3 > 2m B n (L n=2 k=2 I 2)(I m RDFT-3 k )Q m=2;k : DFT n! (DFT k I m) T n m (I k DFT m) L n k ; n = km Cooley-Tukey FFT DFT n! P n(dft k DFT m)q n; n = km; gcd(k; m) = 1 Prime-factor FFT DFT p! Rp T (I 1 DFT p 1 )D p(i 1 DFT p 1 )R p; p prime Rader FFT DCT-3 n! (I m J m) L n m (DCT-3m(1=4) DCT-3m(3=4)) " # Im J m 1 (F 2 I m) p1 (I I ; n = 2m m) DCT-2 2! diag(1; 1= p 2) F 2 DCT-4 2! J 2 R 13¼=8 23 Complexity of the DFT Measure: L c, 2 c Complex adds count 1 Complex mult by a constant a with a < c counts 1 L 2 is strictest, L the loosest (and most natural) Upper bounds: n = 2 k : L 2 (DFT n ) 3/2 n log 2 (n) (using Cooley-Tukey FFT) General n: L 2 (DFT n ) 8 n log 2 (n) (needs Bluestein FFT) Lower bound: Theorem by Morgenstern: If c <, then L c (DFT n ) ½ n log c (n) Implies: in the measure L c, the DFT is Θ(n log(n)) 24
13 History of FFTs The advent of digital signal processing is often attributed to the FFT (Cooley-Tukey 1965) History: Around 185: FFT discovered by Gauss [1] (Fourier publishes the concept of Fourier analysis in 187!) 1965: Rediscovered by Cooley-Tukey 25 [1]: Heideman, Johnson, Burrus: Gauss and the History of the Fast Fourier Transform Arch. Hist. Sc. 34(3) 1985 Carl-Friedrich Gauss Contender for the greatest mathematician of all times Some contributions: Modular arithmetic, least square analysis, normal distribution, fundamental theorem of algebra, Gauss elimination, Gauss quadrature, Gauss-Seidel, non-euclidean geometry, 26
How to Write Fast Numerical Code Spring 2012 Lecture 19. Instructor: Markus Püschel TAs: Georg Ofenbeck & Daniele Spampinato
How to Write Fast Numerical Code Spring 2012 Lecture 19 Instructor: Markus Püschel TAs: Georg Ofenbeck & Daniele Spampinato Miscellaneous Roofline tool Project report etc. online Midterm (solutions online)
More informationSpiral. Program Synthesis for Performance. and the Spiral team (only part shown) Supported by DARPA, ONR, NSF, Intel, Mercury
Spiral Program Synthesis for Performance Joint work with Franz Franchetti Yevgen Voronenko Srinivas Chellappa Frédéric de Mesmay Daniel McFarlin José Moura James Hoe and the Spiral team (only part shown)
More informationRadix-4 Factorizations for the FFT with Ordered Input and Output
Radix-4 Factorizations for the FFT with Ordered Input and Output Vikrant 1, Ritesh Vyas 2, Sandeep Goyat 3, Jitender Kumar 4, Sandeep Kaushal 5 YMCA University of Science & Technology, Faridabad (Haryana),
More information5.6 Convolution and FFT
5.6 Convolution and FFT Fast Fourier Transform: Applications Applications. Optics, acoustics, quantum physics, telecommunications, control systems, signal processing, speech recognition, data compression,
More informationTHERE is a large body of literature on fast transform algorithms.
1502 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 56, NO. 4, APRIL 2008 Algebraic Signal Processing Theory: Cooley Tukey Type Algorithms for DCTs and DSTs Markus Püschel, Senior Member, IEEE, and José
More informationALGEBRAIC SIGNAL PROCESSING THEORY: AN OVERVIEW Markus Püschel Electrical and Computer Engineering Carnegie Mellon University, Pittsburgh, U.S.A.
ALGEBRAIC SIGNAL PROCESSING THEORY: AN OVERVIEW Markus Püschel Electrical and Computer Engineering Carnegie Mellon University, Pittsburgh, U.S.A. ABSTRACT We give an overview of the algebraic signal processing
More informationAlgorithm Design and Analysis
Algorithm Design and Analysis LECTURE 14 Divide and Conquer Fast Fourier Transform Sofya Raskhodnikova 10/7/2016 S. Raskhodnikova; based on slides by K. Wayne. 5.6 Convolution and FFT Fast Fourier Transform:
More informationDesign and Analysis of Algorithms
Design and Analysis of Algorithms CSE 5311 Lecture 5 Divide and Conquer: Fast Fourier Transform Junzhou Huang, Ph.D. Department of Computer Science and Engineering CSE5311 Design and Analysis of Algorithms
More informationAlgebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs
Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs Markus Püschel and José M. F. Moura arxiv:cs/0005v [cs.it] Feb 00 Abstract This paper presents a systematic methodology
More informationTransforms and Orthogonal Bases
Orthogonal Bases Transforms and Orthogonal Bases We now turn back to linear algebra to understand transforms, which map signals between different domains Recall that signals can be interpreted as vectors
More informationE The Fast Fourier Transform
Fourier Transform Methods in Finance By Umberto Cherubini Giovanni Della Lunga Sabrina Mulinacci Pietro Rossi Copyright 2010 John Wiley & Sons Ltd E The Fast Fourier Transform E.1 DISCRETE FOURIER TRASFORM
More informationThe DFT as Convolution or Filtering
Connexions module: m16328 1 The DFT as Convolution or Filtering C. Sidney Burrus This work is produced by The Connexions Project and licensed under the Creative Commons Attribution License A major application
More informationAutomatic Generation of Fast Discrete Signal Transforms
1992 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL 49, NO 9, SEPTEMBER 2001 Automatic Generation of Fast Discrete Signal Transforms Sebastian Egner and Markus Püschel Abstract This paper presents an algorithm
More informationThe tangent FFT. D. J. Bernstein University of Illinois at Chicago
The tangent FFT D. J. Bernstein University of Illinois at Chicago Advertisement SPEED: Software Performance Enhancement for Encryption and Decryption A workshop on software speeds for secret-key cryptography
More informationLecture 7 January 26, 2016
MATH 262/CME 372: Applied Fourier Analysis and Winter 26 Elements of Modern Signal Processing Lecture 7 January 26, 26 Prof Emmanuel Candes Scribe: Carlos A Sing-Long, Edited by E Bates Outline Agenda:
More informationHow to Multiply. 5.5 Integer Multiplication. Complex Multiplication. Integer Arithmetic. Complex multiplication. (a + bi) (c + di) = x + yi.
How to ultiply Slides by Kevin Wayne. Copyright 5 Pearson-Addison Wesley. All rights reserved. integers, matrices, and polynomials Complex ultiplication Complex multiplication. a + bi) c + di) = x + yi.
More informationIntroduction to HPC. Lecture 20
COSC Introduction to HPC Lecture Dept of Computer Science COSC Fast Fourier Transform COSC Image Processing In Electron Microscopy Medical Imaging Signal Processing In Radio Astronomy Modeling and Solution
More informationMA3232 Numerical Analysis Week 9. James Cooley (1926-)
MA umerical Analysis Week 9 James Cooley (96-) James Cooley is an American mathematician. His most significant contribution to the world of mathematics and digital signal processing is the Fast Fourier
More informationSequential Fast Fourier Transform (PSC ) Sequential FFT
Sequential Fast Fourier Transform (PSC 3.1 3.2) 1 / 18 Applications of Fourier analysis Fourier analysis studies the decomposition of functions into their frequency components. Piano Concerto no. 9 by
More informationELEG 305: Digital Signal Processing
ELEG 5: Digital Signal Processing Lecture 6: The Fast Fourier Transform; Radix Decimatation in Time Kenneth E. Barner Department of Electrical and Computer Engineering University of Delaware Fall 8 K.
More informationThe tangent FFT. Daniel J. Bernstein
The tangent FFT Daniel J. Bernstein Department of Mathematics, Statistics, and Computer Science (M/C 249) University of Illinois at Chicago, Chicago, IL 60607 7045, USA djb@cr.yp.to Abstract. The split-radix
More informationCSE 160 Lecture 13. Numerical Linear Algebra
CSE 16 Lecture 13 Numerical Linear Algebra Announcements Section will be held on Friday as announced on Moodle Midterm Return 213 Scott B Baden / CSE 16 / Fall 213 2 Today s lecture Gaussian Elimination
More informationRadar Systems Engineering Lecture 3 Review of Signals, Systems and Digital Signal Processing
Radar Systems Engineering Lecture Review of Signals, Systems and Digital Signal Processing Dr. Robert M. O Donnell Guest Lecturer Radar Systems Course Review Signals, Systems & DSP // Block Diagram of
More informationThe Fourier transform allows an arbitrary function to be represented in terms of simple sinusoids. The Fourier transform (FT) of a function f(t) is
1 Introduction Here is something I wrote many years ago while working on the design of anemometers for measuring shear stresses. Part of this work required modelling and compensating for the transfer function
More informationFast Polynomials Multiplication Using FFT
Li Chen lichen.xd at gmail.com Xidian University January 17, 2014 Outline 1 Discrete Fourier Transform (DFT) 2 Discrete Convolution 3 Fast Fourier Transform (FFT) 4 umber Theoretic Transform (TT) 5 More
More informationCanonic FFT flow graphs for real-valued even/odd symmetric inputs
Lao and Parhi EURASIP Journal on Advances in Signal Processing (017) 017:45 DOI 10.1186/s13634-017-0477-9 EURASIP Journal on Advances in Signal Processing RESEARCH Canonic FFT flow graphs for real-valued
More informationFast integer multiplication
Fast integer multiplication David Harvey, Joris van der Hoeven, Grégoire Lecerf CNRS, École polytechnique Bordeaux, February 2, 2015 http://www.texmacs.org Fundamental complexities I(n): multiplication
More informationEDISP (NWL3) (English) Digital Signal Processing DFT Windowing, FFT. October 19, 2016
EDISP (NWL3) (English) Digital Signal Processing DFT Windowing, FFT October 19, 2016 DFT resolution 1 N-point DFT frequency sampled at θ k = 2πk N, so the resolution is f s/n If we want more, we use N
More informationOn the computational complexity of mathematical functions
On the computational complexity of mathematical functions Jean-Pierre Demailly Institut Fourier, Université de Grenoble I & Académie des Sciences, Paris (France) November 26, 2011 KVPY conference at Vijyoshi
More informationAlgebraic Signal Processing Theory: Foundation and 1-D Time Markus Püschel, Senior Member, IEEE, and José M. F. Moura, Fellow, IEEE
3572 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL 56, NO 8, AUGUST 2008 Algebraic Signal Processing Theory: Foundation and 1-D Time Markus Püschel, Senior Member, IEEE, and José M F Moura, Fellow, IEEE
More informationSignal Flow Graph Approach to Efficient DST I-IV Algorithms
Publications 1-18-016 Signal Flow Graph Approach to Efficient DST I-IV Algorithms Sirani M. Perera Embry-Riddle Aeronautical University, pereras@erau.edu Follow this and additional works at: https://commons.erau.edu/publication
More informationFast Polynomial Multiplication
Fast Polynomial Multiplication Marc Moreno Maza CS 9652, October 4, 2017 Plan Primitive roots of unity The discrete Fourier transform Convolution of polynomials The fast Fourier transform Fast convolution
More informationFAST ALGORITHMS FOR MDCT AND LOW DELAY FILTERBANKS USED IN AUDIO CODING RAVI KIRAN CHIVUKULA. Presented to the Faculty of the Graduate School of
FAST ALGORITHMS FOR MDCT AD LOW DELAY FILTERBAKS USED I AUDIO CODIG by RAVI KIRA CHIVUKULA Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment
More informationSummary of the FFT *
OpenStax-CNX module: m42270 1 Summary of the FFT * C. Sidney Burrus This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Abstract Summary of research at
More informationA Mathematical Framework for Parallel Computing of Discrete-Time Discrete-Frequency Transforms in Multi-Core Processors
Appl. Math. Inf. Sci. 8, o. 6, 2795-2801 (2014) 2795 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.12785/amis/080615 A Mathematical Framework for Parallel Computing
More informationA HISTORICAL PERSPECTIVE OF THE FFT ALGORITHM
A HISTORICAL PERSPECTIVE OF THE FFT ALGORITHM Ingvar Claesson 1, Thomas L. Lagö 2, 1 Blekinge Institute of Technology, Telecommunication and Signal Processing Soft Center, Ronneby, Sweden 2 Svantek America
More informationOld and new algorithms for computing Bernoulli numbers
Old and new algorithms for computing Bernoulli numbers University of New South Wales 25th September 2012, University of Ballarat Bernoulli numbers Rational numbers B 0, B 1,... defined by: x e x 1 = n
More informationBig Prime Field FFT on the GPU
Big Prime Field FFT on the GPU Liangyu Chen, Svyatoslav Covanov, Davood Mohajerani, Marc Moreno Maza To cite this version: Liangyu Chen, Svyatoslav Covanov, Davood Mohajerani, Marc Moreno Maza. Big Prime
More informationDense Arithmetic over Finite Fields with CUMODP
Dense Arithmetic over Finite Fields with CUMODP Sardar Anisul Haque 1 Xin Li 2 Farnam Mansouri 1 Marc Moreno Maza 1 Wei Pan 3 Ning Xie 1 1 University of Western Ontario, Canada 2 Universidad Carlos III,
More informationParallel Numerical Algorithms
Parallel Numerical Algorithms Chapter 13 Prof. Michael T. Heath Department of Computer Science University of Illinois at Urbana-Champaign CS 554 / CSE 512 Michael T. Heath Parallel Numerical Algorithms
More informationFast Fourier Transform
Fast Fourier Transform December 8, 2016 FFT JPEG RGB Y C B C R (luma (brightness), chroma 2 (color)) chroma resolution is reduced image is split in blocks 8 8 pixels JPEG RGB Y C B C R (luma (brightness),
More informationBlock diagonal structure in discrete transforms. [ WH(N)] = N-point transform matrix for WHT
Block diagonal structure in discrete transforms Ja-Ling Wu Indexing terms: Mathematical techniques, Matrix algebra, Signal processing Abstract: The author investigates and summarises some of the computational
More informationDIGITAL SIGNAL PROCESSING UNIT I
DIGITAL SIGNAL PROCESSING UNIT I CONTENTS 1.1 Introduction 1.2 Introduction about signals 1.3 Signals Processing 1.4 Classification of signals 1.5 Operations performed on signals 1.6 Properties of signals
More information1 Multiply Eq. E i by λ 0: (λe i ) (E i ) 2 Multiply Eq. E j by λ and add to Eq. E i : (E i + λe j ) (E i )
Direct Methods for Linear Systems Chapter Direct Methods for Solving Linear Systems Per-Olof Persson persson@berkeleyedu Department of Mathematics University of California, Berkeley Math 18A Numerical
More informationLecture 12: Spiral: Domain Specific HLS. James C. Hoe Department of ECE Carnegie Mellon University
8 643 Lecture : Spiral: Domai Specific HLS James C. Hoe Departmet of ECE Caregie Mello Uiversity 8 643 F7 L S, James C. Hoe, CMU/ECE/CALCM, 07 Houseeepig Your goal today: see a eample of really highlevel
More informationMultiplying huge integers using Fourier transforms
Fourier transforms October 25, 2007 820348901038490238478324 1739423249728934932894??? integers occurs in many fields of Computational Science: Cryptography Number theory... Traditional approaches to
More informationVU Signal and Image Processing. Torsten Möller + Hrvoje Bogunović + Raphael Sahann
052600 VU Signal and Image Processing Torsten Möller + Hrvoje Bogunović + Raphael Sahann torsten.moeller@univie.ac.at hrvoje.bogunovic@meduniwien.ac.at raphael.sahann@univie.ac.at vda.cs.univie.ac.at/teaching/sip/17s/
More informationlecture 7: Trigonometric Interpolation
lecture : Trigonometric Interpolation 9 Trigonometric interpolation for periodic functions Thus far all our interpolation schemes have been based on polynomials However, if the function f is periodic,
More information! Circular Convolution. " Linear convolution with circular convolution. ! Discrete Fourier Transform. " Linear convolution through circular
Previously ESE 531: Digital Signal Processing Lec 22: April 18, 2017 Fast Fourier Transform (con t)! Circular Convolution " Linear convolution with circular convolution! Discrete Fourier Transform " Linear
More informationDIVIDE AND CONQUER II
DIVIDE AND CONQUER II master theorem integer multiplication matrix multiplication convolution and FFT Lecture slides by Kevin Wayne Copyright 2005 Pearson-Addison Wesley http://www.cs.princeton.edu/~wayne/kleinberg-tardos
More informationFast Fourier Transforms. Collection Editor: C. Sidney Burrus
Fast Fourier Transforms ollection Editor:. Sidney Burrus Fast Fourier Transforms ollection Editor:. Sidney Burrus Authors:. Sidney Burrus Matteo Frigo Steven G. Johnson Markus Pueschel Ivan Selesnick
More informationx (2) k even h n=(n) + (n+% x(6) X(3), x (5) 4 x(4)- x(), x (2), Decomposition of an N-point DFT into 2 N/2-point DFT's.
COMPUTATION OF DISCRETE FOURIER TRANSFORM - PART 2 1. Lecture 19-49 minutes k even n.o h n=(n) + (n+% x (2), x (2) x(4)- x(6) Decomposition of an N-point DFT into 2 N/2-point DFT's. X(3), x (5) 4 x(),
More informationToday s class. Linear Algebraic Equations LU Decomposition. Numerical Methods, Fall 2011 Lecture 8. Prof. Jinbo Bi CSE, UConn
Today s class Linear Algebraic Equations LU Decomposition 1 Linear Algebraic Equations Gaussian Elimination works well for solving linear systems of the form: AX = B What if you have to solve the linear
More informationLU Factorization. Marco Chiarandini. DM559 Linear and Integer Programming. Department of Mathematics & Computer Science University of Southern Denmark
DM559 Linear and Integer Programming LU Factorization Marco Chiarandini Department of Mathematics & Computer Science University of Southern Denmark [Based on slides by Lieven Vandenberghe, UCLA] Outline
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)
AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) Lecture 19: Computing the SVD; Sparse Linear Systems Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical
More informationInput Decidable Language -- Program Halts on all Input Encoding of Input -- Natural Numbers Encoded in Binary or Decimal, Not Unary
Complexity Analysis Complexity Theory Input Decidable Language -- Program Halts on all Input Encoding of Input -- Natural Numbers Encoded in Binary or Decimal, Not Unary Output TRUE or FALSE Time and Space
More informationAlgebraic Signal Processing Theory
1 Algebraic Signal Processing Theory Markus Püschel and José M. F. Moura arxiv:cs/0612077v1 [cs.it] 15 Dec 2006 Abstract This paper presents an algebraic theory of linear signal processing. At the core
More informationEE123 Digital Signal Processing
EE123 Digital Signal Processing Lecture 5 based on slides by J.M. Kahn Info Last time Finished DTFT Ch. 2 z-transforms Ch. 3 Today: DFT Ch. 8 Reminders: HW Due tonight The effects of sampling What is going
More informationCS S Lecture 5 January 29, 2019
CS 6363.005.19S Lecture 5 January 29, 2019 Main topics are #divide-and-conquer with #fast_fourier_transforms. Prelude Homework 1 is due Tuesday, February 5th. I hope you ve at least looked at it by now!
More informationDSP Algorithm Original PowerPoint slides prepared by S. K. Mitra
Chapter 11 DSP Algorithm Implementations 清大電機系林嘉文 cwlin@ee.nthu.edu.tw Original PowerPoint slides prepared by S. K. Mitra 03-5731152 11-1 Matrix Representation of Digital Consider Filter Structures This
More informationRADIX-2 FAST HARTLEY TRANSFORM REVISITED. de Oliveira, H.M.; Viviane L. Sousa, Silva, Helfarne A.N. and Campello de Souza, R.M.
RADIX- FAST HARTLY TRASFRM RVISITD de liveira, H.M.; Viviane L. Sousa, Silva, Helfarne A.. and Campello de Souza, R.M. Federal University of Pernambuco Departamento de letrônica e Sistemas - Recife - P,
More informationThe Fourier Transform (and more )
The Fourier Transform (and more ) imrod Peleg ov. 5 Outline Introduce Fourier series and transforms Introduce Discrete Time Fourier Transforms, (DTFT) Introduce Discrete Fourier Transforms (DFT) Consider
More informationDense LU factorization and its error analysis
Dense LU factorization and its error analysis Laura Grigori INRIA and LJLL, UPMC February 2016 Plan Basis of floating point arithmetic and stability analysis Notation, results, proofs taken from [N.J.Higham,
More information1 Backward and Forward Error
Math 515 Fall, 2008 Brief Notes on Conditioning, Stability and Finite Precision Arithmetic Most books on numerical analysis, numerical linear algebra, and matrix computations have a lot of material covering
More informationComplexity of the Fourier transform on the Johnson graph
Complexity of the Fourier transform on the Johnson graph arxiv:1704.06299v3 [math.co] 8 Mar 2018 Rodrigo Iglesias Departamento de Matemática Universidad Nacional del Sur Bahía Blanca, Argentina. March
More informationFaster integer multiplication using short lattice vectors
Faster integer multiplication using short lattice vectors David Harvey and Joris van der Hoeven ANTS XIII, University of Wisconsin, Madison, July 2018 University of New South Wales / CNRS, École Polytechnique
More informationLecture 5. The Digital Fourier Transform. (Based, in part, on The Scientist and Engineer's Guide to Digital Signal Processing by Steven Smith)
Lecture 5 The Digital Fourier Transform (Based, in part, on The Scientist and Engineer's Guide to Digital Signal Processing by Steven Smith) 1 -. 8 -. 6 -. 4 -. 2-1 -. 8 -. 6 -. 4 -. 2 -. 2. 4. 6. 8 1
More informationDigital Signal Processing. Midterm 2 Solutions
EE 123 University of California, Berkeley Anant Sahai arch 15, 2007 Digital Signal Processing Instructions idterm 2 Solutions Total time allowed for the exam is 80 minutes Please write your name and SID
More informationDEN: Linear algebra numerical view (GEM: Gauss elimination method for reducing a full rank matrix to upper-triangular
form) Given: matrix C = (c i,j ) n,m i,j=1 ODE and num math: Linear algebra (N) [lectures] c phabala 2016 DEN: Linear algebra numerical view (GEM: Gauss elimination method for reducing a full rank matrix
More informationScientific Computing: An Introductory Survey
Scientific Computing: An Introductory Survey Chapter 12 Prof. Michael T. Heath Department of Computer Science University of Illinois at Urbana-Champaign Copyright c 2002. Reproduction permitted for noncommercial,
More informationA parallel implementation for polynomial multiplication modulo a prime.
A parallel implementation for polynomial multiplication modulo a prime. ABSTRACT Marshall Law Department of Mathematics Simon Fraser University Burnaby, B.C. Canada. mylaw@sfu.ca. We present a parallel
More informationComputer Vision, Convolutions, Complexity and Algebraic Geometry
Computer Vision, Convolutions, Complexity and Algebraic Geometry D. V. Chudnovsky, G.V. Chudnovsky IMAS Polytechnic Institute of NYU 6 MetroTech Center Brooklyn, NY 11201 December 6, 2012 Fast Multiplication:
More informationFFT: Fast Polynomial Multiplications
FFT: Fast Polynomial Multiplications Jie Wang University of Massachusetts Lowell Department of Computer Science J. Wang (UMass Lowell) FFT: Fast Polynomial Multiplications 1 / 20 Overview So far we have
More informationAlgebra. Modular arithmetic can be handled mathematically by introducing a congruence relation on the integers described in the above example.
Coding Theory Massoud Malek Algebra Congruence Relation The definition of a congruence depends on the type of algebraic structure under consideration Particular definitions of congruence can be made for
More informationImage Transforms. Digital Image Processing Fundamentals of Digital Image Processing, A. K. Jain. Digital Image Processing.
Digital Image Processing Fundamentals of Digital Image Processing, A. K. Jain 2D Orthogonal and Unitary Transform: Orthogonal Series Expansion: {a k,l (m,n)}: a set of complete orthonormal basis: N N *
More informationEFFICIENT ALGORITHMS FOR THE DISCRETE GABOR TRANSFORM WITH A LONG FIR WINDOW
EFFICIENT ALGORITHMS FOR THE DISCRETE GABOR TRANSFORM WITH A LONG FIR WINDOW PETER L. SØNDERGAARD Abstract. The Discrete Gabor Transform (DGT) is the most commonly used signal transform for signal analysis
More informationarxiv: v2 [math.ra] 6 Oct 2015
Generalizing Block LU Factorization: A Lower-Upper-Lower Block Triangular Decomposition with Minimal Off-Diagonal Ranks François Serre and Markus Püschel Department of Computer Science, ETH Zurich arxiv:48.994v2
More informationNumerical Results on the Transcendence of Constants. David H. Bailey 275{281
Numerical Results on the Transcendence of Constants Involving e, and Euler's Constant David H. Bailey February 27, 1987 Ref: Mathematics of Computation, vol. 50, no. 181 (Jan. 1988), pg. 275{281 Abstract
More informationDirect Methods for Solving Linear Systems. Simon Fraser University Surrey Campus MACM 316 Spring 2005 Instructor: Ha Le
Direct Methods for Solving Linear Systems Simon Fraser University Surrey Campus MACM 316 Spring 2005 Instructor: Ha Le 1 Overview General Linear Systems Gaussian Elimination Triangular Systems The LU Factorization
More informationMATH20602 Numerical Analysis 1
M\cr NA Manchester Numerical Analysis MATH20602 Numerical Analysis 1 Martin Lotz School of Mathematics The University of Manchester Manchester, February 1, 2016 Outline General Course Information Introduction
More informationThe discrete and fast Fourier transforms
The discrete and fast Fourier transforms Marcel Oliver Revised April 7, 1 1 Fourier series We begin by recalling the familiar definition of the Fourier series. For a periodic function u: [, π] C, we define
More information7. LU factorization. factor-solve method. LU factorization. solving Ax = b with A nonsingular. the inverse of a nonsingular matrix
EE507 - Computational Techniques for EE 7. LU factorization Jitkomut Songsiri factor-solve method LU factorization solving Ax = b with A nonsingular the inverse of a nonsingular matrix LU factorization
More informationInteger multiplication with generalized Fermat primes
Integer multiplication with generalized Fermat primes CARAMEL Team, LORIA, University of Lorraine Supervised by: Emmanuel Thomé and Jérémie Detrey Journées nationales du Calcul Formel 2015 (Cluny) November
More informationNumerical Methods in Matrix Computations
Ake Bjorck Numerical Methods in Matrix Computations Springer Contents 1 Direct Methods for Linear Systems 1 1.1 Elements of Matrix Theory 1 1.1.1 Matrix Algebra 2 1.1.2 Vector Spaces 6 1.1.3 Submatrices
More informationThe Discrete Fourier transform
453.70 Linear Systems, S.M. Tan, The University of uckland 9- Chapter 9 The Discrete Fourier transform 9. DeÞnition When computing spectra on a computer it is not possible to carry out the integrals involved
More informationFall 2011, EE123 Digital Signal Processing
Lecture 6 Miki Lustig, UCB September 11, 2012 Miki Lustig, UCB DFT and Sampling the DTFT X (e jω ) = e j4ω sin2 (5ω/2) sin 2 (ω/2) 5 x[n] 25 X(e jω ) 4 20 3 15 2 1 0 10 5 1 0 5 10 15 n 0 0 2 4 6 ω 5 reconstructed
More informationInterpolation on the unit circle
Lecture 2: The Fast Discrete Fourier Transform Interpolation on the unit circle So far, we have considered interpolation of functions at nodes on the real line. When we model periodic phenomena, it is
More informationA Non-Linear Lower Bound for Constant Depth Arithmetical Circuits via the Discrete Uncertainty Principle
A Non-Linear Lower Bound for Constant Depth Arithmetical Circuits via the Discrete Uncertainty Principle Maurice J. Jansen Kenneth W.Regan November 10, 2006 Abstract We prove a non-linear lower bound on
More informationThe Fast Wavelet Transform 1. Beyond Fourier transforms. Mac A. Cody 2
The Fast Wavelet Transform 1 Beyond Fourier transforms Mac A. Cody 2 The Fourier transform has been used as a reliable tool in signal analysis for many years. Invented in the early 1800s by its namesake,
More informationFrequency Domain Finite Field Arithmetic for Elliptic Curve Cryptography
Frequency Domain Finite Field Arithmetic for Elliptic Curve Cryptography Selçuk Baktır, Berk Sunar {selcuk,sunar}@wpi.edu Department of Electrical & Computer Engineering Worcester Polytechnic Institute
More informationAn Illustrated Introduction to the Truncated Fourier Transform
An Illustrated Introduction to the Truncated Fourier Transform arxiv:1602.04562v2 [cs.sc] 17 Feb 2016 Paul Vrbik. School of Mathematical and Physical Sciences The University of Newcastle Callaghan, Australia
More information6.3. MULTIVARIABLE LINEAR SYSTEMS
6.3. MULTIVARIABLE LINEAR SYSTEMS What You Should Learn Use back-substitution to solve linear systems in row-echelon form. Use Gaussian elimination to solve systems of linear equations. Solve nonsquare
More informationNumerical Linear Algebra
Numerical Linear Algebra Decompositions, numerical aspects Gerard Sleijpen and Martin van Gijzen September 27, 2017 1 Delft University of Technology Program Lecture 2 LU-decomposition Basic algorithm Cost
More informationProgram Lecture 2. Numerical Linear Algebra. Gaussian elimination (2) Gaussian elimination. Decompositions, numerical aspects
Numerical Linear Algebra Decompositions, numerical aspects Program Lecture 2 LU-decomposition Basic algorithm Cost Stability Pivoting Cholesky decomposition Sparse matrices and reorderings Gerard Sleijpen
More informationLecture 12 (Tue, Mar 5) Gaussian elimination and LU factorization (II)
Math 59 Lecture 2 (Tue Mar 5) Gaussian elimination and LU factorization (II) 2 Gaussian elimination - LU factorization For a general n n matrix A the Gaussian elimination produces an LU factorization if
More informationEven faster integer multiplication
Even faster integer multiplication DAVID HARVEY School of Mathematics and Statistics University of New South Wales Sydney NSW 2052 Australia Email: d.harvey@unsw.edu.au JORIS VAN DER HOEVEN a, GRÉGOIRE
More informationCSE 421 Algorithms. T(n) = at(n/b) + n c. Closest Pair Problem. Divide and Conquer Algorithms. What you really need to know about recurrences
CSE 421 Algorithms Richard Anderson Lecture 13 Divide and Conquer What you really need to know about recurrences Work per level changes geometrically with the level Geometrically increasing (x > 1) The
More informationA Simple Left-to-Right Algorithm for Minimal Weight Signed Radix-r Representations
A Simple Left-to-Right Algorithm for Minimal Weight Signed Radix-r Representations James A. Muir School of Computer Science Carleton University, Ottawa, Canada http://www.scs.carleton.ca/ jamuir 23 October
More informationLinear System of Equations
Linear System of Equations Linear systems are perhaps the most widely applied numerical procedures when real-world situation are to be simulated. Example: computing the forces in a TRUSS. F F 5. 77F F.
More informationAn Introduction and Analysis of the Fast Fourier Transform. Thao Nguyen Mentor: Professor Ron Buckmire
An Introduction and Analysis of the Fast Fourier Transform Thao Nguyen Mentor: Professor Ron Buckmire Outline 1. Outline of the Talk 2. Continuous Fourier Transform (CFT) Visualization of the CFT Mathematical
More information