EE216B: VLSI Signal Processing. FFT Processors. Prof. Dejan Marković FFT: Background

Size: px
Start display at page:

Download "EE216B: VLSI Signal Processing. FFT Processors. Prof. Dejan Marković FFT: Background"

Transcription

1 4/30/0 EE6B: VLSI Signal Processing FFT Processors Prof. Dejan Marković FFT: Background A bit of history algorithm first described by Gauss algorithm rediscovered (not for the first time) by Cooley and Tukey Applications FFT is a key building block in wireless communication receivers Also used for frequency analysis of EEG signals And many other applications Implementations Custom design with fixed number of points Flexible FFT kernels for many configurations 9.

2 4/30/0 Fourier Transform: Concept A complex function can be approximated with a weighted sum of basis functions Fourier used sinusoids with varying frequencies as the basis functions jωt X( ω) x( t) e dt π jωt x( t) X( ω) e dω π This representation provides the frequency content of the original function Fourier transform assumes that all spectral components are present at all times 9.3 The Fast Fourier Transform (FFT) Efficient method for calculating discrete Fourier transform (DFT) N = length of transform, must be composite N = N N N m Transform length DFT ops FFT ops DFT ops / FFT ops 64 4, ,536,048 3,04,048,576 0, ,536 4,94,967,96,048,576 4,

3 4/30/0 The DFT Algorithm Converts time-domain samples into frequency-domain samples X k N n 0 x e n π j nk N complex number k = 0,, N Implementation options Direct-sum evaluation: O(N ) operations FFT algorithm: O(N logn) operations Inverse DFT: frequency-to-time conversion DFT with opposite sign in the exponent A /N factor N π j nk N xk Xne k = 0,, N N n The Divide-and-Conquer Approach Map the original problem into several sub-problems in such a way that the following inequality holds: Cost(sub-problems) + Cost(mapping) < Cost(original problem) DFT: X N nk k xnwn n 0 k = 0,, N X() z N n 0 x z n n X k = evaluation of X(z) at z = W N k {x n } and {X k } are periodic sequences So, how shall we divide the problem? 9.6 3

4 4/30/0 The Divide and Conquer Approach (Cont.) Procedure: Consider sub-sets of the initial sequence Take the DFT of these sub-sequences Reconstruct the DFT from the intermediate results #: Define: I t, t = 0,, r partition of {0,, N } that defines G different subsets of the input sequence N r i i i 0 t 0 i I X() z x z x z #: Normalize the powers of z w.r.t. x 0t in each subset I t r X z z x z i i0t () i0t i t 0 Replace z by w N k in the inner sum t t i i 9.7 Cooley-Tukey Mapping Consider decimated versions of the initial sequence, N = N N I n = {n N + n } Equivalent description: Substitute () into () n = 0,, N ; n = 0,, N N N nk nn k k N n N n N n 0 N 0 i jπ in jπ in N N i N N X W x W W e e W N N nk nk k N n N n N n 0 N 0 X W x W (8.) (8.) DFT of length N 9.8 4

5 4/30/0 Cooley-Tukey Mapping (Cont.) Define: Y n,k = k th output of n th length-n DFT N nk k n, k N n 0 X Y W k = k N + k k = 0,, N k = 0,, N Y can be taken modulo N n,k k N k N k k WN W N W N W N W N All the X k for k being congruent modulo N are from the same group of N outputs of Y n,k Y = Y since k can be taken modulo N n,k n,k Equivalent description: N N n ( k N k ) n k n k kn k n, k N n, k N N n 0 n 0 X Y W Y W W From N DFTs of length N applied to Y Y n n,k,k 9.9 Cooley-Tukey Mapping, Example: N = 5, N = 3 -D mapped to -D x 3 x 0 x 4 x x 5 x N = 3 DFTs of 3 Twiddle-factor 4 N = 5 DFTs of length N = 5 multiply length N = 3 x 6 x 9 DFT-5 DFT-5 DFT-5 x DFT-3 DFT-3 x DFT-3 3 x DFT-3 x DFT-3 x 8 x x 7 0 x 5 x 6 x 0 x x x 3 x 4 x 9 [] x 4 [] P. Duhamel and M. Vetterli, "Fast Fourier Transforms - A Tutorial Review and a State-of-the-art," Elsevier Signal Processing, vol. 4, no. 9, pp , Apr

6 4/30/0 N = 3, N = 5 versus N = 5, N = 3 Original -D sequence x 0 x x x 3 x 4 x 6 x 7 x 8 x 5 x 9 x 0 x x x 3 x 4 -D to -D mapping N = 3, N = 5 N = 5, N = 3 x x 3 x 6 x 9 x 0 x x 4 x 7 x 0 x 3 x x 5 x 8 x x 4 x 0 x 5 x 0 x x 6 x x x 7 x x 8 x 3 x 3 x 4 x 4 x 9 D-D mapping can t be obtained by simple matrix transposition 9. Radix- Decimation in Time (DIT) N =, N = N divides the input sequence into the sequence of even- and odd-numbered samples ( decimation in time (DIT)) N/ N/ n k k n k k n N/ N n N/ n 0 n 0 X x W W x W N/ N/ n k n k N/ k n N/ N n N/ n 0 n 0 X x W W x W X k and X k+n/ obtained by -pt DFTs on the outputs of length N/ DFTs of the even- and odd-numbered sequences, one of which is weighted by twiddle factors 9. 6

7 4/30/0 Decimation in Time and Decimation in Frequency x 0 Decimation in Time (DIT) DFT-4 DFT- x 0 x 0 Decimation in Frequency (DIF) DFT- DFT-4 x 0 x x {x i } DFT- x 4 x x x DFT- {x i } x x 4 x 3 x 5 x 3 W 8 x 6 x 4 x 5 x 6 x 7 DFT-4 {x i+ } W 8 W 8 W 8 3 DFT- DFT- x x 6 x 3 x 7 x 4 x 5 x 6 x 7 DFT- DFT- W 8 W 8 3 DFT-4 {x i+ } x x 3 x 5 x 7 N = DFTs of length N = 4 Twiddlefactor multiply N = 4 DFTs of length N = N = 4 DFTs of length N = Twiddlefactor multiply Reverse the role of N, N (duality between DIT and DIF) N = DFTs of length N = Signal-Flow Graph (SFG) Notation In generalizing this formulation, it is most convenient to adopt a graphic approach Signal-flow-graph notation describes the three basic DSP operations: Addition Multiplication by a constant Delay x[n] y[n] x[n] x[n] + y[n] a a x[n] z k x[n] x[n k] 9.4 7

8 4/30/0 Radix- Butterfly Decimation in Time (DIT) X = A + B W Y = A B W Decimation in Frequency (DIF) X = A + B Y = (A B) W A X A X B W Y B W Y It does not make sense to compute B W twice, Z = B W Z = B W X = A + Z Y = A Z complex mult complex adds X = A + B Y = (A B) W complex mult complex adds Abbreviation: complex mult = c-mult 9.5 Radix-4 DIT Butterfly A B C D W d V W X Y V = A + B W b + C W c + D W d W = A j B W b C W c + j D W d X = A B W b + B W b + C W c D W d Y = A + j B W b C W c j D W d B = B W b C = C W c D = D W d 3 c-mults V = A + B + C + D W = A j B C + j D X = A B + B + C D Y = A + j B C j D 3 c-mults c-adds Multiply by j swap Re/Im, possibly a negation radix-4 BF is equivalent to 4 radix- BFs 3 c-mults 4 c-mults c-adds 8 c-adds Reduces to 8 c-adds with intermediate values: A + C A C B + D j B j D 9.6 8

9 4/30/0 Comparison of Radix- and Radix-4 Radix-4 has about the same number of adds and 75% the number of multiplies compared to radix- Higher radices Possible, but rarely used (complicated control) Additional frequency gains diminish for r > 4 Computational complexity: Number of mults = reasonable st estimate of algorithmic complexity M = log r (N) N mults = O(M N) Add N adds for more accurate estimate complex mult = 4 real mults + real adds 9.7 FFT Complexity N FFT Radix # Re mults # Re adds , , ,560 5, ,6 3, ,44 3, ,304 47, ,78 35, ,536 35, ,440 36,704 For M = log r (N) N mult = O(M N) Decreases monotonically with radix increase Decreases, reaches min, increases 9.8 9

10 4/30/0 Further Simplifications x 0 + X 0 W e πnk j N x W + X n =, k = in this example Example: N = 8 Im W 8 W 3 8 W 4 8 W 5 8 W 6 8 W 8 W 8 7 W 8 0 Considerably simpler Sign inversion Re Swap Re/Im (or both) 9.9 The Complete 8-Point Decimation-in-Time FFT x 0 x 4 x x 6 x x 5 x 3 x 7 X 0 X X X 3 X 4 X 5 X 6 X

11 N log (N) complexity 4/30/0 FFT Algorithm X() k N n 0 xn ( ) e n j π k N N complex multiplications and additions N m j π k N m 0 x( m) e N /4 complex multiplications and additions N m j π k N m 0 x(m ) e N /4 complex multiplications and additions N /6 complex multiplications and additions N /6 complex multiplications and additions 9. Higher Radix FFT Algorithms Radix Radix 4 N log (N) complexity N log 4 (N) complexity FFT size Real Multiplications Radix Real Additions Radix N 4 8 srfft 4 8 srfft Radix- is the most symmetric structure and best suited for folding 9.

12 4360 Real Multiplications 4/30/0 5-pt FFT: Synthesis-Time Bottleneck Direct-mapped architecture took about 8 hours to synthesize, retiming was unfinished after.5 days It is difficult to explore the design space if synthesis is done for every implementation 3566 Real Additions 9.3 Exploiting Hierarchy Each stage can be treated as a combination of butterfly- structures, with varying twiddle factors x 0 x(0) Stage Stage Stage 3 x 0 X(0) x 4 x(4) - x X() x x() - x X() x 6 x(6) - W8 - x 3 X(3) x x() x 4 X(4) x 3 x(3) - W8 x 5 X(5) x 5 x(5) - W8 xx(7) 7 - W8 X(7) - W x 6 X(6) x 7

13 4/30/0 Modeling the Radix- Butterfly: Area Re In Im In + + Re Out Im Out Multiplication with CSA Shifted version of the input a a a3 a4 a5 a6 a7 a8 a9 Re In x Carry save adder Carry save adder Carry save adder Re tw Re Out c s c s c s Im In Im tw x Carry save adder c s Carry save adder c s x Carry save adder c s x Im Out Carry save adder c s Carry Propagate Adder Area mult Area base Out Area total = Area base + Area mult 9.5 Twiddle-Factor Area Estimation Non-zero bits range from 0-8 for the sine and cosine term Non-zero bits in the cosine term Twiddle Factor Index 70 Non-zero bits in the sine term Twiddle Factor Index 70 Number of twiddle factors Number of twiddle factors Number of non-zero bits in the cosine term Number of non-zero bits in the sine term 9.6 3

14 4/30/0 Area Estimates from Synthesis 0000 cosine nz bits = cosine nz bits = Area 6000 Area Estimated Area Synthesized Area Estimated Area Synthesized Area 0 3 Number of non-zero bits in the sine term Number of non-zero bits in the sine term x 04 cosine nz bits = 3 cosine nz bits = 4.5 Area Estimated Area Synthesized Area Number of non-zero bits in the sine term Area Estimated Area Synthesized Area Number of non-zero bits in the sine term Average error in area estimate: ~ 5% 9.7 FFT Timing Estimation xx(0) 0 Stage Stage Stage 3 X(0) x 0 xx(4) 4 - X() x xx() - X() x xx(6) 6 - W8 - X(3) x 3 xx() X(4) x 4 xx(3) 3 - W8 X(5) x 5 xx(5) 5 - W8 X(6) x 6 x(7) 7 - W8 - W8 3 X(7) x 7 Critical path in each stage equal to delay of most complex butterfly For 3 stages delay varied between.6 ns to ns per stage (90-nm CMOS) Addition of pipeline registers between stages reduces the delay 9.8 4

15 4/30/0 FFT Energy Estimation Challenge: Difficult to estimate hierarchically. Carrying out switch level simulations in MATLAB or gate level simulation in RC is time consuming. Solution: Derive analytical expressions for energy in each butterfly as a function of input switching activity and nonzero bits in twiddle factor. x(0) 0 p(0) Stage Stage Stage 3 p(0)? X(0) x 0 xx(4) 4 p(4) - p()? X() x xx() p() p()? - X() x xx(6) 6 p(6) - p(3)? W8 - X(3) x 3 x() p() p(4)? X(4) x 4 x(3) 3 p(3) - p(5)? W8 X(5) x 5 x(5) 5 p(5) p(6)? - W8 X(6) x 6 x(7) 7 p(7) - p(7)? W8 - W8 3 X(7) x FFT Architectures Memory-based Execution time (~/Throughput) time-mux SDF, MDC parallelism Direct-mapped Area

16 4/30/0 Single-Path Delay Feedback (SDF) Architecture x[0] X[0] x[] x[] x[3] x[4] x[5] x[6] x[7] X[4] X[] X[6] X[] X[5] X[3] X[7] Stage Stage Stage 3 Z -4 Z - Z - Radix- butterfly Radix- butterfly Radix- butterfly Stage Stage Stage FFT Optimized Implementation 6

17 4/30/0 FFT Optimization For a given performance Minimize energy and/or area 9.33 Techniques Radix factorization Parallelism DL realization

18 4/30/0 FFT Factorization: N = N N N -pt FFT TF Full multiplier N -pt FFT N X( k) x( n) e n 0 πnk j N N N πnk πnk πn k j j j N NN N x( nn n ) e e e n 0 n 0 N -point DFT Twiddle factor (TF) N -point DFT 9.35 j j j C C 3 Radix Factorization C C 3 Radix, j CONSTANT multipliers j j C C 4 C 5 4 3, j, C, C 4, j, C, C, C 3, C 4, C 5, C 6 j C j C C

19 4/30/0 Flexible Processing Element (PE) PE-4 PE-4 W N nk Radix Implementation PE 4 PE+PE 8 PE+PE3+PE 6 PE+PE3+PE4+PE PE PE4 (3//) -j C... C 6 PE () -j PE3 (/) -j C C C 6 j C C 4 C C 3 C Only 3 Constant Factors to Build PUs Re{x} Im{x} = x x(j) N add = j x(j) real part imag part Re{x} xc 3 xc 4 Re{x} Im{x} xc xc Im{x} N add = N add = 8 xc 5 xc 6 Factor CSD realization Adders

20 4/30/0 Hardware Cost of PUs Radix Implementation Butterfly adders Const x (equiv. + ) PU 4 0 PU+PU 8 3 PU+PU3+PU 3 4 PU+PU3+PU4+PU Impact of TW Wordlength FFT size 4 points 8 points 6 points TW bits Radix Radix Radix Radix

21 Normalized Energy 4/30/0 Radix Structure of 8-pt FFT Arch R R4 R8 R6 A 7 A 5 A3 3 A4 3 A5 4 A6 A7 A8 A9 3 A0 A Normalized Area 9.4 Parallelism M-pt FFT L-pt FFT (a) reference SP SP M-pt FFT M-pt FFT P M-pt FFT (b) parallel L-pt FFT L-pt FFT P L-pt FFT PS PS M-pt FFT P (c) parallel (P = L) 9.4

22 4/30/0 (a) reference architecture M = L = 8 Z -8 Z -4 Z - PU PU PU Z - PU Example: 6-point FFT (b) L-path (L = 8) architecture SP... M = Z - PU 8... L = 8 8-point FFT... PS Arch 8-way L = 8 Delay 0 8 Mult 4 6 Add Hardware cost (a) (b) Delay buffers Full multipliers Adders Radix-Optimized 04-pt FFT Final architecture N = 8, N = 8 (out of 68 arch.)... 8-pt (4/4/8) pipeline FFT pt parallel FFT 8-pt FFT Z 64 Z 3 Z 6 Z 8 Z 4 Z Z PU PU PU PU PU PU3 PU Radix 4 Radix 4 Radix

23 Normalized Energy Normalized Energy 4/30/0 Optimization Techniques Normalized Area 9.45 Optimization Techniques Normalized Area

24 Normalized Energy Energy/FFT [nj] 4/30/0 Min EAP 4mW, 0.75mm 00MS/s) 0. N = Normalized Area 9.47 Mix-Radix vs. Sub-V T Circuits 04 points, 40 MS/s 7. Radix 0.7 V Sub-V T circuits V Logic synthesis C.-H. Yang, et al., JSSC 3/0. M. Seok, et al., ISSCC Area [mm ]

25 4/30/0 Flexible FFT: LTE Example 3GPP-LTE Specifications Parameter Configuration FFT size BW (MHz)

26 3.6 mm.5 mm 4/30/0 Flexible FFT (8-048 pts) for 3GPP-LTE pt SDF FFT /6-pt parallel FFT Flexible FFT: N = 6 to 56 N = 8 or 6 MUX Z -8 Z -64/-3 Z -3 Z -3/-6 Z -6 Z -6/-8 Z -8 PU PU3 PU4 PU Z -8 PU MUX MUX MUX Z -4 Z - Z - PU3 PU4 PU 6/3/64/8/56-pt 56-pt 8-pt 64-pt 6-pt 3-pt FFT FFT Architecture BW (MHz) FFT size Optimal Factorization (6x6)x (6x6)x (8x6)x8 5 5 (8x8)x (4x8)x8.5 8 (6)x8 9.5 Flexible FFT Chip Layout.7 mm. mm MEM MEM MEM MEM MEM MEM Reg. File Bank pt FFT Hard-output Sphere Decoder Soft-output Bank Pre-proc. MEM MEM MEM MEM MEM MEM PU3 PU Level shifters 8/6pt FFT TF Mults PU4 TF Mults PU PU4 PU3 PU PU C.-H. Yang, et al., VLSI

27 4/30/0 Energy of Flexible FFTs Energy/FFT (norm. to baseline) E/FFT = Power f clk N L min / 65nm V 0.4V FFT size, Baseline Energy (nj/fft) Y. Chen, et al., TCAS-II, /008. A. Wang and A. Chandrakasan, JSSC, /005. Baseline: C.-H. Yang and D. Marković, VLSI FFT Summary FFT: a technique for frequency analysis of stationary signals Key building block in radio systems and many other apps FFT is an economical implementation of DFT that leverages shorter sub-sequences to implement DFT on N discrete samples Key building block of an FFT is butterfly operator, which can be realized with N radix ( and 4 being the most common) Radix factorization is essential for low energy and area FFT does not work well with non-stationary signals

28 4/30/0 References : Algorithmic Techniques J.W. Cooley, J.W. Tukey, An algorithm for the machine calculation of complex Fourier series, Math. Comp., vol.9, pp.97-30, 965. P. Duhamel and H. Hollmann, Split-radix FFT algorithm, Electron. Lett., vol. 0, pp. 4 6, 984. I.J. Good, The Interaction Algorithm and Practical Fourier Analysis: An Addendum, J. Royal Statistical Society, no., pp , 960. S. Winograd, On computing the discrete Fourier transform, Math. Comp., vol. 3, pp , 978. P. Duhamel, Algorithms Meeting the Lower Bounds on the Multiplicative Complexity of Length-n DFTs and Their Connection with Practical Algorithms, IEEE Trans. Acoustic, Speech, and Signal Processing, vol. 38, no. 9, pp , 990. P. Duhamel and M. Vetterli, "Fast Fourier Transforms - A Tutorial Review and a State-of-the-art," Elsevier Signal Processing, vol. 4, no. 9, pp , Apr References : Traditional Architectures S. Magar, et. al., An application specific DSP chip set for 00 MHz data rates, in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 988, vol. 4, pp S. He and M. Torkelson, Design and implementation of a 04- point pipeline FFT processor, in Proc. IEEE Custom Integrated Circuits Conf., May 998, pp B.M. Baas, A low-power high-performance, 04-point FFT processor, IEEE J. Solid-State Circuits, vol. 34, no. 3, pp , Mar J. O Brien, J. Mather, and B. Holland, A 00 MIPS single-chip k FFT processor, in IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. 989, pp

29 4/30/0 References : Circuit Techniques A. Wang and A. Chandrakasan, A 80-mV Subthreshold FFT Processor Using a Minimum Energy Design Methodology, IEEE J. Solid-State Circuits, vol. 40, no., pp , Jan M. Seok, D. Jeon, C. Chakrabarti, D. Blaauw, and D. Sylvester, A 0.7V 30MHz 7.7nJ/transform 04-pt Complex FFT Core with Super-pipelining, IEEE Int. Solid-State Circuits Conf. Dig. Tech. Papers, Feb. 0, pp Y. Chen, Y.-W. Lin, Y.-C. Tsao, and C.-Y. Lee, A.4-Gsample/s DVFS FFT Processor for MIMO OFDM Communication Systems, IEEE J. Solid-State Circuits, vol. 43, no. 5, pp , May 008. K.-S. Chong, B.-H. Gwee, and J.S. Chang, Energy-Efficient Synchronous-Logic and Asynchronous-Logic FFT/IFFT Processors, IEEE J. Solid-State Circuits, vol. 4, no. 9, pp , Sep References : Mix-radix Architectures (/) G. Zhong, F. Xu, and A.N. Willson, A Power-Scalable Reconfigurable FFT/IFFT IC Based on Multi-Processor Ring, IEEE J. Solid-State Circuits, vol. 4, no., pp , Feb Y.-W. Lin, H.-Y. Liu, and C.-Y. Lee, A -GS/s FFT/IFFT Processor for UWB Applications, IEEE J. Solid-State Circuits, vol. 40, no. 8, pp , Aug Y. Chen, Y.-C. Tsao, and C.-Y. Lee, An indexed-scaling pipelined FFT processor for OFDM-based WPAN applications, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 55, no., pp , Feb Y.-T. Lin, P.-Y. Tsai, and T.-D. Chiueh, Low-power variable-length fast Fourier transform processor, IEE Proc.-Comput. Digit. Tech., vol. 5, no. 4, Jul. 005, pp W.-C. Yeh and C.-W. Jen, High-Speed and Low-Power Split-Radix FFT, IEEE Trans. Signal Processing, vol.5, no.3, pp ,

30 4/30/0 References : Mix-radix Architectures (/) C.-H. Yang, T.-H. Yu, and D. Marković, "Power and Area Minimization of Reconfigurable FFT Processors: A 3GPP-LTE Example," IEEE J. Solid-State Circuits, vol. 47, no.3, pp , Mar. 0. T.J. Ding, J.V. McCanny, and Y. Hu, Rapid Design of Application Specific FFT Cores, IEEE Trans. Signal Processing, vol. 47, no. 5, pp , 003. T.-D. Chiueh and P.-Y. Tsai, OFDM Baseband Receiver Design for Wireless Communications, Wiley, 007. A. Wenzler and E. Luder, New structures for complex multipliers and their noise analysis, in Proc. IEEE Int. Symp. Circuits Syst., ISCAS 95, May 995, vol., pp

A Low-Error Statistical Fixed-Width Multiplier and Its Applications

A Low-Error Statistical Fixed-Width Multiplier and Its Applications A Low-Error Statistical Fixed-Width Multiplier and Its Applications Yuan-Ho Chen 1, Chih-Wen Lu 1, Hsin-Chen Chiang, Tsin-Yuan Chang, and Chin Hsia 3 1 Department of Engineering and System Science, National

More information

Canonic FFT flow graphs for real-valued even/odd symmetric inputs

Canonic FFT flow graphs for real-valued even/odd symmetric inputs Lao and Parhi EURASIP Journal on Advances in Signal Processing (017) 017:45 DOI 10.1186/s13634-017-0477-9 EURASIP Journal on Advances in Signal Processing RESEARCH Canonic FFT flow graphs for real-valued

More information

Radix-4 Factorizations for the FFT with Ordered Input and Output

Radix-4 Factorizations for the FFT with Ordered Input and Output Radix-4 Factorizations for the FFT with Ordered Input and Output Vikrant 1, Ritesh Vyas 2, Sandeep Goyat 3, Jitender Kumar 4, Sandeep Kaushal 5 YMCA University of Science & Technology, Faridabad (Haryana),

More information

DSP Algorithm Original PowerPoint slides prepared by S. K. Mitra

DSP Algorithm Original PowerPoint slides prepared by S. K. Mitra Chapter 11 DSP Algorithm Implementations 清大電機系林嘉文 cwlin@ee.nthu.edu.tw Original PowerPoint slides prepared by S. K. Mitra 03-5731152 11-1 Matrix Representation of Digital Consider Filter Structures This

More information

AN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER

AN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER Indian Journal of Electronics and Electrical Engineering (IJEEE) Vol.2.No.1 2014pp1-6 available at: www.goniv.com Paper Received :05-03-2014 Paper Published:28-03-2014 Paper Reviewed by: 1. John Arhter

More information

EDISP (NWL3) (English) Digital Signal Processing DFT Windowing, FFT. October 19, 2016

EDISP (NWL3) (English) Digital Signal Processing DFT Windowing, FFT. October 19, 2016 EDISP (NWL3) (English) Digital Signal Processing DFT Windowing, FFT October 19, 2016 DFT resolution 1 N-point DFT frequency sampled at θ k = 2πk N, so the resolution is f s/n If we want more, we use N

More information

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Arithmetic Circuits January, 2003 1 A Generic Digital Processor MEM ORY INPUT-OUTPUT CONTROL DATAPATH

More information

A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC

A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 64, NO. 3, MARCH 2017 349 A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC Honggang Qi, Member, IEEE,

More information

AIR FORCE INSTITUTE OF TECHNOLOGY

AIR FORCE INSTITUTE OF TECHNOLOGY A Modular Mixed Signal VLSI Design Approach for Digital Radar Applications THESIS Brian M. Brakus, Captain, USAF AFIT/GCE/ENG/07-02 DEPARTMENT OF THE AIR FORCE AIR UNIVERSITY AIR FORCE INSTITUTE OF TECHNOLOGY

More information

Low-Power Twiddle Factor Unit for FFT Computation

Low-Power Twiddle Factor Unit for FFT Computation Low-Power Twiddle Factor Unit for FFT Computation Teemu Pitkänen, Tero Partanen, and Jarmo Takala Tampere University of Technology, P.O. Box, FIN- Tampere, Finland {teemu.pitkanen, tero.partanen, jarmo.takala}@tut.fi

More information

A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER. Jesus Garcia and Michael J. Schulte

A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER. Jesus Garcia and Michael J. Schulte A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER Jesus Garcia and Michael J. Schulte Lehigh University Department of Computer Science and Engineering Bethlehem, PA 15 ABSTRACT Galois field arithmetic

More information

Sistemas de Aquisição de Dados. Mestrado Integrado em Eng. Física Tecnológica 2015/16 Aula 6-26 de Outubro

Sistemas de Aquisição de Dados. Mestrado Integrado em Eng. Física Tecnológica 2015/16 Aula 6-26 de Outubro Sistemas de Aquisição de Dados Mestrado Integrado em Eng. Física Tecnológica 2015/16 Aula 6-26 de Outubro Flash Decoder Thermometer code Wired NOR based decoder 2 Successive Approximation ADC (SAR) CONVERT

More information

Tunable Floating-Point for Energy Efficient Accelerators

Tunable Floating-Point for Energy Efficient Accelerators Tunable Floating-Point for Energy Efficient Accelerators Alberto Nannarelli DTU Compute, Technical University of Denmark 25 th IEEE Symposium on Computer Arithmetic A. Nannarelli (DTU Compute) Tunable

More information

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Arithmetic Circuits January, 2003 1 A Generic Digital Processor MEMORY INPUT-OUTPUT CONTROL DATAPATH

More information

AREA EFFICIENT MODULAR ADDER/SUBTRACTOR FOR RESIDUE MODULI

AREA EFFICIENT MODULAR ADDER/SUBTRACTOR FOR RESIDUE MODULI AREA EFFICIENT MODULAR ADDER/SUBTRACTOR FOR RESIDUE MODULI G.CHANDANA 1 (M.TECH),chandana.g89@gmail.com P.RAJINI 2 (M.TECH),paddam.rajani@gmail.com Abstract Efficient modular adders and subtractors for

More information

Frequency-domain representation of discrete-time signals

Frequency-domain representation of discrete-time signals 4 Frequency-domain representation of discrete-time signals So far we have been looing at signals as a function of time or an index in time. Just lie continuous-time signals, we can view a time signal as

More information

FIXED WIDTH BOOTH MULTIPLIER BASED ON PEB CIRCUIT

FIXED WIDTH BOOTH MULTIPLIER BASED ON PEB CIRCUIT FIXED WIDTH BOOTH MULTIPLIER BASED ON PEB CIRCUIT Dr. V.Vidya Devi 1, GuruKumar.Lokku 2, A.Natarajan 3 1 Professor, Department of ECE, A. M.S. Engineering college, T.N., India vidyapeace@gmail.com 2 VLSI

More information

R13 SET - 1

R13 SET - 1 R13 SET - 1 III B. Tech II Semester Regular Examinations, April - 2016 DIGITAL SIGNAL PROCESSING (Electronics and Communication Engineering) Time: 3 hours Maximum Marks: 70 Note: 1. Question Paper consists

More information

Multimedia Signals and Systems - Audio and Video. Signal, Image, Video Processing Review-Introduction, MP3 and MPEG2

Multimedia Signals and Systems - Audio and Video. Signal, Image, Video Processing Review-Introduction, MP3 and MPEG2 Multimedia Signals and Systems - Audio and Video Signal, Image, Video Processing Review-Introduction, MP3 and MPEG2 Kunio Takaya Electrical and Computer Engineering University of Saskatchewan December

More information

The Fourier Transform (and more )

The Fourier Transform (and more ) The Fourier Transform (and more ) imrod Peleg ov. 5 Outline Introduce Fourier series and transforms Introduce Discrete Time Fourier Transforms, (DTFT) Introduce Discrete Fourier Transforms (DFT) Consider

More information

EEC 281 VLSI Digital Signal Processing Notes on the RRI-FFT

EEC 281 VLSI Digital Signal Processing Notes on the RRI-FFT EEC 281 VLSI Digital Signal Processing Notes on the RRI-FFT Bevan Baas The attached description of the RRI-FFT is taken from a chapter describing the cached-fft algorithm and there are therefore occasional

More information

NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1

NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 Multi-processor vs. Multi-computer architecture µp vs. DSP RISC vs. DSP RISC Reduced-instruction-set Register-to-register operation Higher throughput by using

More information

VLSI Signal Processing

VLSI Signal Processing VLSI Signal Processing Lecture 1 Pipelining & Retiming ADSP Lecture1 - Pipelining & Retiming (cwliu@twins.ee.nctu.edu.tw) 1-1 Introduction DSP System Real time requirement Data driven synchronized by data

More information

Volume 3, No. 1, January 2012 Journal of Global Research in Computer Science RESEARCH PAPER Available Online at

Volume 3, No. 1, January 2012 Journal of Global Research in Computer Science RESEARCH PAPER Available Online at Volume 3, No 1, January 2012 Journal of Global Research in Computer Science RESEARCH PAPER Available Online at wwwjgrcsinfo A NOVEL HIGH DYNAMIC RANGE 5-MODULUS SET WHIT EFFICIENT REVERSE CONVERTER AND

More information

School of EECS Seoul National University

School of EECS Seoul National University 4!4 07$ 8902808 3 School of EECS Seoul National University Introduction Low power design 3974/:.9 43 Increasing demand on performance and integrity of VLSI circuits Popularity of portable devices Low power

More information

Transformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms

Transformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms International Journal of Electronics and Communication Engineering. ISSN 0974-66 Volume 4, Number (0), pp.83-94 International Research Publication House http://www.irphouse.com Transformation Techniques

More information

Transforms and Orthogonal Bases

Transforms and Orthogonal Bases Orthogonal Bases Transforms and Orthogonal Bases We now turn back to linear algebra to understand transforms, which map signals between different domains Recall that signals can be interpreted as vectors

More information

Fundamentals of the DFT (fft) Algorithms

Fundamentals of the DFT (fft) Algorithms Fundamentals of the DFT (fft) Algorithms D. Sundararajan November 6, 9 Contents 1 The PM DIF DFT Algorithm 1.1 Half-wave symmetry of periodic waveforms.............. 1. The DFT definition and the half-wave

More information

Low-Complexity Multiplierless Constant Rotators Based on Combined Coefficient Selection and Shift-and-Add Implementation (CCSSI)

Low-Complexity Multiplierless Constant Rotators Based on Combined Coefficient Selection and Shift-and-Add Implementation (CCSSI) Low-Complexity Multiplierless Constant Rotators Based on Combined Coefficient Selection and Shift-and-Add Implementation (CCSSI) Mario Garrido Gálvez, Fahad Qureshi and Oscar Gustafsson Linköping University

More information

1. Calculation of the DFT

1. Calculation of the DFT ELE E4810: Digital Signal Processing Topic 10: The Fast Fourier Transform 1. Calculation of the DFT. The Fast Fourier Transform algorithm 3. Short-Time Fourier Transform 1 1. Calculation of the DFT! Filter

More information

ELEG 305: Digital Signal Processing

ELEG 305: Digital Signal Processing ELEG 5: Digital Signal Processing Lecture 6: The Fast Fourier Transform; Radix Decimatation in Time Kenneth E. Barner Department of Electrical and Computer Engineering University of Delaware Fall 8 K.

More information

Elliptic Fourier Transform

Elliptic Fourier Transform NCUR, ART GRIGORYAN Frequency Analysis of the signals on ellipses: Elliptic Fourier Transform Joao Donacien N. Nsingui Department of Electrical and Computer Engineering The University of Texas at San Antonio

More information

Energy Delay Optimization

Energy Delay Optimization EE M216A.:. Fall 21 Lecture 8 Energy Delay Optimization Prof. Dejan Marković ee216a@gmail.com Some Common Questions Is sizing better than V DD for energy reduction? What are the optimal values of gate

More information

CMPEN 411 VLSI Digital Circuits Spring Lecture 19: Adder Design

CMPEN 411 VLSI Digital Circuits Spring Lecture 19: Adder Design CMPEN 411 VLSI Digital Circuits Spring 2011 Lecture 19: Adder Design [Adapted from Rabaey s Digital Integrated Circuits, Second Edition, 2003 J. Rabaey, A. Chandrakasan, B. Nikolic] Sp11 CMPEN 411 L19

More information

THE discrete sine transform (DST) and the discrete cosine

THE discrete sine transform (DST) and the discrete cosine IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-II: EXPRESS BIREFS 1 New Systolic Algorithm and Array Architecture for Prime-Length Discrete Sine Transform Pramod K. Meher Senior Member, IEEE and M. N. S. Swamy

More information

CORDIC, Divider, Square Root

CORDIC, Divider, Square Root 4// EE6B: VLSI Signal Processing CORDIC, Divider, Square Root Prof. Dejan Marković ee6b@gmail.com Iterative algorithms CORDIC Division Square root Lecture Overview Topics covered include Algorithms and

More information

Optimum Circuits for Bit Reversal

Optimum Circuits for Bit Reversal Optimum Circuits for Bit Reversal Mario Garrido Gálvez, Jesus Grajal and Oscar Gustafsson Linköping University Post Print.B.: When citing this work, cite the original article. 2011 IEEE. Personal use of

More information

DHANALAKSHMI COLLEGE OF ENGINEERING DEPARTMENT OF ELECTRICAL AND ELECTRONICS ENGINEERING EC2314- DIGITAL SIGNAL PROCESSING UNIT I INTRODUCTION PART A

DHANALAKSHMI COLLEGE OF ENGINEERING DEPARTMENT OF ELECTRICAL AND ELECTRONICS ENGINEERING EC2314- DIGITAL SIGNAL PROCESSING UNIT I INTRODUCTION PART A DHANALAKSHMI COLLEGE OF ENGINEERING DEPARTMENT OF ELECTRICAL AND ELECTRONICS ENGINEERING EC2314- DIGITAL SIGNAL PROCESSING UNIT I INTRODUCTION PART A Classification of systems : Continuous and Discrete

More information

VLSI Architecture for Layered Decoding of QC-LDPC Codes With High Circulant Weight

VLSI Architecture for Layered Decoding of QC-LDPC Codes With High Circulant Weight IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS VLSI Architecture for Layered Decoding of QC-LDPC Codes With High Circulant Weight Yang Sun and Joseph R. Cavallaro Abstract In this brief,

More information

Characteristic Analysis of 1024-Point Quantized Radix-2 FFT-IFFT Processor

Characteristic Analysis of 1024-Point Quantized Radix-2 FFT-IFFT Processor Neonode Inc From the SelectedWorks of Dr. Rozita Teymourzadeh, CEng. 22 Characteristic Analysis of 24-Point Quantized Radix-2 FFT-IFFT Processor Rozita Teymourzadeh Available at: https://works.bepress.com/rozita_teymourzadeh/32/

More information

Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum

Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum Chapter 4 Discrete Fourier Transform (DFT) And Signal Spectrum CEN352, DR. Nassim Ammour, King Saud University 1 Fourier Transform History Born 21 March 1768 ( Auxerre ). Died 16 May 1830 ( Paris ) French

More information

DFT & Fast Fourier Transform PART-A. 7. Calculate the number of multiplications needed in the calculation of DFT and FFT with 64 point sequence.

DFT & Fast Fourier Transform PART-A. 7. Calculate the number of multiplications needed in the calculation of DFT and FFT with 64 point sequence. SHRI ANGALAMMAN COLLEGE OF ENGINEERING & TECHNOLOGY (An ISO 9001:2008 Certified Institution) SIRUGANOOR,TRICHY-621105. DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING UNIT I DFT & Fast Fourier

More information

Stack Sizing for Optimal Current Drivability in Subthreshold Circuits REFERENCES

Stack Sizing for Optimal Current Drivability in Subthreshold Circuits REFERENCES 598 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL 16, NO 5, MAY 2008 design can be easily expanded to a hierarchical 64-bit adder such that the result will be attained in four cycles

More information

The Fourier transform allows an arbitrary function to be represented in terms of simple sinusoids. The Fourier transform (FT) of a function f(t) is

The Fourier transform allows an arbitrary function to be represented in terms of simple sinusoids. The Fourier transform (FT) of a function f(t) is 1 Introduction Here is something I wrote many years ago while working on the design of anemometers for measuring shear stresses. Part of this work required modelling and compensating for the transfer function

More information

Low Power, High Speed Parallel Architecture For Cyclic Convolution Based On Fermat Number Transform (FNT)

Low Power, High Speed Parallel Architecture For Cyclic Convolution Based On Fermat Number Transform (FNT) RESEARCH ARTICLE OPEN ACCESS Low Power, High Speed Parallel Architecture For Cyclic Convolution Based On Fermat Number Transform (FNT) T.Jyothsna 1 M.Tech, M.Pradeep 2 M.Tech 1 E.C.E department, shri Vishnu

More information

Introduction to HPC. Lecture 20

Introduction to HPC. Lecture 20 COSC Introduction to HPC Lecture Dept of Computer Science COSC Fast Fourier Transform COSC Image Processing In Electron Microscopy Medical Imaging Signal Processing In Radio Astronomy Modeling and Solution

More information

HARDWARE IMPLEMENTATION OF FIR/IIR DIGITAL FILTERS USING INTEGRAL STOCHASTIC COMPUTATION. Arash Ardakani, François Leduc-Primeau and Warren J.

HARDWARE IMPLEMENTATION OF FIR/IIR DIGITAL FILTERS USING INTEGRAL STOCHASTIC COMPUTATION. Arash Ardakani, François Leduc-Primeau and Warren J. HARWARE IMPLEMENTATION OF FIR/IIR IGITAL FILTERS USING INTEGRAL STOCHASTIC COMPUTATION Arash Ardakani, François Leduc-Primeau and Warren J. Gross epartment of Electrical and Computer Engineering McGill

More information

Numbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture

Numbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture Computational Platforms Numbering Systems Basic Building Blocks Scaling and Round-off Noise Computational Platforms Viktor Öwall viktor.owall@eit.lth.seowall@eit lth Standard Processors or Special Purpose

More information

EECS150 - Digital Design Lecture 25 Shifters and Counters. Recap

EECS150 - Digital Design Lecture 25 Shifters and Counters. Recap EECS150 - Digital Design Lecture 25 Shifters and Counters Nov. 21, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy of Prof. John

More information

Analysis and Synthesis of Weighted-Sum Functions

Analysis and Synthesis of Weighted-Sum Functions Analysis and Synthesis of Weighted-Sum Functions Tsutomu Sasao Department of Computer Science and Electronics, Kyushu Institute of Technology, Iizuka 820-8502, Japan April 28, 2005 Abstract A weighted-sum

More information

A Digit-Serial Systolic Multiplier for Finite Fields GF(2 m )

A Digit-Serial Systolic Multiplier for Finite Fields GF(2 m ) A Digit-Serial Systolic Multiplier for Finite Fields GF( m ) Chang Hoon Kim, Sang Duk Han, and Chun Pyo Hong Department of Computer and Information Engineering Taegu University 5 Naeri, Jinryang, Kyungsan,

More information

CMPEN 411 VLSI Digital Circuits Spring Lecture 21: Shifters, Decoders, Muxes

CMPEN 411 VLSI Digital Circuits Spring Lecture 21: Shifters, Decoders, Muxes CMPEN 411 VLSI Digital Circuits Spring 2011 Lecture 21: Shifters, Decoders, Muxes [Adapted from Rabaey s Digital Integrated Circuits, Second Edition, 2003 J. Rabaey, A. Chandrakasan, B. Nikolic] Sp11 CMPEN

More information

ELEG 305: Digital Signal Processing

ELEG 305: Digital Signal Processing ELEG 305: Digital Signal Processing Lecture 18: Applications of FFT Algorithms & Linear Filtering DFT Computation; Implementation of Discrete Time Systems Kenneth E. Barner Department of Electrical and

More information

A High-Speed Realization of Chinese Remainder Theorem

A High-Speed Realization of Chinese Remainder Theorem Proceedings of the 2007 WSEAS Int. Conference on Circuits, Systems, Signal and Telecommunications, Gold Coast, Australia, January 17-19, 2007 97 A High-Speed Realization of Chinese Remainder Theorem Shuangching

More information

Chapter 5. Digital Design and Computer Architecture, 2 nd Edition. David Money Harris and Sarah L. Harris. Chapter 5 <1>

Chapter 5. Digital Design and Computer Architecture, 2 nd Edition. David Money Harris and Sarah L. Harris. Chapter 5 <1> Chapter 5 Digital Design and Computer Architecture, 2 nd Edition David Money Harris and Sarah L. Harris Chapter 5 Chapter 5 :: Topics Introduction Arithmetic Circuits umber Systems Sequential Building

More information

Chapter 8. Low-Power VLSI Design Methodology

Chapter 8. Low-Power VLSI Design Methodology VLSI Design hapter 8 Low-Power VLSI Design Methodology Jin-Fu Li hapter 8 Low-Power VLSI Design Methodology Introduction Low-Power Gate-Level Design Low-Power Architecture-Level Design Algorithmic-Level

More information

ECSE 512 Digital Signal Processing I Fall 2010 FINAL EXAMINATION

ECSE 512 Digital Signal Processing I Fall 2010 FINAL EXAMINATION FINAL EXAMINATION 9:00 am 12:00 pm, December 20, 2010 Duration: 180 minutes Examiner: Prof. M. Vu Assoc. Examiner: Prof. B. Champagne There are 6 questions for a total of 120 points. This is a closed book

More information

DSP Configurations. responded with: thus the system function for this filter would be

DSP Configurations. responded with: thus the system function for this filter would be DSP Configurations In this lecture we discuss the different physical (or software) configurations that can be used to actually realize or implement DSP functions. Recall that the general form of a DSP

More information

Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length

Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length International Journal of Soft Computing and Engineering (IJSCE) Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length D.Ashok Kumar, P.Samundiswary Abstract Now a day

More information

FPGA accelerated multipliers over binary composite fields constructed via low hamming weight irreducible polynomials

FPGA accelerated multipliers over binary composite fields constructed via low hamming weight irreducible polynomials FPGA accelerated multipliers over binary composite fields constructed via low hamming weight irreducible polynomials C. Shu, S. Kwon and K. Gaj Abstract: The efficient design of digit-serial multipliers

More information

Proposal to Improve Data Format Conversions for a Hybrid Number System Processor

Proposal to Improve Data Format Conversions for a Hybrid Number System Processor Proposal to Improve Data Format Conversions for a Hybrid Number System Processor LUCIAN JURCA, DANIEL-IOAN CURIAC, AUREL GONTEAN, FLORIN ALEXA Department of Applied Electronics, Department of Automation

More information

Radar Systems Engineering Lecture 3 Review of Signals, Systems and Digital Signal Processing

Radar Systems Engineering Lecture 3 Review of Signals, Systems and Digital Signal Processing Radar Systems Engineering Lecture Review of Signals, Systems and Digital Signal Processing Dr. Robert M. O Donnell Guest Lecturer Radar Systems Course Review Signals, Systems & DSP // Block Diagram of

More information

Design and Implementation of Efficient Modulo 2 n +1 Adder

Design and Implementation of Efficient Modulo 2 n +1 Adder www..org 18 Design and Implementation of Efficient Modulo 2 n +1 Adder V. Jagadheesh 1, Y. Swetha 2 1,2 Research Scholar(INDIA) Abstract In this brief, we proposed an efficient weighted modulo (2 n +1)

More information

EECS150 - Digital Design Lecture 11 - Shifters & Counters. Register Summary

EECS150 - Digital Design Lecture 11 - Shifters & Counters. Register Summary EECS50 - Digital Design Lecture - Shifters & Counters February 24, 2003 John Wawrzynek Spring 2005 EECS50 - Lec-counters Page Register Summary All registers (this semester) based on Flip-flops: q 3 q 2

More information

Test Pattern Generator for Built-in Self-Test using Spectral Methods

Test Pattern Generator for Built-in Self-Test using Spectral Methods Test Pattern Generator for Built-in Self-Test using Spectral Methods Alok S. Doshi and Anand S. Mudlapur Auburn University 2 Dept. of Electrical and Computer Engineering, Auburn, AL, USA doshias,anand@auburn.edu

More information

Design of Low Power, High Speed Parallel Architecture of Cyclic Convolution Based on Fermat Number Transform (FNT)

Design of Low Power, High Speed Parallel Architecture of Cyclic Convolution Based on Fermat Number Transform (FNT) Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 6 (2013), pp. 641-650 Research India Publications http://www.ripublication.com/aeee.htm Design of Low Power, High Speed

More information

VLSI Arithmetic. Lecture 9: Carry-Save and Multi-Operand Addition. Prof. Vojin G. Oklobdzija University of California

VLSI Arithmetic. Lecture 9: Carry-Save and Multi-Operand Addition. Prof. Vojin G. Oklobdzija University of California VLSI Arithmetic Lecture 9: Carry-Save and Multi-Operand Addition Prof. Vojin G. Oklobdzija University of California http://www.ece.ucdavis.edu/acsel Carry-Save Addition* *from Parhami 2 June 18, 2003 Carry-Save

More information

Successive Approximation ADCs

Successive Approximation ADCs Department of Electrical and Computer Engineering Successive Approximation ADCs Vishal Saxena Vishal Saxena -1- Successive Approximation ADC Vishal Saxena -2- Data Converter Architectures Resolution [Bits]

More information

DSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman.

DSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman. SP esign Lecture 7 Unfolding cont. & Folding r. Fredrik Edman fredrik.edman@eit.lth.se Unfolding Unfolding creates a program with more than one iteration, J=unfolding factor Unfolding is a structured way

More information

International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research)

International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational

More information

DIGITAL SIGNAL PROCESSING UNIT I

DIGITAL SIGNAL PROCESSING UNIT I DIGITAL SIGNAL PROCESSING UNIT I CONTENTS 1.1 Introduction 1.2 Introduction about signals 1.3 Signals Processing 1.4 Classification of signals 1.5 Operations performed on signals 1.6 Properties of signals

More information

! Circular Convolution. " Linear convolution with circular convolution. ! Discrete Fourier Transform. " Linear convolution through circular

! Circular Convolution.  Linear convolution with circular convolution. ! Discrete Fourier Transform.  Linear convolution through circular Previously ESE 531: Digital Signal Processing Lec 22: April 18, 2017 Fast Fourier Transform (con t)! Circular Convolution " Linear convolution with circular convolution! Discrete Fourier Transform " Linear

More information

DISCRETE FOURIER TRANSFORM

DISCRETE FOURIER TRANSFORM DISCRETE FOURIER TRANSFORM 1. Introduction The sampled discrete-time fourier transform (DTFT) of a finite length, discrete-time signal is known as the discrete Fourier transform (DFT). The DFT contains

More information

KEYWORDS: Multiple Valued Logic (MVL), Residue Number System (RNS), Quinary Logic (Q uin), Quinary Full Adder, QFA, Quinary Half Adder, QHA.

KEYWORDS: Multiple Valued Logic (MVL), Residue Number System (RNS), Quinary Logic (Q uin), Quinary Full Adder, QFA, Quinary Half Adder, QHA. GLOBAL JOURNAL OF ADVANCED ENGINEERING TECHNOLOGIES AND SCIENCES DESIGN OF A QUINARY TO RESIDUE NUMBER SYSTEM CONVERTER USING MULTI-LEVELS OF CONVERSION Hassan Amin Osseily Electrical and Electronics Department,

More information

An Effective New CRT Based Reverse Converter for a Novel Moduli Set { 2 2n+1 1, 2 2n+1, 2 2n 1 }

An Effective New CRT Based Reverse Converter for a Novel Moduli Set { 2 2n+1 1, 2 2n+1, 2 2n 1 } An Effective New CRT Based Reverse Converter for a Novel Moduli Set +1 1, +1, 1 } Edem Kwedzo Bankas, Kazeem Alagbe Gbolagade Department of Computer Science, Faculty of Mathematical Sciences, University

More information

L15: Custom and ASIC VLSI Integration

L15: Custom and ASIC VLSI Integration L15: Custom and ASIC VLSI Integration Average Cost of one transistor 10 1 0.1 0.01 0.001 0.0001 0.00001 $ 0.000001 Gordon Moore, Keynote Presentation at ISSCC 2003 0.0000001 '68 '70 '72 '74 '76 '78 '80

More information

9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017

9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 9. Datapath Design Jacob Abraham Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017 October 2, 2017 ECE Department, University of Texas at Austin

More information

Design for Manufacturability and Power Estimation. Physical issues verification (DSM)

Design for Manufacturability and Power Estimation. Physical issues verification (DSM) Design for Manufacturability and Power Estimation Lecture 25 Alessandra Nardi Thanks to Prof. Jan Rabaey and Prof. K. Keutzer Physical issues verification (DSM) Interconnects Signal Integrity P/G integrity

More information

Low complexity state metric compression technique in turbo decoder

Low complexity state metric compression technique in turbo decoder LETTER IEICE Electronics Express, Vol.10, No.15, 1 7 Low complexity state metric compression technique in turbo decoder Qingqing Yang 1, Xiaofang Zhou 1a), Gerald E. Sobelman 2, and Xinxin Li 1, 3 1 State

More information

A HIGH-SPEED PROCESSOR FOR RECTANGULAR-TO-POLAR CONVERSION WITH APPLICATIONS IN DIGITAL COMMUNICATIONS *

A HIGH-SPEED PROCESSOR FOR RECTANGULAR-TO-POLAR CONVERSION WITH APPLICATIONS IN DIGITAL COMMUNICATIONS * Copyright IEEE 999: Published in the Proceedings of Globecom 999, Rio de Janeiro, Dec 5-9, 999 A HIGH-SPEED PROCESSOR FOR RECTAGULAR-TO-POLAR COVERSIO WITH APPLICATIOS I DIGITAL COMMUICATIOS * Dengwei

More information

High rate soft output Viterbi decoder

High rate soft output Viterbi decoder High rate soft output Viterbi decoder Eric Lüthi, Emmanuel Casseau Integrated Circuits for Telecommunications Laboratory Ecole Nationale Supérieure des Télécomunications de Bretagne BP 83-985 Brest Cedex

More information

VII. Discrete Fourier Transform (DFT) Chapter-8. A. Modulo Arithmetic. (n) N is n modulo N, n is an integer variable.

VII. Discrete Fourier Transform (DFT) Chapter-8. A. Modulo Arithmetic. (n) N is n modulo N, n is an integer variable. 1 VII. Discrete Fourier Transform (DFT) Chapter-8 A. Modulo Arithmetic (n) N is n modulo N, n is an integer variable. (n) N = n m N 0 n m N N-1, pick m Ex. (k) 4 W N = e -j2π/n 2 Note that W N k = 0 but

More information

A FRACTIONAL DELAY FIR FILTER BASED ON LAGRANGE INTERPOLATION OF FARROW STRUCTURE

A FRACTIONAL DELAY FIR FILTER BASED ON LAGRANGE INTERPOLATION OF FARROW STRUCTURE A Fractional Delay Fir Filter Based on Lagrange Interpolation of Farrow Structure A FRACTIONAL DELAY FIR FILTER BASED ON LAGRANGE INTERPOLATION OF FARROW STRUCTURE 1 K. RAJALAKSHMI, 2 SWATHI GONDI & 3

More information

DSP Design Lecture 5. Dr. Fredrik Edman.

DSP Design Lecture 5. Dr. Fredrik Edman. SP esign SP esign Lecture 5 Retiming r. Fredrik Edman fredrik.edman@eit.lth.se Fredrik Edman, ept. of Electrical and Information Technology, Lund University, Sweden-www.eit.lth.se SP esign Repetition Critical

More information

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs Article Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs E. George Walters III Department of Electrical and Computer Engineering, Penn State Erie,

More information

IMPLEMENTATION OF SIGNAL POWER ESTIMATION METHODS

IMPLEMENTATION OF SIGNAL POWER ESTIMATION METHODS IMPLEMENTATION OF SIGNAL POWER ESTIMATION METHODS Sei-Yeu Cheng and Joseph B. Evans Telecommunications & Information Sciences Laboratory Department of Electrical Engineering & Computer Science University

More information

Retiming. delay elements in a circuit without affecting the input/output characteristics of the circuit.

Retiming. delay elements in a circuit without affecting the input/output characteristics of the circuit. Chapter Retiming NCU EE -- SP VLSI esign. Chap. Tsung-Han Tsai 1 Retiming & A transformation techniques used to change the locations of delay elements in a circuit without affecting the input/output characteristics

More information

Cost/Performance Tradeoff of n-select Square Root Implementations

Cost/Performance Tradeoff of n-select Square Root Implementations Australian Computer Science Communications, Vol.22, No.4, 2, pp.9 6, IEEE Comp. Society Press Cost/Performance Tradeoff of n-select Square Root Implementations Wanming Chu and Yamin Li Computer Architecture

More information

EDISP (NWL2) (English) Digital Signal Processing Transform, FT, DFT. March 11, 2015

EDISP (NWL2) (English) Digital Signal Processing Transform, FT, DFT. March 11, 2015 EDISP (NWL2) (English) Digital Signal Processing Transform, FT, DFT March 11, 2015 Transform concept We want to analyze the signal represent it as built of some building blocks (well known signals), possibly

More information

Hardware Design I Chap. 4 Representative combinational logic

Hardware Design I Chap. 4 Representative combinational logic Hardware Design I Chap. 4 Representative combinational logic E-mail: shimada@is.naist.jp Already optimized circuits There are many optimized circuits which are well used You can reduce your design workload

More information

RADIX-2 FAST HARTLEY TRANSFORM REVISITED. de Oliveira, H.M.; Viviane L. Sousa, Silva, Helfarne A.N. and Campello de Souza, R.M.

RADIX-2 FAST HARTLEY TRANSFORM REVISITED. de Oliveira, H.M.; Viviane L. Sousa, Silva, Helfarne A.N. and Campello de Souza, R.M. RADIX- FAST HARTLY TRASFRM RVISITD de liveira, H.M.; Viviane L. Sousa, Silva, Helfarne A.. and Campello de Souza, R.M. Federal University of Pernambuco Departamento de letrônica e Sistemas - Recife - P,

More information

On-Line Hardware Implementation for Complex Exponential and Logarithm

On-Line Hardware Implementation for Complex Exponential and Logarithm On-Line Hardware Implementation for Complex Exponential and Logarithm Ali SKAF, Jean-Michel MULLER * and Alain GUYOT Laboratoire TIMA / INPG - 46, Av. Félix Viallet, 3831 Grenoble Cedex * Laboratoire LIP

More information

Proposal to Improve Data Format Conversions for a Hybrid Number System Processor

Proposal to Improve Data Format Conversions for a Hybrid Number System Processor Proceedings of the 11th WSEAS International Conference on COMPUTERS, Agios Nikolaos, Crete Island, Greece, July 6-8, 007 653 Proposal to Improve Data Format Conversions for a Hybrid Number System Processor

More information

Professor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs)

Professor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs) Professor Fearing EECS50/Problem Set Solution Fall 203 Due at 0 am, Thu. Oct. 3 (homework box under stairs). (25 pts) List Processor Timing. The list processor as discussed in lecture is described in RT

More information

FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES

FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES R.P.MEENAAKSHI SUNDHARI 1, Dr.R.ANITA 2 1 Department of ECE, Sasurie College of Engineering, Vijayamangalam, Tamilnadu, India.

More information

Fall 2011, EE123 Digital Signal Processing

Fall 2011, EE123 Digital Signal Processing Lecture 6 Miki Lustig, UCB September 11, 2012 Miki Lustig, UCB DFT and Sampling the DTFT X (e jω ) = e j4ω sin2 (5ω/2) sin 2 (ω/2) 5 x[n] 25 X(e jω ) 4 20 3 15 2 1 0 10 5 1 0 5 10 15 n 0 0 2 4 6 ω 5 reconstructed

More information

Microcontrollers. Fast - Fourier - Transformation. þ additional file AP EXE available

Microcontrollers. Fast - Fourier - Transformation. þ additional file AP EXE available Microcontrollers Apote AP634 þ additional file AP634.EXE available ast - ourier - Transformation The ast ourier Transformation (T is an algorithm frequently used in various applications, like telecommunication,

More information

Chapter 2: The Fourier Transform

Chapter 2: The Fourier Transform EEE, EEE Part A : Digital Signal Processing Chapter Chapter : he Fourier ransform he Fourier ransform. Introduction he sampled Fourier transform of a periodic, discrete-time signal is nown as the discrete

More information

Area-Time Optimal Adder with Relative Placement Generator

Area-Time Optimal Adder with Relative Placement Generator Area-Time Optimal Adder with Relative Placement Generator Abstract: This paper presents the design of a generator, for the production of area-time-optimal adders. A unique feature of this generator is

More information

Discrete Fourier Transform

Discrete Fourier Transform Discrete Fourier Transform Virtually all practical signals have finite length (e.g., sensor data, audio records, digital images, stock values, etc). Rather than considering such signals to be zero-padded

More information