Signal Flow Graphs. Roger Woods Programmable Systems Lab ECIT, Queen s University Belfast

Size: px
Start display at page:

Download "Signal Flow Graphs. Roger Woods Programmable Systems Lab ECIT, Queen s University Belfast"

Transcription

1 Signal Flow Graphs Roger Woods Programmable Systems Lab ECIT, Queen s University Belfast (Slides 2-13 taken from Signal Flow Graphs and Data Flow Graphs chapter by Keshab K. Parhi and Yanni Chen)

2 Contents Signal Flow Graphs (SFGs) DSP characteristcs Transfer FuncTon DerivaTon of SFG Mason s Gain Formula Data Flow Graphs (DFGs) DFG representatons Equivalent Single- rate from MulT- rate DFG SFG/DFG transformatons ReTming Pipelining 2

3 Signal Flow Graph RepresentaTon DSP characteristcs Data independence Input samples are processed periodically (typically a clock) Tasks are repeated infinite number of Tmes => non- terminatng programs High computatonal requirements Trend towards DSP micros and FPGAs Towards SFGs (and DFGs) powerful representatons of signal processing algorithms as they represent the operatons using a finite number of nodes 3

4 Key features of SFGs SFGs can be transformed using retming, pipelining, unfolding and folding resultng forms have same input- output characteristcs but different constraints Pipelining and retming can be used to reduce clock period Folding can be used to Tme- multplex i.e. reduce area Unfolding leads to: lower iteraton periods in so`ware implementatons (by highlightng concurrency) Faster hardware by exploitng parallel implementatons Key goal of transforming representaton to meet performance requirements by adaptng levels of pipelining and parallelism (or both) 4

5 Signal Flow Graph SFG is a collecton of nodes and directed edges A directed edge (j,k) denotes a linear transform from the signal at node j to the signal at node k Edges are usually restricted to multplier or delay elements 5

6 TransposiTon Flow graph reversal/transpositon can be applied to single- input single- output (SISO) systems by reversing directons of all edges, exchanging input/output nodes while keeping edge gain or edge delay unchanged 6

7 Mason's gain formula A few useful terminologies in Mason's gain formula have to be defined related to an SFG Forward path: a path that connects a source node to a sink node in which no node is traversed more than once. Loop: a closed path without crossing the same point more than once. Loop gain: the product of all the transfer functons in the loop. Non- touching or non- interactng loops: two loops are non- touching or non- interactng if they have no nodes in common. 7

8 Mason's gain formula M = transfer functon or gain of the system Y = output node X = input node N = total number of forward paths between X and Y Δ = determinant of the graph = 1- loop gains + non- touching loop gains taken two at a Tme - non- touching loop gains taken three at a Tme + M j = gain of the j th forward path between X and Y Δ j = 1- loops remaining a`er eliminatng the j th forward path i.e. eliminate the loops touching the j th forward path from the graph. If none of the loops remains, Δ j = 1. 8

9 SFG example 1) Find the forward paths and their corresponding gains Two forward paths exist in this SFG: M 1 = G 1 G 2 G 3 and M 2 = G 4 2) Find the loops and their corresponding gains There are four loops in this example: Loop 1 = - G 1 H 1 Loop 2 = - G 3 H 2 Loop 3 = - G 1 G 2 G 3 H 3 Loop 4 = - G 4 H 3 9

10 SFG example (Cont d) 3) Find the Δ j If we eliminate the path M 1 =G 1 G 2 G 3 from the SFG, no complete loops remain, so Δ 1 = 1. Similarly, if the path M 2 =G 4 is eliminated from the SFG, no complete loops remain neither, so Δ 2 = 1 as well. 4) Find the determinant Δ Only one pair of non- touching loops is in this SFG, i.e. Loop 1 and Loop 2, thus non- touching loop gains taken two at a Tme = (- G 1 H 1 )(- G 3 H 2 ). Therefore, 10

11 SFG example (Cont d) Δ = 1- loop gains + non- touching loop gains taken 2 at a Tme = 1 - (- G 1 H 1 - G 3 H 2 - G 1 G 2 G 3 H 3 - G 4 H 3 ) + (- G 1 H 1 )(- G 3 H 2 ) = 1 + G 1 H 1 + G 3 H 2 + G 1 G 2 G 3 H 3 + G 4 H 3 + G 1 G 3 H 1 H 2 5) The final step is to apply the Mason's gain formula 11

12 Data Flow Graph In DFGs, nodes represent computatons/functons and directed edges represent data paths with non- negatve numbers associated with them Captures data- driven property of DSP algorithms - node fires when all input data is available. y n = ay n- 1 + x n Block diagram ConvenTonal DFG Synchronous DFG 12

13 SDFGs and mult- rate Synchronous DFG is a DFG special case where the number of data samples produced or consumed by each node in each executon is specified a priori can be used for mult- rate A, B &C operate at frequencies f A, f B & f C A consumes f A samples & produces 3f A samples per Jme unit. B consumes 5f B samples, therefore using equality 3f A = 5f B, we have f B =3 f A /5. Equivalent SFG Similarly, f C = 2f B /3 = 2f A /5 which can be used to compute the frequencies for A, B and C for a specified required input sample rate. 13

14 SequenTal algorithms OperaTons are one a`er other. Throughput dictated by the Tme to perform P 1, P 2 and P 3. OperaTons can be carried out at the same Tme. 14

15 Some definitons - Latency Latency Tme needed to generate an output from when the corresponding input is entered. Throughput rate is defined as the rate at which the outputs (and/or inputs) are produced. 15

16 Why is latency important? Latency can dictate performance in applicatons with feedback loops 16

17 MulTplexing Technique to increase the throughput rate of sequental algorithms Processors (PEs) will operate in a Tme shared fashion. 17

18 MulTplexing SequenTal: TR = 1/(tP 1 + tp 2 + tp 3 ) and output every cycle. E.g. if tp 1 = tp 2 = tp 3 =10, TR=1/30 Interleaved: TR = 1/ (tp 1 or tp 2 or tp 3 ) E.g. if tp 1 = tp 2 = tp 3 =10, TR=1/10 tp n time taken for processor P n 18

19 Some definitons - pipelining Method to increase the throughput rate of a sequental (and parallel) algorithm TR = 1/ (tp 1 or tp 2 or tp 3 ) If tp 1 = tp 2 = tp 3 =10, TR=1/10 Problems in feedback loops allow interleaving only. clock rate = f output = 1 per cycle TR = f clock rate = 4Xf output = 1 per 4 cycles TR = f 19

20 ApplicaTon of pipelining # Clock Input Node 1 Node 2 Node 3 Output 0 x 0 a 0 x 0 1 x 1 a 0 x 1 a 0 x 0 a 0 x 0 y 0 2 x 2 a 0 x 2 a 0 x 1 + a 1 x 0 a 0 x 1 + a 1 x 0 y 1 3 x 3 a 0 x 3 a 0 x 2 + a 1 x 1 a 0 x 2 + a 1 x 1 + a 2 x 0 y 2 4 x 4 a 0 x 4 a 0 x 3 + a 1 x 2 a 0 x 3 + a 1 x 2 + a 2 x 1 y 3 20

21 ApplicaTon of pipelining #2 Why not apply pipelining to adder chain in FIR filter? Clock Input Node 1 Node 2 Node 3 Output 0 x 0 a 0 x 0 1 x 1 a 0 x 1 a 0 x 0 2 x 2 a 0 x 2 a 0 x 1 + a 1 x 0 a 0 x 0 y 0 3 x 3 a 0 x 3 a 0 x 2 + a 1 x 1 a 0 x 1 + a 1 x 0 + a 2 x 0 X 4 x 4 a 0 x 4 a 0 x 3 + a 1 x 2 a 0 x 2 + a 1 x 1 + a 2 x 1 X 21

22 Formal method to apply pipelining Problem was that pipelining was not applied in a systematc fashion providing a functonal change in the system specificaton i.e. changing the actual functon. A systematc approach is thus required to apply retming to the system in such a way that it does not corrupt the system specificaton. ReTming equates to moving around existng delays Does not alter the latency of the system Reduces (hopefully) the critcal path of the system Can be employed to retme the node 22

23 FormulaTon of retming ReTming FormulaTon w = w + r(v) - r(u) ProperTes of retming Weight of the retmed path p = V > V >..V k is given by w r (p)= w (p) + r(v k ) - r(v 0 ) ReTming does not change the number of delays in a cycle. ReTming does not alter the iteraton bound (see later) in a DFG as the number of delays in a cycle does not change Adding the constant value j to the retming value of each node does not alter number of delays in edges of the retmed graph. 23

24 ReTming of FIR Filter Express as DFG w r (1 2) = w(1 2) + r(2)- r(1) 2,3,4 are multpliers, w r (1 3) = w(1 3) + r(3)- r(1) 5, 6 are adders, 1 is x input w r (1 4) = w(1 4) + r(4)- r(1) Can express relatonships w r (2 5) = w(2 5) + r(5)- r(2) for all edges w r (3 5) = w(3 5) + r(5)- r(3) w r (4 6) = w(4 6) + r(6)- r(4) w r (5 6) = w(5 6) + r(6)- r(5) 24

25 ReTming of FIR Filter Using the following retming values, r(1)=- 1, r(2)=- 1, r(3)=- 1, r(4)=- 1, r(5)=0, r(6)=0 we get a pipeline a`er the multpliers w r (1 2) = w(1 2) + r(2)- r(1) w r (1 3) = w(1 3) + r(3)- r(1) w r (1 4) = w(1 4) + r(4)- r(1) w r (2 5) = w(2 5) + r(5)- r(2) w r (3 5) = w(3 5) + r(5)- r(3) w r (4 6) = w(4 6) + r(6)- r(4) w r (5 6) = w(5 6) + r(6)- r(5) = 0 + (- 1) (- 1) = 0 = 1 + (- 1) (- 1) = 1 = 2 + (- 1) (- 1) = 2 = 0 + (0) (- 1) = 1 = 0 + (0) (- 1) = 1 = 0 + (0) (- 1) = 1 = 0 + (0) (0) = 0 25

26 Conclusions RepresentaTons for DSP systems Signal Flow Graphs (SFGs) Transfer FuncTon DerivaTon using Mason s Gain Formula Data Flow Graphs (DFGs) Various form including synchronous DFG ConstrucTng an Equivalent Single- rate DFG from MulT- rate TransformaTons Pipelining allows speed of resultng implementatons to be increased. ReTming can be used to perform delay transfer but that is only part of the issue. Can increase(unfold)/decrease(fold) parallelism to match performance covered later 26

VLSI Signal Processing

VLSI Signal Processing VLSI Signal Processing Lecture 1 Pipelining & Retiming ADSP Lecture1 - Pipelining & Retiming (cwliu@twins.ee.nctu.edu.tw) 1-1 Introduction DSP System Real time requirement Data driven synchronized by data

More information

Pipelining and Parallel Processing

Pipelining and Parallel Processing Pipelining and Parallel Processing Pipelining ---reduction in the critical path increase the clock speed, or reduce power consumption at same speed Parallel Processing ---multiple outputs are computed

More information

SFG and Mason s Rule : A revision

SFG and Mason s Rule : A revision SFG and Mason s Rule : A revision Andersen Ang 2016-Nov-29 SFG and Mason s Rule Vu Pham Review SFG: Signal-Flow Graph SFG is a directed graph SFG is used to model signal flow in a system SFG can be used

More information

Chaper 4: Retiming (Tái định thì) GV: Hoàng Trang

Chaper 4: Retiming (Tái định thì) GV: Hoàng Trang ĐẠI HỌC QUỐC GIA TP.HỒ CHÍ MINH TRƯỜNG ĐẠI HỌC BÁCH KHOA KHOA ĐIỆN-ĐIỆN TỬ BỘ MÔN KỸ THUẬT ĐIỆN TỬ XỬ LÝ TÍN HiỆU SỐ VỚI FPGA Chaper 4: Retiming (Tái định thì) GV: Hoàng Trang Email: hoangtrang@hcmut.edu.vn

More information

Retiming. delay elements in a circuit without affecting the input/output characteristics of the circuit.

Retiming. delay elements in a circuit without affecting the input/output characteristics of the circuit. Chapter Retiming NCU EE -- SP VLSI esign. Chap. Tsung-Han Tsai 1 Retiming & A transformation techniques used to change the locations of delay elements in a circuit without affecting the input/output characteristics

More information

DSP Design Lecture 5. Dr. Fredrik Edman.

DSP Design Lecture 5. Dr. Fredrik Edman. SP esign SP esign Lecture 5 Retiming r. Fredrik Edman fredrik.edman@eit.lth.se Fredrik Edman, ept. of Electrical and Information Technology, Lund University, Sweden-www.eit.lth.se SP esign Repetition Critical

More information

2D Critical path = 6. Retiming moving delays. Retiming - Pipelining. Delays can be moved from ALL inputs to ALL outputs

2D Critical path = 6. Retiming moving delays. Retiming - Pipelining. Delays can be moved from ALL inputs to ALL outputs SP esgn SP esgn Retmng movng delays Retmng elays can be moved from LL nputs to LL outputs Reduce Crtcal Path faster reduced power consumpton Reduced number of Regster Vktor Öwall, ept. of Electrcal and

More information

DSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman.

DSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman. SP esign Lecture 7 Unfolding cont. & Folding r. Fredrik Edman fredrik.edman@eit.lth.se Unfolding Unfolding creates a program with more than one iteration, J=unfolding factor Unfolding is a structured way

More information

PIPELINING AND PARALLEL PROCESSING. UNIT 4 Real time Signal Processing

PIPELINING AND PARALLEL PROCESSING. UNIT 4 Real time Signal Processing PIPELINING AND PARALLEL PROCESSING UNIT 4 Real time Signal Proceing Content Introduction Pipeling of FIR Digital Filter Parallel proceing Low power Deign FIR Digital Filter A FIR Filter i defined a follow:

More information

Pipelining and Parallel Processing

Pipelining and Parallel Processing Pipelining and Parallel Processing ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall, 010 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/ Outlines

More information

Transformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms

Transformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms International Journal of Electronics and Communication Engineering. ISSN 0974-66 Volume 4, Number (0), pp.83-94 International Research Publication House http://www.irphouse.com Transformation Techniques

More information

NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1

NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 Multi-processor vs. Multi-computer architecture µp vs. DSP RISC vs. DSP RISC Reduced-instruction-set Register-to-register operation Higher throughput by using

More information

Control Systems (CS)

Control Systems (CS) Control Systems (CS) Lecture-0- Signal Flow raphs Dr. Imtiaz ussain Associate Professor Mehran University of Engineering & Technology Jamshoro, Pakistan email: imtiaz.hussain@faculty.muet.edu.pk URL :http://imtiazhussainkalwar.weebly.com/

More information

A VLSI DSP DESIGN AND IMPLEMENTATION O F ALL POLE LATTICE FILTER USING RETIMING METHODOLOGY

A VLSI DSP DESIGN AND IMPLEMENTATION O F ALL POLE LATTICE FILTER USING RETIMING METHODOLOGY A VLSI DSP DESIGN AND IMPLEMENTATION O F ALL POLE LATTICE FILTER USING RETIMING METHODOLOGY 1 PURU GUPTA & 2 TARUN KUMAR RAWAT Dept. of Electronics and Communication Engineering, Netaji Subhas Institute

More information

Pipelining and Parallel Processing

Pipelining and Parallel Processing Pipelining and Parallel Processing Lan-Da Van ( 倫 ), Ph. D. Department of omputer Science National hiao Tung University Taiwan, R.O.. Spring, 007 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/

More information

Serial Parallel Multiplier Design in Quantum-dot Cellular Automata

Serial Parallel Multiplier Design in Quantum-dot Cellular Automata Serial Parallel Multiplier Design in Quantum-dot Cellular Automata Heumpil Cho and Earl E. Swartzlander, Jr. Application Specific Processor Group Department of Electrical and Computer Engineering The University

More information

Professor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs)

Professor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs) Professor Fearing EECS50/Problem Set Solution Fall 203 Due at 0 am, Thu. Oct. 3 (homework box under stairs). (25 pts) List Processor Timing. The list processor as discussed in lecture is described in RT

More information

DSP Configurations. responded with: thus the system function for this filter would be

DSP Configurations. responded with: thus the system function for this filter would be DSP Configurations In this lecture we discuss the different physical (or software) configurations that can be used to actually realize or implement DSP functions. Recall that the general form of a DSP

More information

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION

TOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION 1 2 MULTIPLIERLESS FILTER DESIGN Realzaton of flters wthout full-fledged multplers Some sldes based on support materal by W. Wolf for hs book Modern VLSI Desgn, 3 rd edton. Partly based on followng papers:

More information

Digital Signal Processing with Protein Molecules and DNA Strands

Digital Signal Processing with Protein Molecules and DNA Strands Digital Signal Processing with Protein Molecules and DNA Strands Keshab K. Parhi Electrical and Computer Engineering University of Minnesota, Minneapolis Nov. 10, 2010 Talk at EECS Dept., Berkeley Acknowledgements

More information

Discrete-Time Systems

Discrete-Time Systems FIR Filters With this chapter we turn to systems as opposed to signals. The systems discussed in this chapter are finite impulse response (FIR) digital filters. The term digital filter arises because these

More information

CHAPTER 5 : REDUCTION OF MULTIPLE SUBSYSTEMS

CHAPTER 5 : REDUCTION OF MULTIPLE SUBSYSTEMS CHAPTER 5 : REDUCTION OF MULTIPLE SUBSYSTEMS Objectives Students should be able to: Reduce a block diagram of multiple subsystems to a single block representing the transfer function from input to output

More information

J א א J א א א F א א א א

J א א J א א א F א א א א J CHAPTER # 4 SIGNAL FLOW GRAPH (SFG) 1. Introduction For complex control systems, the block diagram reduction technique is cumbersome. An alternative method for determining the relationship between system

More information

SECTION 2: BLOCK DIAGRAMS & SIGNAL FLOW GRAPHS

SECTION 2: BLOCK DIAGRAMS & SIGNAL FLOW GRAPHS SECTION 2: BLOCK DIAGRAMS & SIGNAL FLOW GRAPHS MAE 4421 Control of Aerospace & Mechanical Systems 2 Block Diagram Manipulation Block Diagrams 3 In the introductory section we saw examples of block diagrams

More information

Digital Signal Processing for Embedded Communications and Biomedical Systems

Digital Signal Processing for Embedded Communications and Biomedical Systems igital Signal Processing for Embedded Communications and Biomedical Systems Keshab K. Parhi istinguished McKnight University Professor University of Minnesota, Minneapolis http://www.ece.umn.edu/users/parhi

More information

VLSI System Design Part V : High-Level Synthesis(2) Oct Feb.2007

VLSI System Design Part V : High-Level Synthesis(2) Oct Feb.2007 VLSI System Design Part V : High-Level Synthesis(2) Oct.2006 - Feb.2007 Lecturer : Tsuyoshi Isshiki Dept. Communications and Integrated Systems, Tokyo Institute of Technology isshiki@vlsi.ss.titech.ac.jp

More information

Numbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture

Numbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture Computational Platforms Numbering Systems Basic Building Blocks Scaling and Round-off Noise Computational Platforms Viktor Öwall viktor.owall@eit.lth.seowall@eit lth Standard Processors or Special Purpose

More information

Latches. October 13, 2003 Latches 1

Latches. October 13, 2003 Latches 1 Latches The second part of CS231 focuses on sequential circuits, where we add memory to the hardware that we ve already seen. Our schedule will be very similar to before: We first show how primitive memory

More information

Loop Scheduling and Software Pipelining \course\cpeg421-08s\topic-7.ppt 1

Loop Scheduling and Software Pipelining \course\cpeg421-08s\topic-7.ppt 1 Loop Scheduling and Software Pipelining 2008-04-24 \course\cpeg421-08s\topic-7.ppt 1 Reading List Slides: Topic 7 and 7a Other papers as assigned in class or homework: 2008-04-24 \course\cpeg421-08s\topic-7.ppt

More information

Analog vs. discrete signals

Analog vs. discrete signals Analog vs. discrete signals Continuous-time signals are also known as analog signals because their amplitude is analogous (i.e., proportional) to the physical quantity they represent. Discrete-time signals

More information

DES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models

DES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models 4. Petri Nets Introduction Different Classes of Petri Net Petri net properties Analysis of Petri net models 1 Petri Nets C.A Petri, TU Darmstadt, 1962 A mathematical and graphical modeling method. Describe

More information

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs

Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs Article Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs E. George Walters III Department of Electrical and Computer Engineering, Penn State Erie,

More information

ECE 645: Lecture 2. Carry-Lookahead, Carry-Select, & Hybrid Adders

ECE 645: Lecture 2. Carry-Lookahead, Carry-Select, & Hybrid Adders ECE 645: Lecture 2 Carry-Lookahead, Carry-Select, & Hybrid Adders Required Reading Behrooz Parhami, Computer Arithmetic: Algorithms and Hardware Design Chapter 6, Carry-Lookahead Adders Sections 6.1-6.2.

More information

Ch 7. Finite State Machines. VII - Finite State Machines Contemporary Logic Design 1

Ch 7. Finite State Machines. VII - Finite State Machines Contemporary Logic Design 1 Ch 7. Finite State Machines VII - Finite State Machines Contemporary Logic esign 1 Finite State Machines Sequential circuits primitive sequential elements combinational logic Models for representing sequential

More information

AN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER

AN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER Indian Journal of Electronics and Electrical Engineering (IJEEE) Vol.2.No.1 2014pp1-6 available at: www.goniv.com Paper Received :05-03-2014 Paper Published:28-03-2014 Paper Reviewed by: 1. John Arhter

More information

ECE 448 Lecture 6. Finite State Machines. State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL Code. George Mason University

ECE 448 Lecture 6. Finite State Machines. State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL Code. George Mason University ECE 448 Lecture 6 Finite State Machines State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL Code George Mason University Required reading P. Chu, FPGA Prototyping by VHDL Examples

More information

Hilbert Transformator IP Cores

Hilbert Transformator IP Cores Introduction Hilbert Transformator IP Cores Martin Kumm December 27, 28 The Hilbert Transform is an important component in communication systems, e.g. for single sideband modulation/demodulation, amplitude

More information

Digital Filter Structures. Basic IIR Digital Filter Structures. of an LTI digital filter is given by the convolution sum or, by the linear constant

Digital Filter Structures. Basic IIR Digital Filter Structures. of an LTI digital filter is given by the convolution sum or, by the linear constant Digital Filter Chapter 8 Digital Filter Block Diagram Representation Equivalent Basic FIR Digital Filter Basic IIR Digital Filter. Block Diagram Representation In the time domain, the input-output relations

More information

A HIGH-SPEED PROCESSOR FOR RECTANGULAR-TO-POLAR CONVERSION WITH APPLICATIONS IN DIGITAL COMMUNICATIONS *

A HIGH-SPEED PROCESSOR FOR RECTANGULAR-TO-POLAR CONVERSION WITH APPLICATIONS IN DIGITAL COMMUNICATIONS * Copyright IEEE 999: Published in the Proceedings of Globecom 999, Rio de Janeiro, Dec 5-9, 999 A HIGH-SPEED PROCESSOR FOR RECTAGULAR-TO-POLAR COVERSIO WITH APPLICATIOS I DIGITAL COMMUICATIOS * Dengwei

More information

Cost/Performance Tradeoffs:

Cost/Performance Tradeoffs: Cost/Performance Tradeoffs: a case study Digital Systems Architecture I. L10 - Multipliers 1 Binary Multiplication x a b n bits n bits EASY PROBLEM: design combinational circuit to multiply tiny (1-, 2-,

More information

An Optimized Hardware Architecture of Montgomery Multiplication Algorithm

An Optimized Hardware Architecture of Montgomery Multiplication Algorithm An Optimized Hardware Architecture of Montgomery Multiplication Algorithm Miaoqing Huang 1, Kris Gaj 2, Soonhak Kwon 3, and Tarek El-Ghazawi 1 1 The George Washington University, Washington, DC 20052,

More information

Identification of ARX, OE, FIR models with the least squares method

Identification of ARX, OE, FIR models with the least squares method Identification of ARX, OE, FIR models with the least squares method CHEM-E7145 Advanced Process Control Methods Lecture 2 Contents Identification of ARX model with the least squares minimizing the equation

More information

Pipelined Viterbi Decoder Using FPGA

Pipelined Viterbi Decoder Using FPGA Research Journal of Applied Sciences, Engineering and Technology 5(4): 1362-1372, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: July 05, 2012 Accepted: August

More information

Micro-architecture Pipelining Optimization with Throughput- Aware Floorplanning

Micro-architecture Pipelining Optimization with Throughput- Aware Floorplanning Micro-architecture Pipelining Optimization with Throughput- Aware Floorplanning Yuchun Ma* Zhuoyuan Li* Jason Cong Xianlong Hong Glenn Reinman Sheqin Dong* Qiang Zhou *Department of Computer Science &

More information

Lecture Notes: Markov chains Tuesday, September 16 Dannie Durand

Lecture Notes: Markov chains Tuesday, September 16 Dannie Durand Computational Genomics and Molecular Biology, Lecture Notes: Markov chains Tuesday, September 6 Dannie Durand In the last lecture, we introduced Markov chains, a mathematical formalism for modeling how

More information

ECE 5775 (Fall 17) High-Level Digital Design Automation. Scheduling: Exact Methods

ECE 5775 (Fall 17) High-Level Digital Design Automation. Scheduling: Exact Methods ECE 5775 (Fall 17) High-Level Digital Design Automation Scheduling: Exact Methods Announcements Sign up for the first student-led discussions today One slot remaining Presenters for the 1st session will

More information

COE 202: Digital Logic Design Sequential Circuits Part 4. Dr. Ahmad Almulhem ahmadsm AT kfupm Phone: Office:

COE 202: Digital Logic Design Sequential Circuits Part 4. Dr. Ahmad Almulhem   ahmadsm AT kfupm Phone: Office: COE 202: Digital Logic Design Sequential Circuits Part 4 Dr. Ahmad Almulhem Email: ahmadsm AT kfupm Phone: 860-7554 Office: 22-324 Objectives Registers Counters Registers 0 1 n-1 A register is a group

More information

ECC for NAND Flash. Osso Vahabzadeh. TexasLDPC Inc. Flash Memory Summit 2017 Santa Clara, CA 1

ECC for NAND Flash. Osso Vahabzadeh. TexasLDPC Inc. Flash Memory Summit 2017 Santa Clara, CA 1 ECC for NAND Flash Osso Vahabzadeh TexasLDPC Inc. 1 Overview Why Is Error Correction Needed in Flash Memories? Error Correction Codes Fundamentals Low-Density Parity-Check (LDPC) Codes LDPC Encoding and

More information

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS 8. Design Tradeoffs 6.004x Computation Structures Part 1 Digital Circuits Copyright 2015 MIT EECS 6.004 Computation Structures L08: Design Tradeoffs, Slide #1 There are a large number of implementations

More information

FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES

FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES R.P.MEENAAKSHI SUNDHARI 1, Dr.R.ANITA 2 1 Department of ECE, Sasurie College of Engineering, Vijayamangalam, Tamilnadu, India.

More information

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS 8. Design Tradeoffs 6.004x Computation Structures Part 1 Digital Circuits Copyright 2015 MIT EECS 6.004 Computation Structures L08: Design Tradeoffs, Slide #1 There are a large number of implementations

More information

Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks

Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks Yufei Ma, Yu Cao, Sarma Vrudhula,

More information

Matrix Computations: Direct Methods II. May 5, 2014 Lecture 11

Matrix Computations: Direct Methods II. May 5, 2014 Lecture 11 Matrix Computations: Direct Methods II May 5, 2014 ecture Summary You have seen an example of how a typical matrix operation (an important one) can be reduced to using lower level BS routines that would

More information

EEE2135 Digital Logic Design

EEE2135 Digital Logic Design EEE2135 Digital Logic Design Chapter 7. Sequential Circuits Design 서강대학교 전자공학과 1. Model of Sequential Circuits 1) Sequential vs. Combinational Circuits a. Sequential circuits: Outputs depend on both the

More information

Computer Architecture 10. Fast Adders

Computer Architecture 10. Fast Adders Computer Architecture 10 Fast s Ma d e wi t h Op e n Of f i c e. o r g 1 Carry Problem Addition is primary mechanism in implementing arithmetic operations Slow addition directly affects the total performance

More information

A Digit-Serial Systolic Multiplier for Finite Fields GF(2 m )

A Digit-Serial Systolic Multiplier for Finite Fields GF(2 m ) A Digit-Serial Systolic Multiplier for Finite Fields GF( m ) Chang Hoon Kim, Sang Duk Han, and Chun Pyo Hong Department of Computer and Information Engineering Taegu University 5 Naeri, Jinryang, Kyungsan,

More information

Causality and Time. The Happens-Before Relation

Causality and Time. The Happens-Before Relation Causality and Time The Happens-Before Relation Because executions are sequences of events, they induce a total order on all the events It is possible that two events by different processors do not influence

More information

Distributed Computing. Synchronization. Dr. Yingwu Zhu

Distributed Computing. Synchronization. Dr. Yingwu Zhu Distributed Computing Synchronization Dr. Yingwu Zhu Topics to Discuss Physical Clocks Logical Clocks: Lamport Clocks Classic paper: Time, Clocks, and the Ordering of Events in a Distributed System Lamport

More information

Lecture 11 FIR Filters

Lecture 11 FIR Filters Lecture 11 FIR Filters Fundamentals of Digital Signal Processing Spring, 2012 Wei-Ta Chu 2012/4/12 1 The Unit Impulse Sequence Any sequence can be represented in this way. The equation is true if k ranges

More information

Power Optimization in Programmable Processors and ASIC Implementations of Linear Systems: Transformation-based Approach

Power Optimization in Programmable Processors and ASIC Implementations of Linear Systems: Transformation-based Approach Power Optimization in Programmable Processors and ASIC Implementations of Linear Systems: Transformation-based Approach Mani Srivastava Miodrag Potkonjak AT&T Bell Laboratories Computer Science Department

More information

Fast Fir Algorithm Based Area- Efficient Parallel Fir Digital Filter Structures

Fast Fir Algorithm Based Area- Efficient Parallel Fir Digital Filter Structures Fast Fir Algorithm Based Area- Efficient Parallel Fir Digital Filter Structures Ms. P.THENMOZHI 1, Ms. C.THAMILARASI 2 and Mr. V.VENGATESHWARAN 3 Assistant Professor, Dept. of ECE, J.K.K.College of Technology,

More information

School of EECS Seoul National University

School of EECS Seoul National University 4!4 07$ 8902808 3 School of EECS Seoul National University Introduction Low power design 3974/:.9 43 Increasing demand on performance and integrity of VLSI circuits Popularity of portable devices Low power

More information

FPGA Resource Utilization Estimates for NI crio LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008

FPGA Resource Utilization Estimates for NI crio LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 FPGA Resource Utilization Estimates for NI crio-9104 LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 Note: The numbers presented in this document are estimates. Actual resource usage for your

More information

EECE 301 Signals & Systems

EECE 301 Signals & Systems EECE 30 Signals & Systems Prof. Mark Fowler Note Set #22 D-T Systems: DT Systems & Difference Equations / D-T System Models So far we ve seen how to use the FT to analyze circuits Use phasors and standard

More information

Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length

Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length International Journal of Soft Computing and Engineering (IJSCE) Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length D.Ashok Kumar, P.Samundiswary Abstract Now a day

More information

CS 700: Quantitative Methods & Experimental Design in Computer Science

CS 700: Quantitative Methods & Experimental Design in Computer Science CS 700: Quantitative Methods & Experimental Design in Computer Science Sanjeev Setia Dept of Computer Science George Mason University Logistics Grade: 35% project, 25% Homework assignments 20% midterm,

More information

Pipelining. Traditional Execution. CS 365 Lecture 12 Prof. Yih Huang. add ld beq CS CS 365 2

Pipelining. Traditional Execution. CS 365 Lecture 12 Prof. Yih Huang. add ld beq CS CS 365 2 Pipelining CS 365 Lecture 12 Prof. Yih Huang CS 365 1 Traditional Execution 1 2 3 4 1 2 3 4 5 1 2 3 add ld beq CS 365 2 1 Pipelined Execution 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5

More information

COVER SHEET: Problem#: Points

COVER SHEET: Problem#: Points EEL 4712 Midterm 3 Spring 2017 VERSION 1 Name: UFID: Sign here to give permission for your test to be returned in class, where others might see your score: IMPORTANT: Please be neat and write (or draw)

More information

FPGA Resource Utilization Estimates for NI PXI-7854R. LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008

FPGA Resource Utilization Estimates for NI PXI-7854R. LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 FPGA Resource Utilization Estimates for NI PXI-7854R LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 Note: The numbers presented in this document are estimates. Actual resource usage for your

More information

3.1 Symmetry & Coordinate Graphs

3.1 Symmetry & Coordinate Graphs 3.1 Symmetry & Coordinate Graphs I. Symmetry Point symmetry two distinct points P and P are symmetric with respect to point M if and only is M is the midpoint of PP' When the definition is extended to

More information

Petri nets. s 1 s 2. s 3 s 4. directed arcs.

Petri nets. s 1 s 2. s 3 s 4. directed arcs. Petri nets Petri nets Petri nets are a basic model of parallel and distributed systems (named after Carl Adam Petri). The basic idea is to describe state changes in a system with transitions. @ @R s 1

More information

Lecture 11. Advanced Dividers

Lecture 11. Advanced Dividers Lecture 11 Advanced Dividers Required Reading Behrooz Parhami, Computer Arithmetic: Algorithms and Hardware Design Chapter 15 Variation in Dividers 15.3, Combinational and Array Dividers Chapter 16, Division

More information

EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture. Rajeevan Amirtharajah University of California, Davis

EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture. Rajeevan Amirtharajah University of California, Davis EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture Rajeevan Amirtharajah University of California, Davis Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top

More information

Models for representing sequential circuits

Models for representing sequential circuits Sequential Circuits Models for representing sequential circuits Finite-state machines (Moore and Mealy) Representation of memory (states) Changes in state (transitions) Design procedure State diagrams

More information

Pipelined Computations

Pipelined Computations Chapter 5 Slide 155 Pipelined Computations Pipelined Computations Slide 156 Problem divided into a series of tasks that have to be completed one after the other (the basis of sequential programming). Each

More information

CORDIC, Divider, Square Root

CORDIC, Divider, Square Root 4// EE6B: VLSI Signal Processing CORDIC, Divider, Square Root Prof. Dejan Marković ee6b@gmail.com Iterative algorithms CORDIC Division Square root Lecture Overview Topics covered include Algorithms and

More information

Using Mason's Rule to Analyze DSP Networks

Using Mason's Rule to Analyze DSP Networks Using Mason's Rule to Analyze DSP Networks by Richard Lyons, Besser Associates 1 α This article discusses a powerful analysis method (well known to our analog control system engineering brethren) to obtain

More information

Worst-Case Execution Time Analysis. LS 12, TU Dortmund

Worst-Case Execution Time Analysis. LS 12, TU Dortmund Worst-Case Execution Time Analysis Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 02, 03 May 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 53 Most Essential Assumptions for Real-Time Systems Upper

More information

On the Complexity of Mapping Pipelined Filtering Services on Heterogeneous Platforms

On the Complexity of Mapping Pipelined Filtering Services on Heterogeneous Platforms On the Complexity of Mapping Pipelined Filtering Services on Heterogeneous Platforms Anne Benoit, Fanny Dufossé and Yves Robert LIP, École Normale Supérieure de Lyon, France {Anne.Benoit Fanny.Dufosse

More information

Logic BIST. Sungho Kang Yonsei University

Logic BIST. Sungho Kang Yonsei University Logic BIST Sungho Kang Yonsei University Outline Introduction Basics Issues Weighted Random Pattern Generation BIST Architectures Deterministic BIST Conclusion 2 Built In Self Test Test/ Normal Input Pattern

More information

Testability. Shaahin Hessabi. Sharif University of Technology. Adapted from the presentation prepared by book authors.

Testability. Shaahin Hessabi. Sharif University of Technology. Adapted from the presentation prepared by book authors. Testability Lecture 6: Logic Simulation Shaahin Hessabi Department of Computer Engineering Sharif University of Technology Adapted from the presentation prepared by book authors Slide 1 of 27 Outline What

More information

Control Systems, Lecture 05

Control Systems, Lecture 05 Control Systems, Lecture 05 İbrahim Beklan Küçükdemiral Yıldız Teknik Üniversitesi 2015 1 / 33 Laplace Transform Solution of State Equations In previous sections, systems were modeled in state space, where

More information

DESIGN AND IMPLEMENTATION OF EFFICIENT HIGH SPEED VEDIC MULTIPLIER USING REVERSIBLE GATES

DESIGN AND IMPLEMENTATION OF EFFICIENT HIGH SPEED VEDIC MULTIPLIER USING REVERSIBLE GATES DESIGN AND IMPLEMENTATION OF EFFICIENT HIGH SPEED VEDIC MULTIPLIER USING REVERSIBLE GATES Boddu Suresh 1, B.Venkateswara Reddy 2 1 2 PG Scholar, Associate Professor, HOD, Dept of ECE Vikas College of Engineering

More information

Implementation of Discrete-Time Systems

Implementation of Discrete-Time Systems EEE443 Digital Signal Processing Implementation of Discrete-Time Systems Dr. Shahrel A. Suandi PPKEE, Engineering Campus, USM Introduction A linear-time invariant system (LTI) is described by linear constant

More information

EE 144/244: Fundamental Algorithms for System Modeling, Analysis, and Optimization Fall 2016

EE 144/244: Fundamental Algorithms for System Modeling, Analysis, and Optimization Fall 2016 EE 144/244: Fundamental Algorithms for System Modeling, Analysis, and Optimization Fall 2016 Discrete Event Simulation Stavros Tripakis University of California, Berkeley Stavros Tripakis (UC Berkeley)

More information

Clocks in Asynchronous Systems

Clocks in Asynchronous Systems Clocks in Asynchronous Systems The Internet Network Time Protocol (NTP) 8 Goals provide the ability to externally synchronize clients across internet to UTC provide reliable service tolerating lengthy

More information

Network Flows. CTU FEE Department of control engineering. March 28, 2017

Network Flows. CTU FEE Department of control engineering. March 28, 2017 Network Flows Zdeněk Hanzálek, Přemysl Šůcha hanzalek@fel.cvut.cz CTU FEE Department of control engineering March 28, 2017 Z. Hanzálek (CTU FEE) Network Flows March 28, 2017 1 / 44 Table of contents 1

More information

Digital Electronics. Part A

Digital Electronics. Part A Digital Electronics Final Examination Part A Winter 2004-05 Student Name: Date: lass Period: Total Points: Multiple hoice Directions: Select the letter of the response which best completes the item or

More information

New Implementations of the WG Stream Cipher

New Implementations of the WG Stream Cipher New Implementations of the WG Stream Cipher Hayssam El-Razouk, Arash Reyhani-Masoleh, and Guang Gong Abstract This paper presents two new hardware designs of the WG-28 cipher, one for the multiple output

More information

A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER. Jesus Garcia and Michael J. Schulte

A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER. Jesus Garcia and Michael J. Schulte A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER Jesus Garcia and Michael J. Schulte Lehigh University Department of Computer Science and Engineering Bethlehem, PA 15 ABSTRACT Galois field arithmetic

More information

Chapter 2 Examples of Applications for Connectivity and Causality Analysis

Chapter 2 Examples of Applications for Connectivity and Causality Analysis Chapter 2 Examples of Applications for Connectivity and Causality Analysis Abstract Connectivity and causality have a lot of potential applications, among which we focus on analysis and design of large-scale

More information

Clock-driven scheduling

Clock-driven scheduling Clock-driven scheduling Also known as static or off-line scheduling Michal Sojka Czech Technical University in Prague, Faculty of Electrical Engineering, Department of Control Engineering November 8, 2017

More information

Analog and Telecommunication Electronics

Analog and Telecommunication Electronics Politecnico di Torino - ICT School Analog and Telecommunication Electronics D3 - A/D converters» Error taxonomy» ADC parameters» Structures and taxonomy» Mixed converters» Origin of errors 12/05/2011-1

More information

Parallelism in Computer Arithmetic: A Historical Perspective

Parallelism in Computer Arithmetic: A Historical Perspective Parallelism in Computer Arithmetic: A Historical Perspective 21s 2s 199s 198s 197s 196s 195s Behrooz Parhami Aug. 218 Parallelism in Computer Arithmetic Slide 1 University of California, Santa Barbara

More information

ECE317 : Feedback and Control

ECE317 : Feedback and Control ECE317 : Feedback and Control Lecture : Steady-state error Dr. Richard Tymerski Dept. of Electrical and Computer Engineering Portland State University 1 Course roadmap Modeling Analysis Design Laplace

More information

Performance Evaluation of Queuing Systems

Performance Evaluation of Queuing Systems Performance Evaluation of Queuing Systems Introduction to Queuing Systems System Performance Measures & Little s Law Equilibrium Solution of Birth-Death Processes Analysis of Single-Station Queuing Systems

More information

Low-Cost and Area-Efficient FPGA Implementations of Lattice-Based Cryptography

Low-Cost and Area-Efficient FPGA Implementations of Lattice-Based Cryptography Low-Cost and Area-Efficient FPGA Implementations of Lattice-Based Cryptography Aydin Aysu, Cameron Patterson and Patrick Schaumont Electrical and Computer Engineering Department Virginia Tech Blacksburg,

More information

Chapter 3: Block Diagrams and Signal Flow Graphs

Chapter 3: Block Diagrams and Signal Flow Graphs Chapter 3: Block Diagrams and Signal Flow Graphs Farid Golnaraghi, Simon Fraser University Benjamin C. Kuo, University of Illinois ISBN: 978 0 470 04896 2 Introduction In this chapter, we discuss graphical

More information

/ / MET Day 000 NC1^ INRTL MNVR I E E PRE SLEEP K PRE SLEEP R E

/ / MET Day 000 NC1^ INRTL MNVR I E E PRE SLEEP K PRE SLEEP R E 05//0 5:26:04 09/6/0 (259) 6 7 8 9 20 2 22 2 09/7 0 02 0 000/00 0 02 0 04 05 06 07 08 09 0 2 ay 000 ^ 0 X Y / / / / ( %/ ) 2 /0 2 ( ) ^ 4 / Y/ 2 4 5 6 7 8 9 2 X ^ X % 2 // 09/7/0 (260) ay 000 02 05//0

More information

Compiling for Parallelism & Locality. Example. Announcement Need to make up November 14th lecture. Last time Data dependences and loops

Compiling for Parallelism & Locality. Example. Announcement Need to make up November 14th lecture. Last time Data dependences and loops Complng for Parallelsm & Localty Announcement Need to make up November 14th lecture Last tme Data dependences and loops Today Fnsh data dependence analyss for loops CS553 Lecture Complng for Parallelsm

More information