Signal Flow Graphs. Roger Woods Programmable Systems Lab ECIT, Queen s University Belfast
|
|
- Oswin Bailey
- 5 years ago
- Views:
Transcription
1 Signal Flow Graphs Roger Woods Programmable Systems Lab ECIT, Queen s University Belfast (Slides 2-13 taken from Signal Flow Graphs and Data Flow Graphs chapter by Keshab K. Parhi and Yanni Chen)
2 Contents Signal Flow Graphs (SFGs) DSP characteristcs Transfer FuncTon DerivaTon of SFG Mason s Gain Formula Data Flow Graphs (DFGs) DFG representatons Equivalent Single- rate from MulT- rate DFG SFG/DFG transformatons ReTming Pipelining 2
3 Signal Flow Graph RepresentaTon DSP characteristcs Data independence Input samples are processed periodically (typically a clock) Tasks are repeated infinite number of Tmes => non- terminatng programs High computatonal requirements Trend towards DSP micros and FPGAs Towards SFGs (and DFGs) powerful representatons of signal processing algorithms as they represent the operatons using a finite number of nodes 3
4 Key features of SFGs SFGs can be transformed using retming, pipelining, unfolding and folding resultng forms have same input- output characteristcs but different constraints Pipelining and retming can be used to reduce clock period Folding can be used to Tme- multplex i.e. reduce area Unfolding leads to: lower iteraton periods in so`ware implementatons (by highlightng concurrency) Faster hardware by exploitng parallel implementatons Key goal of transforming representaton to meet performance requirements by adaptng levels of pipelining and parallelism (or both) 4
5 Signal Flow Graph SFG is a collecton of nodes and directed edges A directed edge (j,k) denotes a linear transform from the signal at node j to the signal at node k Edges are usually restricted to multplier or delay elements 5
6 TransposiTon Flow graph reversal/transpositon can be applied to single- input single- output (SISO) systems by reversing directons of all edges, exchanging input/output nodes while keeping edge gain or edge delay unchanged 6
7 Mason's gain formula A few useful terminologies in Mason's gain formula have to be defined related to an SFG Forward path: a path that connects a source node to a sink node in which no node is traversed more than once. Loop: a closed path without crossing the same point more than once. Loop gain: the product of all the transfer functons in the loop. Non- touching or non- interactng loops: two loops are non- touching or non- interactng if they have no nodes in common. 7
8 Mason's gain formula M = transfer functon or gain of the system Y = output node X = input node N = total number of forward paths between X and Y Δ = determinant of the graph = 1- loop gains + non- touching loop gains taken two at a Tme - non- touching loop gains taken three at a Tme + M j = gain of the j th forward path between X and Y Δ j = 1- loops remaining a`er eliminatng the j th forward path i.e. eliminate the loops touching the j th forward path from the graph. If none of the loops remains, Δ j = 1. 8
9 SFG example 1) Find the forward paths and their corresponding gains Two forward paths exist in this SFG: M 1 = G 1 G 2 G 3 and M 2 = G 4 2) Find the loops and their corresponding gains There are four loops in this example: Loop 1 = - G 1 H 1 Loop 2 = - G 3 H 2 Loop 3 = - G 1 G 2 G 3 H 3 Loop 4 = - G 4 H 3 9
10 SFG example (Cont d) 3) Find the Δ j If we eliminate the path M 1 =G 1 G 2 G 3 from the SFG, no complete loops remain, so Δ 1 = 1. Similarly, if the path M 2 =G 4 is eliminated from the SFG, no complete loops remain neither, so Δ 2 = 1 as well. 4) Find the determinant Δ Only one pair of non- touching loops is in this SFG, i.e. Loop 1 and Loop 2, thus non- touching loop gains taken two at a Tme = (- G 1 H 1 )(- G 3 H 2 ). Therefore, 10
11 SFG example (Cont d) Δ = 1- loop gains + non- touching loop gains taken 2 at a Tme = 1 - (- G 1 H 1 - G 3 H 2 - G 1 G 2 G 3 H 3 - G 4 H 3 ) + (- G 1 H 1 )(- G 3 H 2 ) = 1 + G 1 H 1 + G 3 H 2 + G 1 G 2 G 3 H 3 + G 4 H 3 + G 1 G 3 H 1 H 2 5) The final step is to apply the Mason's gain formula 11
12 Data Flow Graph In DFGs, nodes represent computatons/functons and directed edges represent data paths with non- negatve numbers associated with them Captures data- driven property of DSP algorithms - node fires when all input data is available. y n = ay n- 1 + x n Block diagram ConvenTonal DFG Synchronous DFG 12
13 SDFGs and mult- rate Synchronous DFG is a DFG special case where the number of data samples produced or consumed by each node in each executon is specified a priori can be used for mult- rate A, B &C operate at frequencies f A, f B & f C A consumes f A samples & produces 3f A samples per Jme unit. B consumes 5f B samples, therefore using equality 3f A = 5f B, we have f B =3 f A /5. Equivalent SFG Similarly, f C = 2f B /3 = 2f A /5 which can be used to compute the frequencies for A, B and C for a specified required input sample rate. 13
14 SequenTal algorithms OperaTons are one a`er other. Throughput dictated by the Tme to perform P 1, P 2 and P 3. OperaTons can be carried out at the same Tme. 14
15 Some definitons - Latency Latency Tme needed to generate an output from when the corresponding input is entered. Throughput rate is defined as the rate at which the outputs (and/or inputs) are produced. 15
16 Why is latency important? Latency can dictate performance in applicatons with feedback loops 16
17 MulTplexing Technique to increase the throughput rate of sequental algorithms Processors (PEs) will operate in a Tme shared fashion. 17
18 MulTplexing SequenTal: TR = 1/(tP 1 + tp 2 + tp 3 ) and output every cycle. E.g. if tp 1 = tp 2 = tp 3 =10, TR=1/30 Interleaved: TR = 1/ (tp 1 or tp 2 or tp 3 ) E.g. if tp 1 = tp 2 = tp 3 =10, TR=1/10 tp n time taken for processor P n 18
19 Some definitons - pipelining Method to increase the throughput rate of a sequental (and parallel) algorithm TR = 1/ (tp 1 or tp 2 or tp 3 ) If tp 1 = tp 2 = tp 3 =10, TR=1/10 Problems in feedback loops allow interleaving only. clock rate = f output = 1 per cycle TR = f clock rate = 4Xf output = 1 per 4 cycles TR = f 19
20 ApplicaTon of pipelining # Clock Input Node 1 Node 2 Node 3 Output 0 x 0 a 0 x 0 1 x 1 a 0 x 1 a 0 x 0 a 0 x 0 y 0 2 x 2 a 0 x 2 a 0 x 1 + a 1 x 0 a 0 x 1 + a 1 x 0 y 1 3 x 3 a 0 x 3 a 0 x 2 + a 1 x 1 a 0 x 2 + a 1 x 1 + a 2 x 0 y 2 4 x 4 a 0 x 4 a 0 x 3 + a 1 x 2 a 0 x 3 + a 1 x 2 + a 2 x 1 y 3 20
21 ApplicaTon of pipelining #2 Why not apply pipelining to adder chain in FIR filter? Clock Input Node 1 Node 2 Node 3 Output 0 x 0 a 0 x 0 1 x 1 a 0 x 1 a 0 x 0 2 x 2 a 0 x 2 a 0 x 1 + a 1 x 0 a 0 x 0 y 0 3 x 3 a 0 x 3 a 0 x 2 + a 1 x 1 a 0 x 1 + a 1 x 0 + a 2 x 0 X 4 x 4 a 0 x 4 a 0 x 3 + a 1 x 2 a 0 x 2 + a 1 x 1 + a 2 x 1 X 21
22 Formal method to apply pipelining Problem was that pipelining was not applied in a systematc fashion providing a functonal change in the system specificaton i.e. changing the actual functon. A systematc approach is thus required to apply retming to the system in such a way that it does not corrupt the system specificaton. ReTming equates to moving around existng delays Does not alter the latency of the system Reduces (hopefully) the critcal path of the system Can be employed to retme the node 22
23 FormulaTon of retming ReTming FormulaTon w = w + r(v) - r(u) ProperTes of retming Weight of the retmed path p = V > V >..V k is given by w r (p)= w (p) + r(v k ) - r(v 0 ) ReTming does not change the number of delays in a cycle. ReTming does not alter the iteraton bound (see later) in a DFG as the number of delays in a cycle does not change Adding the constant value j to the retming value of each node does not alter number of delays in edges of the retmed graph. 23
24 ReTming of FIR Filter Express as DFG w r (1 2) = w(1 2) + r(2)- r(1) 2,3,4 are multpliers, w r (1 3) = w(1 3) + r(3)- r(1) 5, 6 are adders, 1 is x input w r (1 4) = w(1 4) + r(4)- r(1) Can express relatonships w r (2 5) = w(2 5) + r(5)- r(2) for all edges w r (3 5) = w(3 5) + r(5)- r(3) w r (4 6) = w(4 6) + r(6)- r(4) w r (5 6) = w(5 6) + r(6)- r(5) 24
25 ReTming of FIR Filter Using the following retming values, r(1)=- 1, r(2)=- 1, r(3)=- 1, r(4)=- 1, r(5)=0, r(6)=0 we get a pipeline a`er the multpliers w r (1 2) = w(1 2) + r(2)- r(1) w r (1 3) = w(1 3) + r(3)- r(1) w r (1 4) = w(1 4) + r(4)- r(1) w r (2 5) = w(2 5) + r(5)- r(2) w r (3 5) = w(3 5) + r(5)- r(3) w r (4 6) = w(4 6) + r(6)- r(4) w r (5 6) = w(5 6) + r(6)- r(5) = 0 + (- 1) (- 1) = 0 = 1 + (- 1) (- 1) = 1 = 2 + (- 1) (- 1) = 2 = 0 + (0) (- 1) = 1 = 0 + (0) (- 1) = 1 = 0 + (0) (- 1) = 1 = 0 + (0) (0) = 0 25
26 Conclusions RepresentaTons for DSP systems Signal Flow Graphs (SFGs) Transfer FuncTon DerivaTon using Mason s Gain Formula Data Flow Graphs (DFGs) Various form including synchronous DFG ConstrucTng an Equivalent Single- rate DFG from MulT- rate TransformaTons Pipelining allows speed of resultng implementatons to be increased. ReTming can be used to perform delay transfer but that is only part of the issue. Can increase(unfold)/decrease(fold) parallelism to match performance covered later 26
VLSI Signal Processing
VLSI Signal Processing Lecture 1 Pipelining & Retiming ADSP Lecture1 - Pipelining & Retiming (cwliu@twins.ee.nctu.edu.tw) 1-1 Introduction DSP System Real time requirement Data driven synchronized by data
More informationPipelining and Parallel Processing
Pipelining and Parallel Processing Pipelining ---reduction in the critical path increase the clock speed, or reduce power consumption at same speed Parallel Processing ---multiple outputs are computed
More informationSFG and Mason s Rule : A revision
SFG and Mason s Rule : A revision Andersen Ang 2016-Nov-29 SFG and Mason s Rule Vu Pham Review SFG: Signal-Flow Graph SFG is a directed graph SFG is used to model signal flow in a system SFG can be used
More informationChaper 4: Retiming (Tái định thì) GV: Hoàng Trang
ĐẠI HỌC QUỐC GIA TP.HỒ CHÍ MINH TRƯỜNG ĐẠI HỌC BÁCH KHOA KHOA ĐIỆN-ĐIỆN TỬ BỘ MÔN KỸ THUẬT ĐIỆN TỬ XỬ LÝ TÍN HiỆU SỐ VỚI FPGA Chaper 4: Retiming (Tái định thì) GV: Hoàng Trang Email: hoangtrang@hcmut.edu.vn
More informationRetiming. delay elements in a circuit without affecting the input/output characteristics of the circuit.
Chapter Retiming NCU EE -- SP VLSI esign. Chap. Tsung-Han Tsai 1 Retiming & A transformation techniques used to change the locations of delay elements in a circuit without affecting the input/output characteristics
More informationDSP Design Lecture 5. Dr. Fredrik Edman.
SP esign SP esign Lecture 5 Retiming r. Fredrik Edman fredrik.edman@eit.lth.se Fredrik Edman, ept. of Electrical and Information Technology, Lund University, Sweden-www.eit.lth.se SP esign Repetition Critical
More information2D Critical path = 6. Retiming moving delays. Retiming - Pipelining. Delays can be moved from ALL inputs to ALL outputs
SP esgn SP esgn Retmng movng delays Retmng elays can be moved from LL nputs to LL outputs Reduce Crtcal Path faster reduced power consumpton Reduced number of Regster Vktor Öwall, ept. of Electrcal and
More informationDSP Design Lecture 7. Unfolding cont. & Folding. Dr. Fredrik Edman.
SP esign Lecture 7 Unfolding cont. & Folding r. Fredrik Edman fredrik.edman@eit.lth.se Unfolding Unfolding creates a program with more than one iteration, J=unfolding factor Unfolding is a structured way
More informationPIPELINING AND PARALLEL PROCESSING. UNIT 4 Real time Signal Processing
PIPELINING AND PARALLEL PROCESSING UNIT 4 Real time Signal Proceing Content Introduction Pipeling of FIR Digital Filter Parallel proceing Low power Deign FIR Digital Filter A FIR Filter i defined a follow:
More informationPipelining and Parallel Processing
Pipelining and Parallel Processing ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall, 010 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/ Outlines
More informationTransformation Techniques for Real Time High Speed Implementation of Nonlinear Algorithms
International Journal of Electronics and Communication Engineering. ISSN 0974-66 Volume 4, Number (0), pp.83-94 International Research Publication House http://www.irphouse.com Transformation Techniques
More informationNCU EE -- DSP VLSI Design. Tsung-Han Tsai 1
NCU EE -- DSP VLSI Design. Tsung-Han Tsai 1 Multi-processor vs. Multi-computer architecture µp vs. DSP RISC vs. DSP RISC Reduced-instruction-set Register-to-register operation Higher throughput by using
More informationControl Systems (CS)
Control Systems (CS) Lecture-0- Signal Flow raphs Dr. Imtiaz ussain Associate Professor Mehran University of Engineering & Technology Jamshoro, Pakistan email: imtiaz.hussain@faculty.muet.edu.pk URL :http://imtiazhussainkalwar.weebly.com/
More informationA VLSI DSP DESIGN AND IMPLEMENTATION O F ALL POLE LATTICE FILTER USING RETIMING METHODOLOGY
A VLSI DSP DESIGN AND IMPLEMENTATION O F ALL POLE LATTICE FILTER USING RETIMING METHODOLOGY 1 PURU GUPTA & 2 TARUN KUMAR RAWAT Dept. of Electronics and Communication Engineering, Netaji Subhas Institute
More informationPipelining and Parallel Processing
Pipelining and Parallel Processing Lan-Da Van ( 倫 ), Ph. D. Department of omputer Science National hiao Tung University Taiwan, R.O.. Spring, 007 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/
More informationSerial Parallel Multiplier Design in Quantum-dot Cellular Automata
Serial Parallel Multiplier Design in Quantum-dot Cellular Automata Heumpil Cho and Earl E. Swartzlander, Jr. Application Specific Processor Group Department of Electrical and Computer Engineering The University
More informationProfessor Fearing EECS150/Problem Set Solution Fall 2013 Due at 10 am, Thu. Oct. 3 (homework box under stairs)
Professor Fearing EECS50/Problem Set Solution Fall 203 Due at 0 am, Thu. Oct. 3 (homework box under stairs). (25 pts) List Processor Timing. The list processor as discussed in lecture is described in RT
More informationDSP Configurations. responded with: thus the system function for this filter would be
DSP Configurations In this lecture we discuss the different physical (or software) configurations that can be used to actually realize or implement DSP functions. Recall that the general form of a DSP
More informationTOPICS MULTIPLIERLESS FILTER DESIGN ELEMENTARY SCHOOL ALGORITHM MULTIPLICATION
1 2 MULTIPLIERLESS FILTER DESIGN Realzaton of flters wthout full-fledged multplers Some sldes based on support materal by W. Wolf for hs book Modern VLSI Desgn, 3 rd edton. Partly based on followng papers:
More informationDigital Signal Processing with Protein Molecules and DNA Strands
Digital Signal Processing with Protein Molecules and DNA Strands Keshab K. Parhi Electrical and Computer Engineering University of Minnesota, Minneapolis Nov. 10, 2010 Talk at EECS Dept., Berkeley Acknowledgements
More informationDiscrete-Time Systems
FIR Filters With this chapter we turn to systems as opposed to signals. The systems discussed in this chapter are finite impulse response (FIR) digital filters. The term digital filter arises because these
More informationCHAPTER 5 : REDUCTION OF MULTIPLE SUBSYSTEMS
CHAPTER 5 : REDUCTION OF MULTIPLE SUBSYSTEMS Objectives Students should be able to: Reduce a block diagram of multiple subsystems to a single block representing the transfer function from input to output
More informationJ א א J א א א F א א א א
J CHAPTER # 4 SIGNAL FLOW GRAPH (SFG) 1. Introduction For complex control systems, the block diagram reduction technique is cumbersome. An alternative method for determining the relationship between system
More informationSECTION 2: BLOCK DIAGRAMS & SIGNAL FLOW GRAPHS
SECTION 2: BLOCK DIAGRAMS & SIGNAL FLOW GRAPHS MAE 4421 Control of Aerospace & Mechanical Systems 2 Block Diagram Manipulation Block Diagrams 3 In the introductory section we saw examples of block diagrams
More informationDigital Signal Processing for Embedded Communications and Biomedical Systems
igital Signal Processing for Embedded Communications and Biomedical Systems Keshab K. Parhi istinguished McKnight University Professor University of Minnesota, Minneapolis http://www.ece.umn.edu/users/parhi
More informationVLSI System Design Part V : High-Level Synthesis(2) Oct Feb.2007
VLSI System Design Part V : High-Level Synthesis(2) Oct.2006 - Feb.2007 Lecturer : Tsuyoshi Isshiki Dept. Communications and Integrated Systems, Tokyo Institute of Technology isshiki@vlsi.ss.titech.ac.jp
More informationNumbering Systems. Computational Platforms. Scaling and Round-off Noise. Special Purpose. here that is dedicated architecture
Computational Platforms Numbering Systems Basic Building Blocks Scaling and Round-off Noise Computational Platforms Viktor Öwall viktor.owall@eit.lth.seowall@eit lth Standard Processors or Special Purpose
More informationLatches. October 13, 2003 Latches 1
Latches The second part of CS231 focuses on sequential circuits, where we add memory to the hardware that we ve already seen. Our schedule will be very similar to before: We first show how primitive memory
More informationLoop Scheduling and Software Pipelining \course\cpeg421-08s\topic-7.ppt 1
Loop Scheduling and Software Pipelining 2008-04-24 \course\cpeg421-08s\topic-7.ppt 1 Reading List Slides: Topic 7 and 7a Other papers as assigned in class or homework: 2008-04-24 \course\cpeg421-08s\topic-7.ppt
More informationAnalog vs. discrete signals
Analog vs. discrete signals Continuous-time signals are also known as analog signals because their amplitude is analogous (i.e., proportional) to the physical quantity they represent. Discrete-time signals
More informationDES. 4. Petri Nets. Introduction. Different Classes of Petri Net. Petri net properties. Analysis of Petri net models
4. Petri Nets Introduction Different Classes of Petri Net Petri net properties Analysis of Petri net models 1 Petri Nets C.A Petri, TU Darmstadt, 1962 A mathematical and graphical modeling method. Describe
More informationReduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs
Article Reduced-Area Constant-Coefficient and Multiple-Constant Multipliers for Xilinx FPGAs with 6-Input LUTs E. George Walters III Department of Electrical and Computer Engineering, Penn State Erie,
More informationECE 645: Lecture 2. Carry-Lookahead, Carry-Select, & Hybrid Adders
ECE 645: Lecture 2 Carry-Lookahead, Carry-Select, & Hybrid Adders Required Reading Behrooz Parhami, Computer Arithmetic: Algorithms and Hardware Design Chapter 6, Carry-Lookahead Adders Sections 6.1-6.2.
More informationCh 7. Finite State Machines. VII - Finite State Machines Contemporary Logic Design 1
Ch 7. Finite State Machines VII - Finite State Machines Contemporary Logic esign 1 Finite State Machines Sequential circuits primitive sequential elements combinational logic Models for representing sequential
More informationAN IMPROVED LOW LATENCY SYSTOLIC STRUCTURED GALOIS FIELD MULTIPLIER
Indian Journal of Electronics and Electrical Engineering (IJEEE) Vol.2.No.1 2014pp1-6 available at: www.goniv.com Paper Received :05-03-2014 Paper Published:28-03-2014 Paper Reviewed by: 1. John Arhter
More informationECE 448 Lecture 6. Finite State Machines. State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL Code. George Mason University
ECE 448 Lecture 6 Finite State Machines State Diagrams, State Tables, Algorithmic State Machine (ASM) Charts, and VHDL Code George Mason University Required reading P. Chu, FPGA Prototyping by VHDL Examples
More informationHilbert Transformator IP Cores
Introduction Hilbert Transformator IP Cores Martin Kumm December 27, 28 The Hilbert Transform is an important component in communication systems, e.g. for single sideband modulation/demodulation, amplitude
More informationDigital Filter Structures. Basic IIR Digital Filter Structures. of an LTI digital filter is given by the convolution sum or, by the linear constant
Digital Filter Chapter 8 Digital Filter Block Diagram Representation Equivalent Basic FIR Digital Filter Basic IIR Digital Filter. Block Diagram Representation In the time domain, the input-output relations
More informationA HIGH-SPEED PROCESSOR FOR RECTANGULAR-TO-POLAR CONVERSION WITH APPLICATIONS IN DIGITAL COMMUNICATIONS *
Copyright IEEE 999: Published in the Proceedings of Globecom 999, Rio de Janeiro, Dec 5-9, 999 A HIGH-SPEED PROCESSOR FOR RECTAGULAR-TO-POLAR COVERSIO WITH APPLICATIOS I DIGITAL COMMUICATIOS * Dengwei
More informationCost/Performance Tradeoffs:
Cost/Performance Tradeoffs: a case study Digital Systems Architecture I. L10 - Multipliers 1 Binary Multiplication x a b n bits n bits EASY PROBLEM: design combinational circuit to multiply tiny (1-, 2-,
More informationAn Optimized Hardware Architecture of Montgomery Multiplication Algorithm
An Optimized Hardware Architecture of Montgomery Multiplication Algorithm Miaoqing Huang 1, Kris Gaj 2, Soonhak Kwon 3, and Tarek El-Ghazawi 1 1 The George Washington University, Washington, DC 20052,
More informationIdentification of ARX, OE, FIR models with the least squares method
Identification of ARX, OE, FIR models with the least squares method CHEM-E7145 Advanced Process Control Methods Lecture 2 Contents Identification of ARX model with the least squares minimizing the equation
More informationPipelined Viterbi Decoder Using FPGA
Research Journal of Applied Sciences, Engineering and Technology 5(4): 1362-1372, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: July 05, 2012 Accepted: August
More informationMicro-architecture Pipelining Optimization with Throughput- Aware Floorplanning
Micro-architecture Pipelining Optimization with Throughput- Aware Floorplanning Yuchun Ma* Zhuoyuan Li* Jason Cong Xianlong Hong Glenn Reinman Sheqin Dong* Qiang Zhou *Department of Computer Science &
More informationLecture Notes: Markov chains Tuesday, September 16 Dannie Durand
Computational Genomics and Molecular Biology, Lecture Notes: Markov chains Tuesday, September 6 Dannie Durand In the last lecture, we introduced Markov chains, a mathematical formalism for modeling how
More informationECE 5775 (Fall 17) High-Level Digital Design Automation. Scheduling: Exact Methods
ECE 5775 (Fall 17) High-Level Digital Design Automation Scheduling: Exact Methods Announcements Sign up for the first student-led discussions today One slot remaining Presenters for the 1st session will
More informationCOE 202: Digital Logic Design Sequential Circuits Part 4. Dr. Ahmad Almulhem ahmadsm AT kfupm Phone: Office:
COE 202: Digital Logic Design Sequential Circuits Part 4 Dr. Ahmad Almulhem Email: ahmadsm AT kfupm Phone: 860-7554 Office: 22-324 Objectives Registers Counters Registers 0 1 n-1 A register is a group
More informationECC for NAND Flash. Osso Vahabzadeh. TexasLDPC Inc. Flash Memory Summit 2017 Santa Clara, CA 1
ECC for NAND Flash Osso Vahabzadeh TexasLDPC Inc. 1 Overview Why Is Error Correction Needed in Flash Memories? Error Correction Codes Fundamentals Low-Density Parity-Check (LDPC) Codes LDPC Encoding and
More information8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS
8. Design Tradeoffs 6.004x Computation Structures Part 1 Digital Circuits Copyright 2015 MIT EECS 6.004 Computation Structures L08: Design Tradeoffs, Slide #1 There are a large number of implementations
More informationFAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES
FAST FIR ALGORITHM BASED AREA-EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURES R.P.MEENAAKSHI SUNDHARI 1, Dr.R.ANITA 2 1 Department of ECE, Sasurie College of Engineering, Vijayamangalam, Tamilnadu, India.
More information8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS
8. Design Tradeoffs 6.004x Computation Structures Part 1 Digital Circuits Copyright 2015 MIT EECS 6.004 Computation Structures L08: Design Tradeoffs, Slide #1 There are a large number of implementations
More informationOptimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks Yufei Ma, Yu Cao, Sarma Vrudhula,
More informationMatrix Computations: Direct Methods II. May 5, 2014 Lecture 11
Matrix Computations: Direct Methods II May 5, 2014 ecture Summary You have seen an example of how a typical matrix operation (an important one) can be reduced to using lower level BS routines that would
More informationEEE2135 Digital Logic Design
EEE2135 Digital Logic Design Chapter 7. Sequential Circuits Design 서강대학교 전자공학과 1. Model of Sequential Circuits 1) Sequential vs. Combinational Circuits a. Sequential circuits: Outputs depend on both the
More informationComputer Architecture 10. Fast Adders
Computer Architecture 10 Fast s Ma d e wi t h Op e n Of f i c e. o r g 1 Carry Problem Addition is primary mechanism in implementing arithmetic operations Slow addition directly affects the total performance
More informationA Digit-Serial Systolic Multiplier for Finite Fields GF(2 m )
A Digit-Serial Systolic Multiplier for Finite Fields GF( m ) Chang Hoon Kim, Sang Duk Han, and Chun Pyo Hong Department of Computer and Information Engineering Taegu University 5 Naeri, Jinryang, Kyungsan,
More informationCausality and Time. The Happens-Before Relation
Causality and Time The Happens-Before Relation Because executions are sequences of events, they induce a total order on all the events It is possible that two events by different processors do not influence
More informationDistributed Computing. Synchronization. Dr. Yingwu Zhu
Distributed Computing Synchronization Dr. Yingwu Zhu Topics to Discuss Physical Clocks Logical Clocks: Lamport Clocks Classic paper: Time, Clocks, and the Ordering of Events in a Distributed System Lamport
More informationLecture 11 FIR Filters
Lecture 11 FIR Filters Fundamentals of Digital Signal Processing Spring, 2012 Wei-Ta Chu 2012/4/12 1 The Unit Impulse Sequence Any sequence can be represented in this way. The equation is true if k ranges
More informationPower Optimization in Programmable Processors and ASIC Implementations of Linear Systems: Transformation-based Approach
Power Optimization in Programmable Processors and ASIC Implementations of Linear Systems: Transformation-based Approach Mani Srivastava Miodrag Potkonjak AT&T Bell Laboratories Computer Science Department
More informationFast Fir Algorithm Based Area- Efficient Parallel Fir Digital Filter Structures
Fast Fir Algorithm Based Area- Efficient Parallel Fir Digital Filter Structures Ms. P.THENMOZHI 1, Ms. C.THAMILARASI 2 and Mr. V.VENGATESHWARAN 3 Assistant Professor, Dept. of ECE, J.K.K.College of Technology,
More informationSchool of EECS Seoul National University
4!4 07$ 8902808 3 School of EECS Seoul National University Introduction Low power design 3974/:.9 43 Increasing demand on performance and integrity of VLSI circuits Popularity of portable devices Low power
More informationFPGA Resource Utilization Estimates for NI crio LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008
FPGA Resource Utilization Estimates for NI crio-9104 LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 Note: The numbers presented in this document are estimates. Actual resource usage for your
More informationEECE 301 Signals & Systems
EECE 30 Signals & Systems Prof. Mark Fowler Note Set #22 D-T Systems: DT Systems & Difference Equations / D-T System Models So far we ve seen how to use the FT to analyze circuits Use phasors and standard
More informationDesign and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length
International Journal of Soft Computing and Engineering (IJSCE) Design and Study of Enhanced Parallel FIR Filter Using Various Adders for 16 Bit Length D.Ashok Kumar, P.Samundiswary Abstract Now a day
More informationCS 700: Quantitative Methods & Experimental Design in Computer Science
CS 700: Quantitative Methods & Experimental Design in Computer Science Sanjeev Setia Dept of Computer Science George Mason University Logistics Grade: 35% project, 25% Homework assignments 20% midterm,
More informationPipelining. Traditional Execution. CS 365 Lecture 12 Prof. Yih Huang. add ld beq CS CS 365 2
Pipelining CS 365 Lecture 12 Prof. Yih Huang CS 365 1 Traditional Execution 1 2 3 4 1 2 3 4 5 1 2 3 add ld beq CS 365 2 1 Pipelined Execution 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
More informationCOVER SHEET: Problem#: Points
EEL 4712 Midterm 3 Spring 2017 VERSION 1 Name: UFID: Sign here to give permission for your test to be returned in class, where others might see your score: IMPORTANT: Please be neat and write (or draw)
More informationFPGA Resource Utilization Estimates for NI PXI-7854R. LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008
FPGA Resource Utilization Estimates for NI PXI-7854R LabVIEW FPGA Version: 8.6 NI-RIO Version: 3.0 Date: 8/5/2008 Note: The numbers presented in this document are estimates. Actual resource usage for your
More information3.1 Symmetry & Coordinate Graphs
3.1 Symmetry & Coordinate Graphs I. Symmetry Point symmetry two distinct points P and P are symmetric with respect to point M if and only is M is the midpoint of PP' When the definition is extended to
More informationPetri nets. s 1 s 2. s 3 s 4. directed arcs.
Petri nets Petri nets Petri nets are a basic model of parallel and distributed systems (named after Carl Adam Petri). The basic idea is to describe state changes in a system with transitions. @ @R s 1
More informationLecture 11. Advanced Dividers
Lecture 11 Advanced Dividers Required Reading Behrooz Parhami, Computer Arithmetic: Algorithms and Hardware Design Chapter 15 Variation in Dividers 15.3, Combinational and Array Dividers Chapter 16, Division
More informationEEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture. Rajeevan Amirtharajah University of California, Davis
EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture Rajeevan Amirtharajah University of California, Davis Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top
More informationModels for representing sequential circuits
Sequential Circuits Models for representing sequential circuits Finite-state machines (Moore and Mealy) Representation of memory (states) Changes in state (transitions) Design procedure State diagrams
More informationPipelined Computations
Chapter 5 Slide 155 Pipelined Computations Pipelined Computations Slide 156 Problem divided into a series of tasks that have to be completed one after the other (the basis of sequential programming). Each
More informationCORDIC, Divider, Square Root
4// EE6B: VLSI Signal Processing CORDIC, Divider, Square Root Prof. Dejan Marković ee6b@gmail.com Iterative algorithms CORDIC Division Square root Lecture Overview Topics covered include Algorithms and
More informationUsing Mason's Rule to Analyze DSP Networks
Using Mason's Rule to Analyze DSP Networks by Richard Lyons, Besser Associates 1 α This article discusses a powerful analysis method (well known to our analog control system engineering brethren) to obtain
More informationWorst-Case Execution Time Analysis. LS 12, TU Dortmund
Worst-Case Execution Time Analysis Prof. Dr. Jian-Jia Chen LS 12, TU Dortmund 02, 03 May 2016 Prof. Dr. Jian-Jia Chen (LS 12, TU Dortmund) 1 / 53 Most Essential Assumptions for Real-Time Systems Upper
More informationOn the Complexity of Mapping Pipelined Filtering Services on Heterogeneous Platforms
On the Complexity of Mapping Pipelined Filtering Services on Heterogeneous Platforms Anne Benoit, Fanny Dufossé and Yves Robert LIP, École Normale Supérieure de Lyon, France {Anne.Benoit Fanny.Dufosse
More informationLogic BIST. Sungho Kang Yonsei University
Logic BIST Sungho Kang Yonsei University Outline Introduction Basics Issues Weighted Random Pattern Generation BIST Architectures Deterministic BIST Conclusion 2 Built In Self Test Test/ Normal Input Pattern
More informationTestability. Shaahin Hessabi. Sharif University of Technology. Adapted from the presentation prepared by book authors.
Testability Lecture 6: Logic Simulation Shaahin Hessabi Department of Computer Engineering Sharif University of Technology Adapted from the presentation prepared by book authors Slide 1 of 27 Outline What
More informationControl Systems, Lecture 05
Control Systems, Lecture 05 İbrahim Beklan Küçükdemiral Yıldız Teknik Üniversitesi 2015 1 / 33 Laplace Transform Solution of State Equations In previous sections, systems were modeled in state space, where
More informationDESIGN AND IMPLEMENTATION OF EFFICIENT HIGH SPEED VEDIC MULTIPLIER USING REVERSIBLE GATES
DESIGN AND IMPLEMENTATION OF EFFICIENT HIGH SPEED VEDIC MULTIPLIER USING REVERSIBLE GATES Boddu Suresh 1, B.Venkateswara Reddy 2 1 2 PG Scholar, Associate Professor, HOD, Dept of ECE Vikas College of Engineering
More informationImplementation of Discrete-Time Systems
EEE443 Digital Signal Processing Implementation of Discrete-Time Systems Dr. Shahrel A. Suandi PPKEE, Engineering Campus, USM Introduction A linear-time invariant system (LTI) is described by linear constant
More informationEE 144/244: Fundamental Algorithms for System Modeling, Analysis, and Optimization Fall 2016
EE 144/244: Fundamental Algorithms for System Modeling, Analysis, and Optimization Fall 2016 Discrete Event Simulation Stavros Tripakis University of California, Berkeley Stavros Tripakis (UC Berkeley)
More informationClocks in Asynchronous Systems
Clocks in Asynchronous Systems The Internet Network Time Protocol (NTP) 8 Goals provide the ability to externally synchronize clients across internet to UTC provide reliable service tolerating lengthy
More informationNetwork Flows. CTU FEE Department of control engineering. March 28, 2017
Network Flows Zdeněk Hanzálek, Přemysl Šůcha hanzalek@fel.cvut.cz CTU FEE Department of control engineering March 28, 2017 Z. Hanzálek (CTU FEE) Network Flows March 28, 2017 1 / 44 Table of contents 1
More informationDigital Electronics. Part A
Digital Electronics Final Examination Part A Winter 2004-05 Student Name: Date: lass Period: Total Points: Multiple hoice Directions: Select the letter of the response which best completes the item or
More informationNew Implementations of the WG Stream Cipher
New Implementations of the WG Stream Cipher Hayssam El-Razouk, Arash Reyhani-Masoleh, and Guang Gong Abstract This paper presents two new hardware designs of the WG-28 cipher, one for the multiple output
More informationA COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER. Jesus Garcia and Michael J. Schulte
A COMBINED 16-BIT BINARY AND DUAL GALOIS FIELD MULTIPLIER Jesus Garcia and Michael J. Schulte Lehigh University Department of Computer Science and Engineering Bethlehem, PA 15 ABSTRACT Galois field arithmetic
More informationChapter 2 Examples of Applications for Connectivity and Causality Analysis
Chapter 2 Examples of Applications for Connectivity and Causality Analysis Abstract Connectivity and causality have a lot of potential applications, among which we focus on analysis and design of large-scale
More informationClock-driven scheduling
Clock-driven scheduling Also known as static or off-line scheduling Michal Sojka Czech Technical University in Prague, Faculty of Electrical Engineering, Department of Control Engineering November 8, 2017
More informationAnalog and Telecommunication Electronics
Politecnico di Torino - ICT School Analog and Telecommunication Electronics D3 - A/D converters» Error taxonomy» ADC parameters» Structures and taxonomy» Mixed converters» Origin of errors 12/05/2011-1
More informationParallelism in Computer Arithmetic: A Historical Perspective
Parallelism in Computer Arithmetic: A Historical Perspective 21s 2s 199s 198s 197s 196s 195s Behrooz Parhami Aug. 218 Parallelism in Computer Arithmetic Slide 1 University of California, Santa Barbara
More informationECE317 : Feedback and Control
ECE317 : Feedback and Control Lecture : Steady-state error Dr. Richard Tymerski Dept. of Electrical and Computer Engineering Portland State University 1 Course roadmap Modeling Analysis Design Laplace
More informationPerformance Evaluation of Queuing Systems
Performance Evaluation of Queuing Systems Introduction to Queuing Systems System Performance Measures & Little s Law Equilibrium Solution of Birth-Death Processes Analysis of Single-Station Queuing Systems
More informationLow-Cost and Area-Efficient FPGA Implementations of Lattice-Based Cryptography
Low-Cost and Area-Efficient FPGA Implementations of Lattice-Based Cryptography Aydin Aysu, Cameron Patterson and Patrick Schaumont Electrical and Computer Engineering Department Virginia Tech Blacksburg,
More informationChapter 3: Block Diagrams and Signal Flow Graphs
Chapter 3: Block Diagrams and Signal Flow Graphs Farid Golnaraghi, Simon Fraser University Benjamin C. Kuo, University of Illinois ISBN: 978 0 470 04896 2 Introduction In this chapter, we discuss graphical
More information/ / MET Day 000 NC1^ INRTL MNVR I E E PRE SLEEP K PRE SLEEP R E
05//0 5:26:04 09/6/0 (259) 6 7 8 9 20 2 22 2 09/7 0 02 0 000/00 0 02 0 04 05 06 07 08 09 0 2 ay 000 ^ 0 X Y / / / / ( %/ ) 2 /0 2 ( ) ^ 4 / Y/ 2 4 5 6 7 8 9 2 X ^ X % 2 // 09/7/0 (260) ay 000 02 05//0
More informationCompiling for Parallelism & Locality. Example. Announcement Need to make up November 14th lecture. Last time Data dependences and loops
Complng for Parallelsm & Localty Announcement Need to make up November 14th lecture Last tme Data dependences and loops Today Fnsh data dependence analyss for loops CS553 Lecture Complng for Parallelsm
More information