ECE429 Introduction to VLSI Design

Similar documents
Lecture 6: Logical Effort

EE 447 VLSI Design. Lecture 5: Logical Effort

Introduction to CMOS VLSI Design. Lecture 5: Logical Effort. David Harris. Harvey Mudd College Spring Outline

VLSI Design, Fall Logical Effort. Jacob Abraham

Lecture 8: Combinational Circuit Design

Logical Effort. Sizing Transistors for Speed. Estimating Delays

Logical Effort: Designing for Speed on the Back of an Envelope David Harris Harvey Mudd College Claremont, CA

Very Large Scale Integration (VLSI)

Lecture 8: Logic Effort and Combinational Circuit Design

Introduction to CMOS VLSI Design. Logical Effort B. Original Lecture by Jay Brockman. University of Notre Dame Fall 2008

7. Combinational Circuits

Lecture 5. Logical Effort Using LE on a Decoder

EE M216A.:. Fall Lecture 5. Logical Effort. Prof. Dejan Marković

Lecture 7: Multistage Logic Networks. Best Number of Stages

Lecture 8: Combinational Circuits

Logic effort and gate sizing

Lecture 8: Combinational Circuits

Logic Gate Sizing. The method of logical effort. João Canas Ferreira. March University of Porto Faculty of Engineering

Lecture 9: Combinational Circuits

C.K. Ken Yang UCLA Courtesy of MAH EE 215B

Chapter 4. Digital Integrated Circuit Design I. ECE 425/525 Chapter 4. CMOS design can be realized meet requirements from

Interconnect (2) Buffering Techniques. Logical Effort

Lecture 9: Combinational Circuit Design

Lecture 14: Circuit Families

CPE/EE 427, CPE 527 VLSI Design I L13: Wires, Design for Speed. Course Administration

Estimating Delays. Gate Delay Model. Gate Delay. Effort Delay. Computing Logical Effort. Logical Effort

Lecture 6: Circuit design part 1

Logical Effort EE141

EE115C Digital Electronic Circuits Homework #5

EE M216A.:. Fall Lecture 4. Speed Optimization. Prof. Dejan Marković Speed Optimization via Gate Sizing

Integrated Circuits & Systems

EE141. Administrative Stuff

Digital Integrated Circuits A Design Perspective

Lecture 12: Datapath Functional Units

Announcements. EE141-Spring 2007 Digital Integrated Circuits. CMOS SRAM Analysis (Read/Write) Class Material. Layout. Read Static Noise Margin

Lecture 4: DC & Transient Response

EECS 151/251A Homework 5

Lecture 34: Portable Systems Technology Background Professor Randy H. Katz Computer Science 252 Fall 1995

Lecture 12: Datapath Functional Units

9. Datapath Design. Jacob Abraham. Department of Electrical and Computer Engineering The University of Texas at Austin VLSI Design Fall 2017

CMPEN 411. Spring Lecture 18: Static Sequential Circuits

EE141-Fall 2011 Digital Integrated Circuits

ALUs and Data Paths. Subtitle: How to design the data path of a processor. 1/8/ L3 Data Path Design Copyright Joanne DeGroat, ECE, OSU 1

University of Toronto. Final Exam

Digital Microelectronic Circuits ( ) Logical Effort. Lecture 7: Presented by: Adam Teman

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Additional Gates COE 202. Digital Logic Design. Dr. Muhamed Mudawar King Fahd University of Petroleum and Minerals

Sequential Logic Worksheet

Lecture 18: Datapath Functional Units

CMPEN 411 VLSI Digital Circuits Spring Lecture 21: Shifters, Decoders, Muxes

and V DS V GS V T (the saturation region) I DS = k 2 (V GS V T )2 (1+ V DS )

Clock signal in digital circuit is responsible for synchronizing the transfer to the data between processing elements.

EE371 - Advanced VLSI Circuit Design

Simulation of Logic Primitives and Dynamic D-latch with Verilog-XL

Lecture 1: Gate Delay Models

Lecture 5: DC & Transient Response

Lecture 12 CMOS Delay & Transient Response

Static CMOS Circuits. Example 1

Spiral 2 7. Capacitance, Delay and Sizing. Mark Redekopp

Lecture 6: DC & Transient Response

For smaller NRE cost For faster time to market For smaller high-volume manufacturing cost For higher performance

EE141-Fall 2012 Digital Integrated Circuits. Announcements. Homework #3 due today. Homework #4 due next Thursday EECS141 EE141

CMPEN 411 VLSI Digital Circuits Spring 2012 Lecture 17: Dynamic Sequential Circuits And Timing Issues

CSE370 HW3 Solutions (Winter 2010)

ECE 425 Midterm Overview. Fall 2017

EECS 312: Digital Integrated Circuits Final Exam Solutions 23 April 2009

Digital Integrated Circuits Lecture 5: Logical Effort

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

! Crosstalk. ! Repeaters in Wiring. ! Transmission Lines. " Where transmission lines arise? " Lossless Transmission Line.

Advanced VLSI Design Prof. A. N. Chandorkar Department of Electrical Engineering Indian Institute of Technology- Bombay

EE241 - Spring 2000 Advanced Digital Integrated Circuits. Announcements

EE115C Digital Electronic Circuits Homework #6

EECS 151/251A Spring 2018 Digital Design and Integrated Circuits. Instructors: Nick Weaver & John Wawrzynek. Lecture 12 EE141

EEC 216 Lecture #2: Metrics and Logic Level Power Estimation. Rajeevan Amirtharajah University of California, Davis

Area-Time Optimal Adder with Relative Placement Generator

6.884 Complex Digital Systems Spring Quiz - March 18, Minutes

Floating Point Representation and Digital Logic. Lecture 11 CS301

CPE/EE 427, CPE 527 VLSI Design I L18: Circuit Families. Outline

Digital Logic. Lecture 5 - Chapter 2. Outline. Other Logic Gates and their uses. Other Logic Operations. CS 2420 Husain Gholoom - lecturer Page 1

EECS 427 Lecture 8: Adders Readings: EECS 427 F09 Lecture 8 1. Reminders. HW3 project initial proposal: due Wednesday 10/7

Slide Set 6. for ENEL 353 Fall Steve Norman, PhD, PEng. Electrical & Computer Engineering Schulich School of Engineering University of Calgary

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Lecture 4. Adders. Computer Systems Laboratory Stanford University

EC-121 Digital Logic Design

EE141Microelettronica. CMOS Logic

! Charge Leakage/Charge Sharing. " Domino Logic Design Considerations. ! Logic Comparisons. ! Memory. " Classification. " ROM Memories.

CARNEGIE MELLON UNIVERSITY DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING DIGITAL INTEGRATED CIRCUITS FALL 2002

Lecture 16: Circuit Pitfalls

Problem Set 4 Solutions

EE 330 Lecture 39. Digital Circuits. Propagation Delay basic characterization Device Sizing (Inverter and multiple-input gates)

LECTURE 2: Delay models, std_ulogic and. EECS 316 CAD Computer Aided Design. with-select-when. Chris Papachristou Case Western Reserve University

Homework Assignment #1 Solutions EE 477 Spring 2017 Professor Parker

EE115C Winter 2017 Digital Electronic Circuits. Lecture 19: Timing Analysis

EE5780 Advanced VLSI CAD

Lecture Outline. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. Review: 1st Order RC Delay Models. Review: Two-Input NOR Gate (NOR2)

Chapter 9. Estimating circuit speed. 9.1 Counting gate delays

Arithmetic Circuits-2

ECE 545 Digital System Design with VHDL Lecture 1. Digital Logic Refresher Part A Combinational Logic Building Blocks

Transcription:

ECE429 Introduction to VLSI Design Lecture 5: LOGICAL EFFORT Erdal Oruklu Illinois Institute of Technology Some of these slides have been adapted from the slides provided by David Harris, Harvey Mudd College

Outline Introduction Delay in a Logic Gate Multistage Logic Networks Choosing the Best Number of Stages Example Summary 5-2

Introduction Chip designers face a bewildering array of choices What is the best circuit topology for a function? How many stages of logic give least delay? How wide should the transistors be? Logical effort is a method to make these decisions Uses a simple model of delay Helps make rapid comparisons between alternatives Emphasizes remarkable symmetries 5-3

Delay in a Logic Gate Express delays in process-independent unit d = d abs τ τ = 3RC 12 ps in 180 nm process 40 ps in 0.6 μm process 5-4

Delay in a Logic Gate Express delays in process-independent unit d = d abs τ Delay has two components d = f + p 5-5

Delay in a Logic Gate Express delays in process-independent unit d = Delay has two components d d abs τ = f + p Effort delay f = gh (a.k.a. stage effort) Again has two components 5-6

Delay in a Logic Gate Express delays in process-independent unit d = d abs τ Delay has two components d = f + p Effort delay f = gh (a.k.a. stage effort) Again has two components g: logical effort Measures relative ability of gate to deliver current g 1 for inverter 5-7

Delay in a Logic Gate Express delays in process-independent unit d = d abs τ Delay has two components d = f + p Effort delay f = gh (a.k.a. stage effort) Again has two components h: electrical effort = C out / C in Ratio of output to input capacitance Sometimes called fanout 5-8

Delay in a Logic Gate Express delays in process-independent unit d = Delay has two components d d abs τ = f + p Parasitic delay p Represents delay of gate driving no load Set by internal parasitic capacitance 5-9

Delay Plots d = f + p = gh + p Normalized Delay: d 6 5 4 3 2 2-input NAND g = p = d = Inverter g = p = d = 1 0 0 1 2 3 4 5 Electrical Effort: h = C out / C in 5-10

Delay Plots d = f + p = gh + p What about NOR2? Normalized Delay: d 6 5 4 3 2 2-input NAND Inverter g = 1 p = 1 d = h + 1 Effort Delay: f g = 4/3 p = 2 d = (4/3)h + 2 1 Parasitic Delay: p 0 0 1 2 3 4 5 Electrical Effort: h = C out / C in 5-11

Computing Logical Effort DEF: Logical effort is the ratio of the input capacitance of a gate to the input capacitance of an inverter delivering the same output current. Measure from delay vs. fanout plots Or estimate by counting transistor widths A 2 1 Y A B 2 2 2 2 Y A B 4 4 1 1 Y C in = 3 g = 3/3 C in = 4 g = 4/3 C in = 5 g = 5/3 5-12

Catalog of Gates Logical effort of common gates Gate type Number of inputs 1 2 3 4 n Inverter 1 NAND 4/3 5/3 6/3 (n+2)/3 NOR 5/3 7/3 9/3 (2n+1)/3 Tristate/mux 2 2 2 2 2 XOR, XNOR 4, 4 6, 12, 6 8, 16,16, 8 5-13

Catalog of Gates Parasitic delay of common gates In multiples of p inv ( 1) Gate type Number of inputs 1 2 3 4 n Inverter 1 NAND 2 3 4 n NOR 2 3 4 n Tristate / mux 2 4 6 8 2n XOR, XNOR 4 6 8 5-14

Example: Ring Oscillator Estimate the frequency of an N-stage ring oscillator Logical Effort: g = Electrical Effort: h = Parasitic Delay: p = Stage Delay: d = Frequency: f osc = 5-15

Example: Ring Oscillator Estimate the frequency of an N-stage ring oscillator 31 stage ring oscillator in 0.6 μm process has frequency of ~ 200 MHz Logical Effort: g = 1 Electrical Effort: h = 1 Parasitic Delay: p = 1 Stage Delay: d = 2 Frequency: f osc = 1/(2*N*d) = 1/4N 5-16

Example: FO4 Inverter Estimate the delay of a fanout-of-4 (FO4) inverter d Logical Effort: g = Electrical Effort: h = Parasitic Delay: p = Stage Delay: d = 5-17

Example: FO4 Inverter Estimate the delay of a fanout-of-4 (FO4) d inverter The FO4 delay is about 200 ps in 0.6 μm process Logical Effort: g = 1 Electrical Effort: h = 4 Parasitic Delay: p = 1 Stage Delay: d = 5 60 ps in a 180 nm process f/3 ns in an f μm process 5-18

Multistage Logic Networks Logical effort generalizes to multistage networks Path Logical Effort Path Electrical Effort G = g i Cout-path H = C in-path Path Effort F = fi = gh i i 10 g 1 = 1 h 1 = x/10 x g 2 = 5/3 h 2 = y/x y g 3 = 4/3 h 3 = z/y z g 4 = 1 h 4 = 20/z 20 5-19

Multistage Logic Networks Logical effort generalizes to multistage networks Path Logical Effort Path Electrical Effort G = g i Cout path H = C in path Path Effort F = fi = gh i i Can we write F = GH? 5-20

Paths that Branch No! Consider paths that branch: G = H = GH = h 1 = h 2 = F = GH? 5 15 15 90 90 5-21

Paths that Branch No! Consider paths that branch: G = 1 H = 90 / 5 = 18 GH = 18 h 1 = (15 +15) / 5 = 6 h 2 = 90 / 15 = 6 F = g 1 g 2 h 1 h 2 = 36 = 2GH 5 15 15 90 90 5-22

Branching Effort Introduce branching effort Accounts for branching between stages in path b = C Now we compute the path effort F = GBH on path B= b i C + C on path off path Note: h i = BH 5-23

Multistage Delays Path Effort Delay D F = f i Path Parasitic Delay Path Delay P= p i = i = F + D d D P 5-24

Designing Fast Circuits = i = F + D d D P Delay is smallest when each stage bears same effort fˆ = gh = F i i 1 N Thus minimum delay of N stage path is 1 N D= NF + P This is a key result of logical effort Find fastest possible delay Doesn t require calculating gate sizes 5-25

Gate Sizes How wide should the gates be for least delay? fˆ = gh= g C = in i C C out in gc i fˆ out i Working backward, apply capacitance transformation to find input capacitance of each gate given load it drives. Check work by verifying input cap spec is met. 5-26

Example: 3-stage path Select gate sizes x and y for least delay from A to B x x y 45 A 8 x y B 45 5-27

Example: 3-stage path x Logical Effort G = Electrical Effort H = Branching Effort B = Path Effort F = Best Stage Effort x A 8 x ˆf = Parasitic Delay P = Delay D = y y 45 B 45 5-28

Example: 3-stage path x A 8 x x y y 45 B 45 Logical Effort G = (4/3)*(5/3)*(5/3) = 100/27 Electrical Effort H = 45/8 Branching Effort B = 3 * 2 = 6 Path Effort F = GBH = 125 Best Stage Effort ˆ 3 F 5 f = = Parasitic Delay P = 2 + 3 + 2 = 7 Delay D = 3*5 + 7 = 22 = 4.4 FO4 5-29

Example: 3-stage path Work backward for sizes y = x = x x y 45 A 8 x y B 45 5-30

Example: 3-stage path Work backward for sizes y = 45 * (5/3) / 5 = 15 x = (15*2) * (5/3) / 5 = 10 45 A P: 4 N: 4 P: 4 N: 6 P: 12 N: 3 B 45 5-31

Best Number of Stages How many stages should a path use? Minimizing number of stages is not always fastest Example: drive 64-bit datapath with unit inverter Initial Driver 1 1 1 1 D = Datapath Load N: f: D: 64 64 64 64 1 2 3 4 5-32

Best Number of Stages How many stages should a path use? Minimizing number of stages is not always fastest Example: drive 64-bit datapath with unit inverter Initial Driver 1 1 1 1 8 4 2.8 D = NF 1/N + P = N(64) 1/N + N Datapath Load 16 8 23 64 64 64 64 N: f: D: 1 64 65 2 8 18 3 4 15 Fastest 4 2.8 15.3 5-33

Derivation Consider adding inverters to end of path How many give least delay? = n 1 1 N + i + i= 1 D NF p N n p D N 1 1 1 N N N ( ) Define best stage effort 1 inv = F ln F + F + p = 0 inv p + ρ 1 lnρ = 0 inv ( ) 1 ρ = F N Logic Block: n 1 Stages Path Effort F N - n 1 Extra Inverters 5-34

Best Stage Effort p + ρ 1 lnρ = 0 inv solution ( ) has no closed-form Neglecting parasitics (p inv = 0), we find ρ = 2.718 (e) For p inv = 1, solve numerically for ρ = 3.59 5-35

Sensitivity Analysis How sensitive is delay to using exactly the best number of stages? 1.6 1.51 D(N) /D(N) 1.4 1.26 1.2 1.15 1.0 (ρ=6) (ρ =2.4) 0.0 0.5 0.7 1.0 1.4 2.0 2.4 < ρ < 6 gives delay within 15% of optimal Use ρ = 4 N / N 5-36

Example How do we design an address decoder? Decoder specifications: 16 word register file Each word is 32 bits wide Each bit presents load of 3 unit-sized transistors True and complementary address inputs A[3:0] Each input may drive 10 unit-sized transistors A[3:0] A[3:0] Decisions: How many stages to use? How large should each gate be? How fast can decoder operate? 4:16 Decoder 16 32 bits Register File 16 words 5-37

Number of Stages Decoder effort is mainly electrical and branching Electrical Effort: H = Branching Effort: B = If we neglect logical effort (assume G = 1) Path Effort: F = Number of Stages: N = 5-38

Number of Stages Decoder effort is mainly electrical and branching Electrical Effort: H = (32*3) / 10 = 9.6 Branching Effort: B = 8 If we neglect logical effort (assume G = 1) Path Effort: F = GBH = 76.8 Number of Stages: N = log 4 F = 3.1 Try a 3-stage design 5-39

Gate Sizes & Delay Logical Effort: G = Path Effort: F = Stage Effort: Path Delay: ˆf = D = Gate sizes: z = y = A[3] A[3] A[2] A[2] A[1] A[1] A[0] A[0] 10 10 10 10 10 10 10 10 y z word[0] 96 units of wordline capacitance y z word[15] 5-40

Gate Sizes & Delay Logical Effort: G = 1 * 6/3 * 1 = 2 Path Effort: F = GBH = 154 Stage Effort: Path Delay: Gate sizes: z = 96*1/5.36 = 18 y = 18*2/5.36 = 6.7 A[3] A[3] A[2] A[2] A[1] A[1] A[0] A[0] 10 10 10 10 10 10 10 10 ˆ 1/3 f = F = 5.36 D = 3 fˆ + 1+ 4 + 1 = 22.1 y z word[0] 96 units of wordline capacitance y z word[15] 5-41

Comparison Compare many alternatives with a spreadsheet Design N G P D NAND4-INV 2 2 5 29.8 NAND2-NOR2 2 20/9 4 30.1 INV-NAND4-INV 3 2 6 22.1 NAND4-INV-INV-INV 4 2 7 21.1 NAND2-NOR2-INV-INV 4 20/9 6 20.5 NAND2-INV-NAND2-INV 4 16/9 6 19.7 INV-NAND2-INV-NAND2-INV 5 16/9 7 20.4 NAND2-INV-NAND2-INV-INV-INV 6 16/9 8 21.6 5-42

Review of Definitions Term number of stages logical effort electrical effort branching effort effort effort delay parasitic delay delay Stage 1 g h = b = f f p = C C C out in on-path gh + C C on-path d = f + p off-path Path N G = g i H = C C out-path in-path B = b i F = GBH D F = f i P = p i D = d = D + P i F 5-43

Method of Logical Effort 1) Compute path effort 2) Estimate best number of stages 3) Sketch path with N stages 4) Estimate least delay 5) Determine best stage effort 6) Find gate sizes F = GBH N = log 4 F 1 N D= NF + P 1 ˆ N f = F C in i = gic fˆ out i 5-44

Limits of Logical Effort Chicken and egg problem Need path to compute G But don t know number of stages without G Simplistic delay model Neglects input rise time effects Interconnect Iteration required in designs with wire Maximum speed only Not minimum area/power for constrained delay 5-45

Summary Logical effort is useful for thinking of delay in circuits Numeric logical effort characterizes gates NANDs are faster than NORs in CMOS Paths are fastest when effort delays are ~4 Path delay is weakly sensitive to stages, sizes But using fewer stages doesn t mean faster paths Delay of path is about log 4 F FO4 inverter delays Inverters and NAND2 best for driving large caps Provides language for discussing fast circuits But requires practice to master 5-46