Digital Integrated Circuits A Design Perspective

Similar documents
Digital EE141 Integrated Circuits 2nd Combinational Circuits

Digital Integrated Circuits A Design Perspective

9/18/2008 GMU, ECE 680 Physical VLSI Design

EE141. Administrative Stuff

Digital Integrated Circuits A Design Perspective

Properties of CMOS Gates Snapshot

EE141Microelettronica. CMOS Logic

COMBINATIONAL LOGIC. Combinational Logic

Digital Integrated Circuits A Design Perspective

Digital Integrated Circuits A Design Perspective

COMP 103. Lecture 16. Dynamic Logic

EECS 141 F01 Lecture 17

Integrated Circuits & Systems

Digital Integrated Circuits Designing Combinational Logic Circuits. Fuyuzhuo

Pass-Transistor Logic

Based on slides/material by. Topic 3-4. Combinational Logic. Outline. The CMOS Inverter: A First Glance

CMPEN 411 VLSI Digital Circuits Spring 2011 Lecture 07: Pass Transistor Logic

Miscellaneous Lecture topics. Mary Jane Irwin [Adapted from Rabaey s Digital Integrated Circuits, 2002, J. Rabaey et al.]

Topics. Dynamic CMOS Sequential Design Memory and Control. John A. Chandy Dept. of Electrical and Computer Engineering University of Connecticut

CMOS Digital Integrated Circuits Lec 10 Combinational CMOS Logic Circuits

CMOS logic gates. João Canas Ferreira. March University of Porto Faculty of Engineering

EEC 118 Lecture #6: CMOS Logic. Rajeevan Amirtharajah University of California, Davis Jeff Parkhurst Intel Corporation

ΗΜΥ 307 ΨΗΦΙΑΚΑ ΟΛΟΚΛΗΡΩΜΕΝΑ ΚΥΚΛΩΜΑΤΑ Εαρινό Εξάμηνο 2018

CPE/EE 427, CPE 527 VLSI Design I L18: Circuit Families. Outline

Announcements. EE141-Spring 2007 Digital Integrated Circuits. CMOS SRAM Analysis (Read/Write) Class Material. Layout. Read Static Noise Margin

CPE/EE 427, CPE 527 VLSI Design I Pass Transistor Logic. Review: CMOS Circuit Styles

EEC 116 Lecture #5: CMOS Logic. Rajeevan Amirtharajah Bevan Baas University of California, Davis Jeff Parkhurst Intel Corporation

EE 466/586 VLSI Design. Partha Pande School of EECS Washington State University

Static CMOS Circuits

CPE/EE 427, CPE 527 VLSI Design I L07: CMOS Logic Gates, Pass Transistor Logic. Review: CMOS Circuit Styles

Static CMOS Circuits. Example 1

ENEE 359a Digital VLSI Design

CPE/EE 427, CPE 527 VLSI Design I Delay Estimation. Department of Electrical and Computer Engineering University of Alabama in Huntsville

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Sciences

5.0 CMOS Inverter. W.Kucewicz VLSICirciuit Design 1

Dynamic Combinational Circuits. Dynamic Logic

THE INVERTER. Inverter

The CMOS Inverter: A First Glance

Digital Integrated Circuits A Design Perspective

B.Supmonchai August 1st, q In-depth discussion of CMOS logic families. q Optimizing gate metrics. q High Performance circuit-design techniques

Hw 6 due Thursday, Nov 3, 5pm No lab this week

CPE/EE 427, CPE 527 VLSI Design I L06: CMOS Inverter, CMOS Logic Gates. Course Administration. CMOS Inverter: A First Look

MOSFET and CMOS Gate. Copy Right by Wentai Liu

Lecture 4: CMOS review & Dynamic Logic

The CMOS Inverter: A First Glance

Topics to be Covered. capacitance inductance transmission lines

Integrated Circuits & Systems

Interconnect (2) Buffering Techniques. Logical Effort

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Lecture 6: Circuit design part 1

CMOS Inverter (static view)

CHAPTER 15 CMOS DIGITAL LOGIC CIRCUITS

CMPEN 411 VLSI Digital Circuits. Lecture 04: CMOS Inverter (static view)

Power Dissipation. Where Does Power Go in CMOS?

Digital Integrated Circuits 2nd Inverter

Dynamic Combinational Circuits. Dynamic Logic

CMOS Inverter: CPE/EE 427, CPE 527 VLSI Design I L06: CMOS Inverter, CMOS Logic Gates. Course Administration. CMOS Properties.

Logical Effort: Designing for Speed on the Back of an Envelope David Harris Harvey Mudd College Claremont, CA

L2: Combinational Logic Design (Construction and Boolean Algebra)

ECE 546 Lecture 10 MOS Transistors

Digital Integrated Circuits A Design Perspective

Spiral 2 7. Capacitance, Delay and Sizing. Mark Redekopp

EECS 427 Lecture 8: Adders Readings: EECS 427 F09 Lecture 8 1. Reminders. HW3 project initial proposal: due Wednesday 10/7

Lecture 14: Circuit Families

Today s lecture. EE141- Spring 2003 Lecture 4. Design Rules CMOS Inverter MOS Transistor Model

CARNEGIE MELLON UNIVERSITY DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING DIGITAL INTEGRATED CIRCUITS FALL 2002

EEE 421 VLSI Circuits

ECE 438: Digital Integrated Circuits Assignment #4 Solution The Inverter

ENGR890 Digital VLSI Design Fall Lecture 4: CMOS Inverter (static view)

Dynamic operation 20

Arithmetic Building Blocks

2007 Fall: Electronic Circuits 2 CHAPTER 10. Deog-Kyoon Jeong School of Electrical Engineering

VLSI Design, Fall Logical Effort. Jacob Abraham

Lecture 8: Combinational Circuit Design

CMOS INVERTER. Last Lecture. Metrics for qualifying digital circuits. »Cost» Reliability» Speed (delay)»performance

Logic Gate Sizing. The method of logical effort. João Canas Ferreira. March University of Porto Faculty of Engineering

Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. November Digital Integrated Circuits 2nd Sequential Circuits

MODULE 5 Chapter 7. Clocked Storage Elements

The Inverter. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic

Integrated Circuits & Systems

EE115C Digital Electronic Circuits Homework #6

Semiconductor Memories

Chapter 5. The Inverter. V1. April 10, 03 V1.1 April 25, 03 V2.1 Nov Inverter

L2: Combinational Logic Design (Construction and Boolean Algebra)

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

VLSI Design I; A. Milenkovic 1

CMOS Digital Integrated Circuits Lec 13 Semiconductor Memories

C.K. Ken Yang UCLA Courtesy of MAH EE 215B

ENEE 359a Digital VLSI Design

VLSI Design I; A. Milenkovic 1

Very Large Scale Integration (VLSI)

Digital Integrated Circuits A Design Perspective

Digital Integrated Circuits A Design Perspective. Semiconductor. Memories. Memories

MODULE III PHYSICAL DESIGN ISSUES

EE 466/586 VLSI Design. Partha Pande School of EECS Washington State University

L2: Combinational Logic Design (Construction and Boolean Algebra)

Lecture 8-1. Low Power Design

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

DC and Transient Responses (i.e. delay) (some comments on power too!)

Announcements. EE141- Spring 2003 Lecture 8. Power Inverter Chain

Transcription:

Digital Integrated Circuits Design Perspective Jan M. Rabaey nantha Chandrakasan orivoje Nikolić Designing Combinational Logic Circuits November 2002. 1

Views / bstractions / Hierarchies ehavioral Structural device Today s view Physical Circuit Logic rchitectural D.Gajski, Silicon Compilation, ddison Wesley, 1988 2

Design Technologies 3

Overview Static CMOS Conventional Static CMOS Logic Ratioed Logic Pass Transistor/Transmission Gate Logic Dynamic CMOS Logic Domino np-cmos 4

Combinational vs. Sequential Logic In Combinational Logic Circuit Out In Combinational Logic Circuit Out State Combinational Sequential Output = f(in) Output = f(in, Previous In) 5

The asic Idea Voltage on the Gate controls the current through the source/drain path N-Channel - N-Switches are ON when the Gate is HIGH and OFF when the Gate is LOW P-Channel - P-Switches are OFF when the Gate is HIGH and ON when the Gate is LOW (ON == Circuit between Source and Drain) 6

Transistors as Switches N Switch G D S 0 1 Passes good zeros P Switch G D S 0 1 Passes good ones 7

.The Rest of the Story... Put them in series - both must be on to complete the circuit Put them in parallel - either can be on to complete the circuit Generate all sorts of Switching Functions NOT the same as oolean Functions... Its RELY logic - pin ball machines 8

Series Parallel Structures 1 1 G G D S S 1 1 D G G S S D D N Channel: on=closed when gate is high 9

NMOS Transistors in Series/Parallel Connection Transistors can be thought as a switch controlled by its gate signal NMOS switch closes when switch control input is high X Y Y = X if and X Y Y = X if OR NMOS Transistors pass a strong 0 but a weak 1 10 EE141 Digital Integrated Circuits Circuits 2nd Introduction Prentice Hall 1995 Combinational Circuits

Series Parallel Structures(2) 0 0 G G D S S 0 0 D G G S S D D P Channel: on=closed when gate is low 11

PMOS Transistors in Series/Parallel Connection PMOS switch closes when switch control input is low X Y Y = X if ND = + X Y Y = X if OR = PMOS Transistors pass a strong 1 but a weak 0 12 EE141 Digital Integrated Circuits Circuits 2nd Introduction Prentice Hall 1995 Combinational Circuits

That s s it! This is Non-Trivial: it defines the basis for the logic abstraction which is essential for all oolean functions. Provide a path to VDD for 1 Provide a path to GND for 0 For complex functions - provide complex paths 13

From Switches to oolean Functions... Use the Switching Functions to provide paths to Vdd or GND Vdd is the source of all Truth (Vdd = = 1) GND is the source of all Falsehood (GND == 0) P-channel N-channel 0 0 1 1 14

The Inverter True to False / False to True Converter 1/0 0/1 15

Static CMOS Circuit t every point in time (except during the switching transients) each gate output is connected to either V DD or V ss via a low-resistive path. The outputs of the gates assume at all times the value of the oolean function, implemented by the circuit (ignoring, once again, the transient effects during switching periods). This is in contrast to the dynamic circuit class, which relies on temporary storage of signal values on the capacitance of high impedance circuit nodes. 16

The Exceptions Many interesting and useful circuits which are not fully complementary CMOS Pass Gates (Transmission Gates) Level shifters, etc. Even more interesting and useless circuits! New circuit styles keep being invented 17

Threshold Drops PUN V DD S V DD D V DD D 0 V DD V GS S 0 V DD -V Tn C L C L PDN V DD 0 V DD V Tp V DD D C L V GS S C L S D 18

Series Parallel Structures (3) N Switch G S S D 0 1 Passes good zeros D S G S P Switch 1 Open Circuit, High Z 0 i-directional Switch Passes good ones 19

Static Complementary CMOS V DD In1 In2 InN In1 In2 InN PUN PDN PMOS only NMOS only F(In1,In2, InN) PUN and PDN are dual logic networks 20

Complementary CMOS Logic Style 21

Example Gate: NND 22

Physical Layout in MX of 2-input 2 NND Gate Vdd! Cout 23

4-input NND Gate Vdd V DD In 1 In 2 In 3 In 4 In 1 Out In 2 Out In 3 In 4 GND In1 In2 In3 In4 24 EE141 Digital Integrated Circuits Circuits 2nd Introduction Prentice Hall 1995 Combinational Circuits

Example Gate: NOR 25

Physical Layout in MX of 2-input 2 NOR Gate 26

Constructing a Complex Gate V DD V DD C D C F SN1 D F C SN4 SN2 SN3 D F (a) pull-down network (b) Deriving the pull-up network hierarchically by identifying sub-nets D C (c) complete gate 27

Complex CMOS Gate C D D C OUT = D + ( + C) 28

Cell Design Standard Cells General purpose logic Can be synthesized Same height, varying width Structured rray Cells Tiling structure with multiplicative parameters Programmed with vias Datapath Cells For regular, structured designs (arithmetic) Includes some wiring in the cell Fixed height and width 29

Standard Cell Layout Methodology 1980s Routing channel V DD signals GND 30

Standard Cell Layout Methodology 1990s Mirrored Cell No Routing channels V DD V DD M2 M3 GND Mirrored Cell GND 31

Inside out Nand / Nor Gates Which is which? 32

Standard Cell Layout Methodology metal1 V DD Well V SS signals Routing Channel polysilicon 33 EE141 Digital Integrated Circuits Circuits 2nd Introduction Prentice Hall 1995 Combinational Circuits

Standard Cells N Well V DD Cell height 12 metal tracks Metal track is approx. 3λ + 3λ Pitch = repetitive distance between objects Cell height is 12 pitch 2λ In Out Cell boundary GND Rails ~10λ 34

Standard Cells With minimal diffusion routing V DD With silicided diffusion V DD V DD In M 2 Out In Out In Out M 1 GND GND 35

Standard Cells V DD 2-input NND gate V DD Out GND Note: well and substrate Contacts prevent external routing in poly 36

Multi-Fingered Transistors One finger Two fingers (folded) Less diffusion capacitance 37

Stick Diagrams Contains no dimensions Represents relative positions of transistors Inverter V DD NND2 V DD Out Out GND In GND 38

Stick Diagrams j C Logic Graph X C PUN X = C ( + ) X i V DD C i j C GND PDN 39

Two Versions of C ( + ) C C V DD V DD X X GND GND 40

Consistent Euler Path X C X i V DD j GND C 41

OI22 Logic Graph C X PUN D D C X = (+) (C+D) X V DD C D C D GND PDN 42

Example: x = ab+cd x x b c b c x V DD x V DD a d a d GND GND (a) Logic graphs for (ab+cd) (b) Euler Paths {a b c d} V DD x GND a b c d (c) stick diagram for ordering {a b c d} 43

CMOS Properties Full rail-to-rail swing; high noise margins Logic levels not dependent upon the relative device sizes; ratioless lways a path to Vdd or Gnd in steady state; low output impedance Extremely high input resistance; nearly zero steady-state input current No direct path steady state between power and ground; no static power dissipation Propagation delay function of load capacitance and resistance of transistors 44

Complex Gate Structures nd-or-invert (OI) Vdd C C Out Out = +(*C)... C How to add terms? Gnd 45

OI/OI Duality Or-nd-Invert (OI) Vdd C C Demorgan s Law in ction Switch from: Out = +(*C)... Out To: C Out = *(+C)... Gnd 46

Demorgan s Law in ction Or-nd-Invert (OI) Vdd C C Out = *(+C)... C Out Gnd 47

C Vdd Gnd Out Demorgan s Law in ction Or-nd-Invert (OI) C C Out = *(+C)... 48

Demorgan s Law in ction Or-nd-Invert (OI) C Gnd C Out = *(+C)... Out C Vdd 49

Demorgan s Law in ction Or-nd-Invert (OI) Gnd C C Out = *(+C)... Out C Vdd 50

Demorgan s Law in ction Or-nd-Invert (OI) Vdd C C Out = *(+C)... Out C What is the Magic command to do this? Gnd 51

Complex (OI/OI) Gates 52

Schematic Representation in SUE of OI (and-or or-invert) Gate Notice 6 transistors 53

Physical Layout in MX of OI Gate C Vdd! OUT C OUT C 54

Schematic Representation in SUE of OI (or-and and-invert) 55

Physical Layout in MX of OI Gate C OUT Vdd! C OUT C 56

Quiz 57

Step by Step Layout of XNOR Gate The equation for XNOR is: f = (a * b) + (a' * b') using DeMorgan's law on each of the two terms gives: f = (a'+ b')' + (a + b)' using DeMorgan's law on the two terms together gives: f = ((a'+ b') * (a + b))' This could be directly implemented with a single complementary CMOS gate: the equation is in a simple negated product of sums form. This form can be implemented with the standard Or-nd-Invert (OI) style gate. 58

Non-Inverted Inputs However, using DeMorgan's law one more time on the left term gives: f = ((a * b)' * (a + b)) a b This form uses no inverted inputs and can be implemented with two gates a NND gate and an OI gate. f 59

Now lets lay it out Start with Vdd! and GND! power buses. Without any more information, about the use of this cell, make the power and ground lines in metal 1 sized 3 and 3 apart. Use poly as inputs and guess that C might be used. 60

XOR from NOR/OI 61

62

XOR Gate 63

Irsim 64

XOR GTE 65

3.5um x 10um 66

.35um x 1um 67

Lab 2: Full dder Sum = xor xor C Cout = + C + C expand sum Sum = C+ C + C + C (exactly 1 or 3 inputs true) use Cout to help generate Sum Sum = C + Cout (++Cin) 68

Full dder (4 gates) 69

Full dder (4 gates) 70

71

72

73

One Solution (125x136) 74

Is this standard cell design? 75

Lab 3: 8 it Ripple Carry dder 76

X-panded 77

MS Flip Flop Is this edge triggered? Is any flip flop? I could do this with three (3) CMOS gates! (could you?) 78

MS Register it How many metals? How are they used? Why? 79

irsim Simulation 80

H Spice Simulation 81

82

83

8 x 8 Register File 84

Register file simulation 85

Properties of Complementary CMOS Gates Snapshot High noise margins: V OH and V OL are at V DD and GND, respectively. No static power consumption: There never exists a direct path between V DD and V SS (GND) in steady-state mode. Comparable rise and fall times: (under appropriate sizing conditions) 86

Switch Delay Model R eq R p R p R p R p R n C L R n C L R p C int NND2 R n Cint INV R n R n C L NOR2 87

Input Pattern Effects on Delay R p R n R n R p C L C int Delay is dependent on the pattern of inputs Low to high transition both inputs go low delay is 0.69 R p /2 C L one input goes low delay is 0.69 R p C L High to low transition both inputs go high delay is 0.69 2R n C L 88

Delay Dependence on Input Patterns Voltage [V] 3 2.5 2 1.5 1 0.5 0-0.5 ==1 0 =1, =1 0 =1 0, =1 0 100 200 300 400 time [ps] Input Data Pattern Delay (psec) ==0 1 67 =1, =0 1 64 = 0 1, =1 61 ==1 0 45 =1, =1 0 80 = 1 0, =1 81 NMOS = 0.5μm/0.25 μm PMOS = 0.75μm/0.25 μm C L = 100 ff 89

Transistor Sizing R p R p 2 2 4 R p 2 R n C L 4 R p C int 2 R n Cint 1 R n R n 1 C L 90

Transistor Sizing a Complex CMOS Gate 4 3 C 8 8 6 6 D 4 6 OUT = D + ( + C) 2 D 1 2 C 2 91

Fan-In Considerations C D C 3 C L Distributed RC model (Elmore delay) C D C 2 C 1 t phl = 0.69 R eqn (C 1 +2C 2 +3C 3 +4C L ) Propagation delay deteriorates rapidly as a function of fan-in quadratically in the worst case. 92

t p as a Function of Fan-In t p (psec) 1250 1000 750 500 250 0 fan-in t phl t plh 2 4 6 8 10 12 14 16 t p quadratic linear Gates with a fan-in greater than 4 should be avoided. 93

t p as a Function of Fan-Out t p (psec) t p NOR2 t p NND2 t p INV 2 4 6 8 10 12 14 16 eff. fan-out ll gates have the same drive current. Slope is a function of driving strength 94

t p as a Function of Fan-In and Fan-Out Fan-in: quadratic due to increasing resistance and capacitance Fan-out: each additional fan-out gate adds two gate capacitances to C L t p = a 1 FI + a 2 FI 2 + a 3 FO 95

Fast Complex Gates: Design Technique 1 Transistor sizing as long as fan-out capacitance dominates Progressive sizing In N MN C L Distributed RC line In 3 M3 C 3 M1 > M2 > M3 > > MN (the fet closest to the output is the smallest) In 2 In 1 M2 M1 C 2 C 1 Can reduce delay by more than 20%; decreasing gains as technology shrinks 96

Fast Complex Gates: Design Technique 2 Transistor ordering critical path critical path In 3 1 In 2 1 In 1 0 1 charged 0 1 In M3 C 1 L M3 charged C L In 1 M2 C 2 2 charged M2 C2 discharged M1 charged In 3 1 M1 C discharged 1 C 1 delay determined by time to discharge C L, C 1 and C 2 delay determined by time to discharge C L 97

Fast Complex Gates: Design Technique 3 lternative logic structures F = CDEFGH 98

Fast Complex Gates: Design Technique 4 Isolating fan-in from fan-out using buffer insertion C L C L 99

Fast Complex Gates: Design Technique 5 Reducing the voltage swing t phl = 0.69 (3/4 (C L V DD )/ I DSTn ) = 0.69 (3/4 (C L V swing )/ I DSTn ) linear reduction in delay also reduces power consumption ut the following gate is much slower! Or requires use of sense amplifiers on the receiving end to restore the signal level (memory design) 100

Sizing Logic Paths for Speed Frequently, input capacitance of a logic path is constrained Logic also has to drive some capacitance Example: LU load in an Intel s microprocessor is 0.5pF How do we size the LU datapath to achieve maximum speed? We have already solved this for the inverter chain can we generalize it for any type of logic? 101

uffer Example In Out 1 2 N C L Delay = N ( pi + gi fi ) i= 1 (in units of τ inv ) For given N: C i+1 /C i = C i /C i-1 To find N: C i+1 /C i ~ 4 How to generalize this to any logic path? 102

Logical Effort Delay = τ = k R ( p + g f ) unit C unit 1 + C γc p intrinsic delay (3kR unit C unit γ) - gate parameter f(w) g logical effort (kr unit C unit ) gate parameter f(w) f effective fanout Normalize everything to an inverter: g inv =1, p inv = 1 L in Divide everything by τ inv (everything is measured in unit delays τ inv ) ssume γ = 1. 103

Delay in a Logic Gate Gate delay: d = h + p effort delay intrinsic delay Effort delay: h = g f logical effort effective fanout = C out /C in Logical effort is a function of topology, independent of sizing Effective fanout (electrical effort) is a function of load/gate size 104

Logical Effort Inverter has the smallest logical effort and intrinsic delay of all static CMOS gates Logical effort of a gate presents the ratio of its input capacitance to the inverter capacitance when sized to deliver the same current Logical effort increases with the gate complexity 105

Logical Effort Logical effort is the ratio of input capacitance of a gate to the input capacitance of an inverter with the same output current V DD V DD V DD 2 2 2 4 1 F 2 F 4 F 2 1 1 Inverter 2-input NND 2-input NOR g = 1 g = 4/3 g = 5/3 106

Logical Effort of Gates Normalized delay (d) g = p = d = t pnnd t g = p = d = pinv F(Fan-in) 1 2 3 4 5 6 7 Fan-out (h) 107

Logical Effort of Gates Normalized delay (d) g = 4/3 p = 2 d = (4/3)h+2 t pnnd t pinv g = 1 p = 1 d = h+1 F(Fan-in) 1 2 3 4 5 6 7 Fan-out (h) 108

Logical Effort of Gates Normalized Delay 5 4 3 2 Inverter: g = 1; p = 1 2-input NND: g = 4/3; p = 2 Effort Delay 1 Intrinsic Delay 1 2 3 4 5 Fanout f 109

dd ranching Effort ranching effort: b = C on path + C C off on path path 110

Multistage Networks Delay = N ( pi + gi fi ) i= 1 Stage effort: h i = g i f i Path electrical effort: F = C out /C in Path logical effort: G = g 1 g 2 g N ranching effort: = b 1 b 2 b N Path effort: H = GF Path delay D = Σd i = Σp i + Σh i 111

Optimum Effort per Stage When each stage bears the same effort: h N = H h = N H Stage efforts: g 1 f 1 = g 2 f 2 = = g N f N Effective fanout of each stage: f i = h g i Minimum path delay Dˆ = ( ) 1/ N g f + p = NH P i i i + 112

Optimal Number of Stages For a given load, and given input capacitance of the first gate Find optimal number of stages and optimal sizing D N = H 1/ N D = NH + 1/ N Np inv 1/ N ln( ) 1/ N H + H + p = 0 inv Substitute best stage effort h = H 1/ Nˆ 113

Logical Effort From Sutherland, Sproull 114

Example: Optimize Path 1 a b c 5 g = 1 f = a g = 5/3 f = b/a g = 5/3 f = c/b g = 1 f = 5/c Effective fanout, F = G = H = h = a = b = 115

Example: Optimize Path 1 a b c 5 g = 1 f = a g = 5/3 f = b/a g = 5/3 f = c/b g = 1 f = 5/c Effective fanout, F = 5 G = 25/9 H = 125/9 = 13.9 h = 1.93 a = 1.93 b = ha/g 2 = 2.23 c = hb/g 3 = 5g 4 /f = 2.59 116

Example: Optimize Path 1 a b c 5 g 1 = 1 g 2 = 5/3 g 3 = 5/3 g 4 = 1 Effective fanout, H = 5 G = 25/9 F = 125/9 = 13.9 f = 1.93 a = 1.93 b = fa/g 2 = 2.23 c = fb/g 3 = 5g 4 /f = 2.59 117

Example 8-input ND 118

Method of Logical Effort Compute the path effort: F = GH Find the best number of stages N ~ log 4 F Compute the stage effort f = F 1/N Sketch the path with this number of stages Work either from either end, find sizes: C in = C out *g/f Reference: Sutherland, Sproull, Harris, Logical Effort, Morgan-Kaufmann 1999. 119

Summary Sutherland, Sproull Harris 120

Ratioed Logic 121

Ratioed Logic V DD V DD V DD Resistive Load R L Depletion Load V T < 0 PMOS Load F F V SS F In 1 In 2 In 3 PDN In 1 In 2 In 3 PDN In 1 In 2 In 3 PDN V SS V SS V SS (a) resistive load (b) depletion load NMOS (c) pseudo-nmos Goal: to reduce the number of devices over complementary CMOS 122

Ratioed Logic V DD Resistive Load R L N transistors + Load V OH = V DD F V OL = R PN R PN + R L In 1 In 2 In 3 PDN ssymetrical response Static power consumption V SS t pl = 0.69 R L C L 123

ctive Loads V DD V DD Depletion Load V T < 0 PMOS Load F V SS F In 1 In 2 In 3 PDN In 1 In 2 In 3 PDN V SS V SS depletion load NMOS pseudo-nmos 124

Pseudo-NMOS V DD C D F C L V OH = V DD (similar to complementary CMOS) V2 k n ( V OL k DD V Tn )V p OL ------------- = ------ ( V 2 2 DD V Tp ) 2 V = V V OL ( DD T ) 1 1 k ------ p (assuming that V = V = V ) k T Tn Tp n SMLLER RE & LOD UT STTIC POWER DISSIPTION!!! 125

Pseudo-NMOS VTC 3.0 2.5 2.0 W/L p = 4 V ou t [V] 1.5 1.0 W/L p = 2 0.5 W/L p = 0.5 W/L p = 0.25 W/L p = 1 0.0 0.0 0.5 1.0 1.5 2.0 2.5 V in [V] 126

Improved Loads V DD Enable M1 M2 M1 >> M2 F C D C L daptive Load 127

Improved Loads (2) V DD V DD M1 M2 Out Out PDN1 PDN2 V SS V SS Differential Cascode Voltage Switch Logic (DCVSL) 128

DCVSL Example Out Out XOR-NXOR gate 129

DCVSL Transient Response 2.5 Volta ge [V] 1.5 0.5,, -0.5 0 0.2 0.4 0.6 0.8 1.0 Time [ns] 130

Pass-Transistor Logic 131

Pass-Transistor Logic Inputs Switch Network Out Out N transistors No static consumption 132

Example: ND Gate F = 0 133

NMOS-Only Only Logic V DD In x 0.5μm/0.25μm 1.5μm/ 0.25μm 0.5 μm/ 0.25μm Out Voltage [V] 3.0 2.0 1.0 Out x In 0.0 0 0.5 1 1.5 2 Time [ns] 134

NMOS-only Switch C = 2.5V C = 2.5 V = 2.5 V = 2.5 V M 2 M n C L M 1 V does not pull up to 2.5V, but 2.5V -V TN Threshold voltage loss causes static power consumption NMOS has higher threshold than PMOS (body effect) 135

NMOS Only Logic: Level Restoring Transistor V DD Level Restorer V DD M r M 2 M n X Out M 1 dvantage: Full Swing Restorer adds capacitance, takes away pull down current at X Ratio problem 136

Restorer Sizing Voltage [V] 3.0 2.0 1.0 W/L r =1.75/0.25 W/L r =1.50/0.25 W/L r =1.0/0.25 W/L r =1.25/0.25 Upper limit on restorer size Pass-transistor pull-down can have several transistors in stack 0.0 0 100 200 300 400 500 Time [ps] 137

Solution 2: Single Transistor Pass Gate with V T =0 V DD 0V 2.5V V DD V DD 0V Out 2.5V WTCH OUT FOR LEKGE CURRENTS 138

Complementary Pass Transistor Logic Pass-Transistor Network F (a) Inverse Pass-Transistor Network F F= F=+ F= ΒÝ (b) F= F=+ F= ΒÝ ND/NND OR/NOR EXOR/NEXOR 139

Solution 3: Transmission Gate C C C C C = 2.5 V = 2.5 V C L C = 0 V 140

Resistance of Transmission Gate 30 R n 2.5 V Rn Resistance, ohms 20 10 R p R n R p 2.5 V 0 V R p V ou t 0 0.0 1.0 2.0 V ou t, V 141

Pass-Transistor ased Multiplexer S S V DD S V DD M 2 S F M 1 S GND In 1 S S In 2 142

Transmission Gate XOR M2 M1 F M3/M4 143

Delay in Transmission Gate Networks 2.5 2.5 2.5 2.5 In V 1 V i-1 V i V i+1 V n-1 V n C 0 0 C 0 C C 0 C (a) In R eq R V eq R eq R 1 V i V i+1 V eq n-1 V n C C C C C m (b) R eq R eq R eq R eq R eq R eq In C C C C C C C C (c) 144

Delay Optimization 145

Transmission Gate Full dder P V DD V DD P C i C i P S Sum Generation V DD P P P V DD C o Carry Generation C i C i Setup C i P Similar delays for sum and carry 146

Dynamic Logic 147

Dynamic CMOS In static circuits at every point in time (except when switching) the output is connected to either GND or V DD via a low resistance path. fan-in of n requires 2n (n N-type + n P-type) devices Dynamic circuits rely on the temporary storage of signal values on the capacitance of high impedance nodes. requires on n + 2 (n+1 N-type + 1 P-type) transistors 148

Dynamic Gate Clk M p Out Clk M p off on 1 Out In 1 In 2 In 3 PDN C L (()+C) C Clk M e Two phase operation Precharge (Clk = 0) Evaluate (Clk = 1) Clk M e off on 149

Conditions on Output Once the output of a dynamic gate is discharged, it cannot be charged again until the next precharge operation. Inputs to the gate can make at most one transition during evaluation. Output can be in the high impedance state during and after evaluation (PDN off), state is stored on C L 150

Properties of Dynamic Gates Logic function is implemented by the PDN only number of transistors is N + 2 (versus 2N for static complementary CMOS) Full swing outputs (V OL = GND and V OH = V DD ) Non-ratioed - sizing of the devices does not affect the logic levels Faster switching speeds reduced load capacitance due to lower input capacitance (C in ) reduced load capacitance due to smaller output loading (Cout) no I sc, so all the current provided by PDN goes into discharging C L 151

Properties of Dynamic Gates Overall power dissipation usually higher than static CMOS no static current path ever exists between V DD and GND (including P sc ) no glitching higher transition probabilities extra load on Clk PDN starts to work as soon as the input signals exceed V Tn, so V M, V IH and V IL equal to V Tn low noise margin (NM L ) Needs a precharge/evaluate clock 152

Issues in Dynamic Design 1: Charge Leakage CLK Clk M p Out C L Clk M e V Out Evaluate Precharge Leakage sources Dominant component is subthreshold current 153

Solution to Charge Leakage Keeper Clk M p M kp C L Out Clk M e Same approach as level restorer for pass-transistor logic 154

Issues in Dynamic Design 2: Charge Sharing Clk M p C L Out Charge stored originally on C L is redistributed (shared) over C L and C leading to reduced robustness =0 Clk M e C C 155

Charge Sharing Example Clk Out C L =50fF C a =15fF! C b =15fF C c =15fF C C C d =10fF Clk 156

Charge Sharing V DD case 1) if ΔV out < V Tn Clk M p Out C L V DD = C L V out ( t) + C a ( V DD V Tn ( V X )) M a X C L ΔV out or = V out ( t) V DD = C a -------V ( C DD V Tn ( V X )) L = 0 M b C a case 2) if ΔV out > V Tn Clk M e C b C a ΔV out V -------------------- = DD C a + C L 157

Solution to Charge Redistribution Clk M p M kp Out Clk Clk M e Precharge internal nodes using a clock-driven transistor (at the cost of increased area and power) 158

Issues in Dynamic Design 3: ackgate Coupling Clk =0 M p C L1 Out1 =1 C L2 Out2 =0 In =0 Clk M e Dynamic NND Static NND 159

ackgate Coupling Effect 3 Voltage 2 1 Clk Out1 0 In Out2-1 0 2 Time, ns 4 6 160

Issues in Dynamic Design 4: Clock Feedthrough Clk Clk M p M e C L Out Coupling between Out and Clk input of the precharge device due to the gate to drain capacitance. So voltage of Out can rise above V DD. The fast rising (and falling edges) of the clock couple to Out. 161

Voltage Clock Feedthrough Clk Out 2.5 Clock feedthrough In 1 In 2 1.5 In 3 In 4 Clk 0.5-0.5 In & Clk Out 0 0.5 Time, ns 1 Clock feedthrough 162

Other Effects Capacitive coupling Substrate coupling Minority charge injection Supply noise (ground bounce) 163

Cascading Dynamic Gates V Clk In M p Clk Out1 M p Out2 Clk In Clk M e Clk M e Out1 V Tn Out2 ΔV t Only 0 1 transitions allowed at inputs! 164

Domino Logic Clk In 1 In 2 M p PDN 1 1 1 0 Out1 Clk 0 0 0 1 In 4 M p M kp PDN Out2 In 3 In 5 Clk M e Clk M e 165

Why Domino? Clk In i In j Clk PDN In i PDN In i PDN In i PDN In j In j In j Like falling dominos! 166

Properties of Domino Logic Only non-inverting logic can be implemented Very high speed static inverter can be skewed, only L-H transition Input capacitance reduced smaller logical effort 167

Designing with Domino Logic V DD V DD V DD Clk M p Out1 Clk M p M r Out2 In 1 In 2 PDN In 4 PDN In 3 Can be eliminated! Clk M e Clk M e Inputs = 0 during precharge 168

Footless Domino V DD V DD V DD Clk M p Out 1 Clk M p Out 2 Clk M p Out n 0 1 0 1 0 1 In 1 1 0 In 2 1 0 In 3 In n 1 0 1 0 The first gate in the chain needs a foot switch Precharge is rippling short-circuit current solution is to delay the clock for each stage 169

Differential (Dual Rail) Domino Out = Clk off on M p M kp M kp M p Clk 1 0 1 0!! Out = Clk M e Solves the problem of non-inverting logic 170

np-cmos Clk In 1 M p 1 1 1 0 Out1 Clk In 4 M e PUN In 2 In 3 Clk PDN M e In 5 Clk M p 0 0 0 1 Out2 (to PDN) Only 0 1 transitions allowed at inputs of PDN Only 1 0 transitions allowed at inputs of PUN 171

NOR Logic Clk In 1 M p 1 1 1 0 Out1 Clk In 4 M e PUN In 2 In 3 Clk PDN M e In 5 Clk M p 0 0 0 1 Out2 (to PDN) to other PDN s to other PUN s WRNING: Very sensitive to noise! 172