EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture. Rajeevan Amirtharajah University of California, Davis

Similar documents
EEC 216 Lecture #2: Metrics and Logic Level Power Estimation. Rajeevan Amirtharajah University of California, Davis

CMPEN 411 VLSI Digital Circuits Spring Lecture 14: Designing for Low Power

Lecture 23. Dealing with Interconnect. Impact of Interconnect Parasitics

Clock signal in digital circuit is responsible for synchronizing the transfer to the data between processing elements.

Chapter 8. Low-Power VLSI Design Methodology

EE241 - Spring 2001 Advanced Digital Integrated Circuits

EE241 - Spring 2000 Advanced Digital Integrated Circuits. Announcements

CSE241 VLSI Digital Circuits Winter Lecture 07: Timing II

MODULE III PHYSICAL DESIGN ISSUES

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Lecture 8-1. Low Power Design

! Crosstalk. ! Repeaters in Wiring. ! Transmission Lines. " Where transmission lines arise? " Lossless Transmission Line.

CSE493/593. Designing for Low Power

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

CMPEN 411 VLSI Digital Circuits Spring 2012 Lecture 17: Dynamic Sequential Circuits And Timing Issues

VLSI Design I; A. Milenkovic 1

Digital Integrated Circuits A Design Perspective. Arithmetic Circuits. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

Design for Manufacturability and Power Estimation. Physical issues verification (DSM)

Vectorized 128-bit Input FP16/FP32/ FP64 Floating-Point Multiplier

Fig. 1 CMOS Transistor Circuits (a) Inverter Out = NOT In, (b) NOR-gate C = NOT (A or B)

Objective and Outline. Acknowledgement. Objective: Power Components. Outline: 1) Acknowledgements. Section 4: Power Components

Digital Integrated Circuits. The Wire * Fuyuzhuo. *Thanks for Dr.Guoyong.SHI for his slides contributed for the talk. Digital IC.

THE INVERTER. Inverter

Topics to be Covered. capacitance inductance transmission lines

L15: Custom and ASIC VLSI Integration

Lecture 25. Dealing with Interconnect and Timing. Digital Integrated Circuits Interconnect

EE115C Winter 2017 Digital Electronic Circuits. Lecture 19: Timing Analysis

School of EECS Seoul National University

Digital Integrated Circuits A Design Perspective

EE 466/586 VLSI Design. Partha Pande School of EECS Washington State University

VLSI Signal Processing

9/18/2008 GMU, ECE 680 Physical VLSI Design

Announcements. EE141- Fall 2002 Lecture 7. MOS Capacitances Inverter Delay Power

EE382 Processor Design Winter 1999 Chapter 2 Lectures Clocking and Pipelining

Hw 6 and 7 Graded and available Project Phase 2 Graded Project Phase 3 Launch Today

The Linear-Feedback Shift Register

L16: Power Dissipation in Digital Systems. L16: Spring 2007 Introductory Digital Systems Laboratory

Name: Answers. Mean: 83, Standard Deviation: 12 Q1 Q2 Q3 Q4 Q5 Q6 Total. ESE370 Fall 2015

CMPEN 411 VLSI Digital Circuits Spring Lecture 19: Adder Design

Very Large Scale Integration (VLSI)

EE115C Winter 2017 Digital Electronic Circuits. Lecture 6: Power Consumption

Lecture 2: CMOS technology. Energy-aware computing

Spiral 2 7. Capacitance, Delay and Sizing. Mark Redekopp

Interconnects. Wire Resistance Wire Capacitance Wire RC Delay Crosstalk Wire Engineering Repeaters. ECE 261 James Morizio 1

S No. Questions Bloom s Taxonomy Level UNIT-I

EECS 427 Lecture 11: Power and Energy Reading: EECS 427 F09 Lecture Reminders

ECE 407 Computer Aided Design for Electronic Systems. Simulation. Instructor: Maria K. Michael. Overview

EECS 151/251A Homework 5

Homework 4 due today Quiz #4 today In class (80min) final exam on April 29 Project reports due on May 4. Project presentations May 5, 1-4pm

Power Consumption in CMOS CONCORDIA VLSI DESIGN LAB

University of Toronto. Final Exam

Bit-Sliced Design. EECS 141 F01 Arithmetic Circuits. A Generic Digital Processor. Full-Adder. The Binary Adder

Lecture 7: Logic design. Combinational logic circuits

SEMICONDUCTOR MEMORIES

Arithmetic Building Blocks

CARNEGIE MELLON UNIVERSITY DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING DIGITAL INTEGRATED CIRCUITS FALL 2002

Integrated Circuits & Systems

Managing Physical Design Issues in ASIC Toolflows Complex Digital Systems Christopher Batten February 21, 2006

The Wire. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. July 30, 2002

ECE321 Electronics I

Digital Integrated Circuits A Design Perspective. Semiconductor. Memories. Memories

Energy Delay Optimization

CMPEN 411 VLSI Digital Circuits Spring Lecture 21: Shifters, Decoders, Muxes

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Sciences

EECS 141 F01 Lecture 17

Digital Integrated Circuits A Design Perspective

Logical Effort: Designing for Speed on the Back of an Envelope David Harris Harvey Mudd College Claremont, CA

Lecture 9: Clocking, Clock Skew, Clock Jitter, Clock Distribution and some FM

Semiconductor memories

Lecture 6 Power Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010

Lecture #39. Transistor Scaling

Chapter 5 CMOS Logic Gate Design

Introduction to CMOS VLSI Design (E158) Lecture 20: Low Power Design

CHAPTER 15 CMOS DIGITAL LOGIC CIRCUITS

DKDT: A Performance Aware Dual Dielectric Assignment for Tunneling Current Reduction

Integrated Circuits & Systems

Datapath Component Tradeoffs

Errata of K Introduction to VLSI Systems: A Logic, Circuit, and System Perspective

A novel Capacitor Array based Digital to Analog Converter

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS

EE141- Spring 2004 Digital Integrated Circuits

Using Global Clock Networks

Implementation of Clock Network Based on Clock Mesh

8. Design Tradeoffs x Computation Structures Part 1 Digital Circuits. Copyright 2015 MIT EECS

Testability. Shaahin Hessabi. Sharif University of Technology. Adapted from the presentation prepared by book authors.

Lecture 21: Packaging, Power, & Clock

Semiconductor Memories

Topics. Dynamic CMOS Sequential Design Memory and Control. John A. Chandy Dept. of Electrical and Computer Engineering University of Connecticut

Semiconductor Memories. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Paolo Spirito

EECS 427 Lecture 8: Adders Readings: EECS 427 F09 Lecture 8 1. Reminders. HW3 project initial proposal: due Wednesday 10/7

DesignConEast 2005 Track 4: Power and Packaging (4-WA1)

Chapter Overview. Memory Classification. Memory Architectures. The Memory Core. Periphery. Reliability. Memory

COMP 103. Lecture 16. Dynamic Logic

EEC 118 Lecture #5: CMOS Inverter AC Characteristics. Rajeevan Amirtharajah University of California, Davis Jeff Parkhurst Intel Corporation

Memory, Latches, & Registers

Session 8C-5: Inductive Issues in Power Grids and Packages. Controlling Inductive Cross-talk and Power in Off-chip Buses using CODECs

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING QUESTION BANK

EEC 118 Lecture #2: MOSFET Structure and Basic Operation. Rajeevan Amirtharajah University of California, Davis Jeff Parkhurst Intel Corporation

MODULE 5 Chapter 7. Clocked Storage Elements

Timing Issues. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolić. January 2003

Transcription:

EEC 216 Lecture #3: Power Estimation, Interconnect, & Architecture Rajeevan Amirtharajah University of California, Davis

Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top Level Power Estimation Behavioral Level Power Estimation Architectural Level Power Estimation Interconnect Power Low Power Architecture R. Amirtharajah, EEC216 Winter 2008 2

Announcements R. Amirtharajah, EEC216 Winter 2008 3

Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top Level Power Estimation Behavioral Level Power Estimation Architectural Level Power Estimation Interconnect Power Low Power Architecture R. Amirtharajah, EEC216 Winter 2008 4

Review: Power-Delay Product PDP = P av t pd Product of average power and propagation delay is generally a constant (fixed technology and topology) PDP = Energy consumed by gate per switching event (Watts x seconds = Joules) Energy is nice because it is a physical quantity, easy to relate to actual device Can be distorting: to minimize energy, use V DD = 0 V Most useful when t pd is constrained by application DSP, multimedia applications, real-time operation R. Amirtharajah, EEC216 Winter 2008 5

Review: Energy-Delay Product EDP = PDP t = pd P av t 2 pd Weight performance more heavily than PDP Enables more flexible power-performance tradeoff Higher voltages decrease delay but increase energy Lower voltages decrease energy but increase delay Therefore there exists an optimum supply voltage Useful when application allows power and performance (e.g., clock frequency) to both vary Good for evaluating logic styles, microprocessors, etc. R. Amirtharajah, EEC216 Winter 2008 6

Review: Intersignal Correlations Activity factor assumes independent, uniformly distributed input data, NOT very good assumptions typically Switching activity strong function of input statistics Must use conditional probabilities when evaluating circuits with reconvergent fanout Several techniques can be applied to reduce activity factor of logic internal nodes Logic restructuring: rearrange gates, for example in chains instead of trees Input reordering: put high activity signals late in path Consider parallelizing structures instead of time multiplexing them R. Amirtharajah, EEC216 Winter 2008 7

Review: Glitches Glitches due to mismatches in nonzero propagation delay of logic gates Internal nodes may make spurious transitions before settling on final output Reduce glitching by balancing logic delays (can also speed up circuits) Add transparent latches to inputs of complex combinational logic blocks to eliminate glitching when outputs unused R. Amirtharajah, EEC216 Winter 2008 8

Review: Top Level Power Estimation System Level Power Estimation Allows designer to analyze power impact of system partitioning among software, FPGAs, ASICs, etc. Spreadsheet analysis using library of models for entire components Models created from measurements, low-level power estimation Instruction Level Power Estimation Run assembly instructions on target processor and measure power Create database of power cost for individual instructions, pairs, maybe entire frequently used traces Add in cache misses, pipeline stalls, etc. R. Amirtharajah, EEC216 Winter 2008 9

Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top Level Power Estimation Behavioral Level Power Estimation Architectural Level Power Estimation Interconnect Power Low Power Architecture R. Amirtharajah, EEC216 Winter 2008 10

Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top Level Power Estimation Behavioral Level Power Estimation Architectural Level Power Estimation Interconnect Power Low Power Architecture R. Amirtharajah, EEC216 Winter 2008 11

Interconnect Modeling Early days of CMOS, wires could be treated as ideal for most digital applications, not so anymore! On-chip wires have resistance, capacitance, and inductance Similar to MOSFET charging, energy depends on capacitance Resistance might impact adiabatic charging, static current dissipation Ignore inductance for now Interconnect modeling is whole field of research itself! R. Amirtharajah, EEC216 Winter 2008 12

Resistance Resistance proportional to length and inversely proportional to cross section Depends on material constant resistivity ρ (Ω-m) H L W ρ L ρ L L ρ R = A = HW = R sq W R sq = H R. Amirtharajah, EEC216 Winter 2008 13

Parallel-Plate Capacitance Width large compared to dielectric thickness, height small compared to width: E field lines orthogonal to substrate H t W substrate L dielectric C = ε r WL t R. Amirtharajah, EEC216 Winter 2008 14

Fringing Field Capacitance When height comparable to width, must account for fringing field component as well W L H t dielectric substrate R. Amirtharajah, EEC216 Winter 2008 15

Total Capacitance Model When height comparable to width, must account for fringing field component as well Model as a cylindrical conductor above substrate dielectric H W t substrate R. Amirtharajah, EEC216 Winter 2008 16

Total Capacitance Model Total capacitance per unit length is parallel-plate (area) term plus fringing-field term: c = c pp + c fringe = ε t r W H 2πε + r 2 log 2 + ( t H 1) Model is simple and works fairly well More sophisticated numerical models also available Process models often give both area and fringing (also known as sidewall) capacitance numbers per unit length of wire for each interconnect layer R. Amirtharajah, EEC216 Winter 2008 17

Capacitive Coupling Fringing fields can terminate on adjacent conductors as well as substrate Mutual capacitance between wires implies crosstalk, affects data dependency of power dielectric substrate R. Amirtharajah, EEC216 Winter 2008 18

Miller Capacitance Amount of charge moved onto mutual capacitance depends on switching of surrounding wires When adjacent wires move in opposite direction, capacitance is effectively doubled (Miller effect) A B V + + V C m C m ΔQ = C = C m m ( V V ) f i ( V ( V )) DD = 2C m V DD DD C R. Amirtharajah, EEC216 Winter 2008 19

Data Dependent Switched Capacitance 1 When adjacent wires move in same direction, mutual capacitance is effectively eliminated A B C OR A B C = 0 C eff A B C OR A B C C = 4C eff m A A B C OR A B C B C OR A B C C = 2C eff m R. Amirtharajah, EEC216 Winter 2008 20

Data Dependent Switched Capacitance 2 When adjacent wires are static, mutual capacitance is effectively to ground 0 B 0 OR 1B 1 1B 0 OR 0B 1 0 B 1 OR 1B 0 C = 2C eff m 1B 1 OR 0B 0 Remember: it is the charging of capacitance where we account for energy from supply, not discharging R. Amirtharajah, EEC216 Winter 2008 21

Wire Length Estimation Flow Final piece of architectural power estimation: incorporate interconnect power Given hierarchical RTL description, estimate wire lengths before design actually placed and routed Depth-first traversal of hierarchy, where leaf nodes are blocks already well characterized for area and wire length by dedicated analysis Example: consider four block types Primitives: memory, datapath, control Composite block made up of primitives R. Amirtharajah, EEC216 Winter 2008 22

Hierarchical Chip Floorplan Datapath Composite Memory Memory Composite Control Datapath Datapath Composite R. Amirtharajah, EEC216 Winter 2008 23

Memory and Control Blocks Assume given blocks are already characterized or can easily be done Typical for foundry to provide memory hard or soft layout macro for synthesis flow: user given interconnect lengths and area Random control logic usually small, can be quickly synthesized, placed, and routed Fairly straightforward to turn complexity metrics (number of memory cells or gate counts) into area estimates Turn into power models as discussed earlier R. Amirtharajah, EEC216 Winter 2008 24

Composite Blocks Wire Length Best approach is to estimate length based on early floorplan Alternative is to use empirical observations Studies have shown that good cell placement differs from random placement by constant fudge factor k (k is often quoted as being 3/5) Average wire length for random placement on a square of area A array is 1/3 length of a side Average wire length of good placement on square array: A L = k = 3 R. Amirtharajah, EEC216 Winter 2008 25 5 A

Composite Blocks Area Area of composite block equals sum of areas of constituent blocks A B and area of wires A w A = A + A = A + A w B w i i { blocks} Routing area depends on total number of wires N w, average wire pitch W p, and average length L Using formula on preceding slide, can solve quadratic for length L: L = k 2 N w W p A + w = N ( 2 ) k N W 18 R. Amirtharajah, EEC216 Winter 2008 26 w W w p L p 2 + 36k 2 A B

Datapath Blocks Wire Length Datapaths often laid out in linear (bit slice) pattern Tile N bit slices to create and N-bit datapath Interconnect length thus proportional to datapath length Use another empirical factor to relate good placement to random placement for length in x dimension L x L DP k = k = L + R Li 3 3 { } i blocks Routing channel length L R estimated from wiring pitch and number of I/Os on each block L R = W p i { blocks} R. Amirtharajah, EEC216 Winter 2008 27 N IO i

Datapath Blocks Area Similar approach used to estimate vertical routing in feedthroughs within a bit slice of width W BS : L = y W 3 2 BS Total average interconnect length is L = L x + L y Approximate area as datapath width x datapath length for N bit slices: DP DP Incorporate all of these approximations plus equations for physical capacitance into our RTL-level power estimates k A = W L = NW BS L DP R. Amirtharajah, EEC216 Winter 2008 28

Rent s Rule N g N IO N = IO N p N r g Empirical rule relating number of I/Os in and out of a module to number of gates within the module N p is average number of pins per gate (~2.5) Rent s exponent r between 0.65 and 0.7 Numbers can be used to characterize various design styles (memories, gate arrays, microprocessors) Extended to derive average wire lengths R. Amirtharajah, EEC216 Winter 2008 29

Outline Announcements Review: PDP, EDP, Intersignal Correlations, Glitching, Top Level Power Estimation Behavioral Level Power Estimation Architectural Level Power Estimation Interconnect Power Low Power Architecture R. Amirtharajah, EEC216 Winter 2008 30

Low Power Architecture Outline Clock Gating Power Down Modes Parallelization Pipelining Bit Serial vs. Bit Parallel Datapaths R. Amirtharajah, EEC216 Winter 2008 31

Example Pipeline Use the following example logic pipeline: two combinational logic blocks between registers D f LOGIC A LOGIC B Define reference dynamic power: P 0 = C V Consider various architectural transformations to reduce power, primarily through voltage scaling and duty cycle R. Amirtharajah, EEC216 Winter 2008 32 0 2 DD f

Clock Gated Pipeline Use enable signal to turn off clock when not in use f en D LOGIC A LOGIC B Dynamic power reduction proportional to duty cycle D C (% time the system in use): P G = D C V Must take care in implementation: glitches on enable signal result in false clocking C 0 2 DD f R. Amirtharajah, EEC216 Winter 2008 33

Clock Gated Pipeline With Power Down V DD en f en D LOGIC A LOGIC B Disconnect logic from power supply when clock off Eliminates leakage, static current for further power reduction R. Amirtharajah, EEC216 Winter 2008 34

Partitioning Into Gated Clock Domains CLK5 CLK1 CLK2 CLK0 CLK5 CLK3 CLK4 Generally gate off entire modules or functional units Globally Asynchronous Locally Synchronous (GALS) domains form natural clock gating partitions Synchronize on boundaries (clock gating can introduce skew due to logic in clock path) R. Amirtharajah, EEC216 Winter 2008 35

Power Reduction Due to Clock Gating Clock gating Stretched Operation P G () t Only linear reduction in average power, peak power stays same (issue for power supply and delivery net) Decrease frequency to expand computing to fill time allows voltage reduction also: better than linear gain R. Amirtharajah, EEC216 Winter 2008 36 t

Low Power Architecture Outline Clock Gating Power Down Modes Parallelization Pipelining Bit Serial vs. Bit Parallel Datapaths R. Amirtharajah, EEC216 Winter 2008 37

An Explosion of Power Down Modes Although suboptimal for power reduction, clock gating is often used in practice Power electronics necessary to generate optimum voltage, more cost and complexity Tradeoff between power reduction and startup delay to return to operation ( light sleep vs. deep sleep ) Gate clocks off to individual modules, fastest to start up again Turn off clock generating PLLs, phase-lock transient potentially lasts several ms Ground power supply, must charge VDD net before even turning on PLL R. Amirtharajah, EEC216 Winter 2008 38

Low Power Architecture Outline Clock Gating Power Down Modes Parallelization Pipelining Bit Serial vs. Bit Parallel Datapaths R. Amirtharajah, EEC216 Winter 2008 39

Parallelization Driven Voltage Scaling D LOGIC A LOGIC B LOGIC A LOGIC B f 2 Parallelize computation up to N times Reduce clock frequency by factor N Reduce voltage to meet relaxed frequency constraint R. Amirtharajah, EEC216 Winter 2008 40

Tradeoffs of Parallelization Amount of parallelism in application may be limited Extra capacitance overhead of multiple datapaths N times higher input loading N-to-1 selector on output Lower clock frequency somewhat offset by higher clock load Consumes more area, devices, more leakage power especially in deep submicron Voltage reduction typically results in dramatic power gains Chandrakasan92: ~3X power reduction R. Amirtharajah, EEC216 Winter 2008 41

Low Power Architecture Outline Clock Gating Power Down Modes Parallelization Pipelining Bit Serial vs. Bit Parallel Datapaths R. Amirtharajah, EEC216 Winter 2008 42

Pipeline Driven Voltage Scaling Pipeline at finer granularity to relax critical path constraint D LOGIC A LOGIC B f Clock frequency stays the same Reduce voltage to meet relaxed frequency constraint Increased clock load offsets power reduction somewhat Can t pipeline beyond single gate granularity R. Amirtharajah, EEC216 Winter 2008 43

Parallel / Pipeline Driven Voltage Scaling D LOGIC A LOGIC B LOGIC A LOGIC B f 2 Combine parallelism and pipelining for lowest voltage Reduce clock frequency by parallelism factor N Largest increase in area, capacitance, leakage R. Amirtharajah, EEC216 Winter 2008 44

Low Power Architecture Outline Clock Gating Power Down Modes Parallelization Pipelining Bit Serial vs. Bit Parallel Datapaths R. Amirtharajah, EEC216 Winter 2008 45

Bit Serial vs. Bit Parallel Computation So far, we ve talked about serial versus parallel implementations at the functional unit level Can also consider serial versus parallel implementations at the bit level Historically, datapaths have almost always operated on words (several bits in parallel) In the past, heavily area constrained designs have used serial techniques where one output bit is produced per clock cycle Multipliers in older CMOS processes (> 2 μm) often implemented serially Rather than area, current and future motivation for bit serial techniques may be leakage power R. Amirtharajah, EEC216 Winter 2008 46

Bit Serial and Bit Parallel Adders Serial Adder: C in A i C out B i Parallel Adder: Si B i 1 A i 1 B i A i B i+1 A i+1 C in, i C o, i S i 1 S i S i+1 R. Amirtharajah, EEC216 Winter 2008 47

Parallel Array Multiplier 16b Parallel multiplier: 32390 μm 2, Serial multiplier: 2743 μm 2 32b Parallel adder: 2543 μm 2, Serial adder: 139 μm 2 R. Amirtharajah, EEC216 Winter 2008 48

Supply Voltage Clock Frequency Tradeoff 1 0.9 0.8 parallel (70nm) parallel (100nm) parallel (130nm) serial (70nm) serial (100nm) serial (130nm) Power Supply Voltage vs. Frequency for Multipliers 0.7 Voltage (V) 0.6 0.5 0.4 0.3 0.2 0.1 10 3 10 4 10 5 10 6 10 7 Frequency (Hz) R. Amirtharajah, EEC216 Winter 2008 49

Serial vs. Parallel Adder Power 10 4 Power vs. Frequency for Adders 10 5 10 6 Power (W) 10 7 10 8 parallel (70nm) parallel (100nm) parallel (130nm) serial (70nm) serial (100nm) serial (130nm) 10 9 10 3 10 4 10 5 10 6 10 7 Frequency (Hz) R. Amirtharajah, EEC216 Winter 2008 50

Serial vs. Parallel Multiplier Power 10 3 10 4 parallel (70nm) parallel (100nm) parallel (130nm) serial (70nm) serial (100nm) serial (130nm) Power vs. Frequency for Multipliers Power (W) 10 5 10 6 10 7 10 3 10 4 10 5 10 6 10 7 Frequency (Hz) R. Amirtharajah, EEC216 Winter 2008 51

Bit Serial vs. Bit Parallel Arithmetic Power At low frequencies, lower leakage of smaller serial implementation results in less power Similar result for adders, but more impact since array multiplier is quadratically larger than serial version Are these frequencies interesting? Below 10 MHz is typical for sensor applications, biomedical DSP, RFID tags R. Amirtharajah, EEC216 Winter 2008 52

Next Topic: Low Power Circuit Design Logic families Transistor sizing for low power Clocking methodologies R. Amirtharajah, EEC216 Winter 2008 53