Luis Manuel Santana Gallego 31 Investigation and simulation of the clock skew in modern integrated circuits

Similar documents
Luis Manuel Santana Gallego 100 Investigation and simulation of the clock skew in modern integrated circuits. Clock Skew Model

Luis Manuel Santana Gallego 71 Investigation and simulation of the clock skew in modern integrated circuits. Clock Skew Model 1

Clock signal in digital circuit is responsible for synchronizing the transfer to the data between processing elements.

EE115C Winter 2017 Digital Electronic Circuits. Lecture 19: Timing Analysis

CMPEN 411 VLSI Digital Circuits Spring 2012 Lecture 17: Dynamic Sequential Circuits And Timing Issues

Lecture 9: Clocking, Clock Skew, Clock Jitter, Clock Distribution and some FM

Lecture 23. Dealing with Interconnect. Impact of Interconnect Parasitics

Fig. 1 CMOS Transistor Circuits (a) Inverter Out = NOT In, (b) NOR-gate C = NOT (A or B)

Xarxes de distribució del senyal de. interferència electromagnètica, consum, soroll de conmutació.

Lecture 25. Dealing with Interconnect and Timing. Digital Integrated Circuits Interconnect

Scaling of MOS Circuits. 4. International Technology Roadmap for Semiconductors (ITRS) 6. Scaling factors for device parameters

Very Large Scale Integration (VLSI)

Interconnect (2) Buffering Techniques. Logical Effort

MOS Transistor Theory

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

! Crosstalk. ! Repeaters in Wiring. ! Transmission Lines. " Where transmission lines arise? " Lossless Transmission Line.

Lecture 6 Power Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010

Spiral 2 7. Capacitance, Delay and Sizing. Mark Redekopp

THE INVERTER. Inverter

The Linear-Feedback Shift Register

University of Toronto. Final Exam

EE241 - Spring 2000 Advanced Digital Integrated Circuits. Announcements

E40M Capacitors. M. Horowitz, J. Plummer, R. Howe

Lecture #39. Transistor Scaling

Chapter 2. Design and Fabrication of VLSI Devices

VLSI GATE LEVEL DESIGN UNIT - III P.VIDYA SAGAR ( ASSOCIATE PROFESSOR) Department of Electronics and Communication Engineering, VBIT

ENEE 359a Digital VLSI Design

Chapter 2 CMOS Transistor Theory. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006

Lecture 15: Scaling & Economics

CSE241 VLSI Digital Circuits Winter Lecture 07: Timing II

Answers. Name: Grade: Q1 Q2 Q3 Q4 Total mean: 83, stdev: 14. ESE370 Fall 2017

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

Issues on Timing and Clocking

VLSI Design and Simulation

S No. Questions Bloom s Taxonomy Level UNIT-I

Errata of K Introduction to VLSI Systems: A Logic, Circuit, and System Perspective

ECE 546 Lecture 10 MOS Transistors

Problems in VLSI design

and V DS V GS V T (the saturation region) I DS = k 2 (V GS V T )2 (1+ V DS )

Chapter 4. Digital Integrated Circuit Design I. ECE 425/525 Chapter 4. CMOS design can be realized meet requirements from

DesignConEast 2005 Track 4: Power and Packaging (4-WA1)

Chapter 9. Estimating circuit speed. 9.1 Counting gate delays

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

CARNEGIE MELLON UNIVERSITY DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING DIGITAL INTEGRATED CIRCUITS FALL 2002

Variability Aware Statistical Timing Modelling Using SPICE Simulations

Lecture 5: DC & Transient Response

ECE 342 Solid State Devices & Circuits 4. CMOS

EE5780 Advanced VLSI CAD

Lecture 12 CMOS Delay & Transient Response

Delay Modelling Improvement for Low Voltage Applications

Dynamic Repeater with Booster Enhancement for Fast Switching Speed and Propagation in Long Interconnect

Semiconductor Memories

Interconnect (2) Buffering Techniques.Transmission Lines. Lecture Fall 2003

MOS Transistor Theory

Lecture 16: Circuit Pitfalls

Interconnects. Introduction

Analysis of Substrate Thermal Gradient Effects on Optimal Buffer Insertion

Thermal-reliable 3D Clock-tree Synthesis Considering Nonlinear Electrical-thermal-coupled TSV Model

CMPEN 411 VLSI Digital Circuits. Lecture 04: CMOS Inverter (static view)

GMU, ECE 680 Physical VLSI Design 1

MOSFET: Introduction

Timing Issues. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolić. January 2003

Digital Integrated Circuits. The Wire * Fuyuzhuo. *Thanks for Dr.Guoyong.SHI for his slides contributed for the talk. Digital IC.

Name: Grade: Q1 Q2 Q3 Q4 Q5 Total. ESE370 Fall 2015

Interconnects. Wire Resistance Wire Capacitance Wire RC Delay Crosstalk Wire Engineering Repeaters. ECE 261 James Morizio 1

L ECE 4211 UConn F. Jain Scaling Laws for NanoFETs Chapter 10 Logic Gate Scaling

2007 Fall: Electronic Circuits 2 CHAPTER 10. Deog-Kyoon Jeong School of Electrical Engineering

Digital Microelectronic Circuits ( )

Lecture 21: Packaging, Power, & Clock

Next, we check the race condition to see if the circuit will work properly. Note that the minimum logic delay is a single sum.

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

Name: Answers. Mean: 83, Standard Deviation: 12 Q1 Q2 Q3 Q4 Q5 Q6 Total. ESE370 Fall 2015

Methodology to Achieve Higher Tolerance to Delay Variations in Synchronous Circuits

EEE 421 VLSI Circuits

The Wire EE141. Microelettronica

TAU 2014 Contest Pessimism Removal of Timing Analysis v1.6 December 11 th,

The Physical Structure (NMOS)

Digital Integrated Circuits A Design Perspective. Semiconductor. Memories. Memories

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Sciences

Topics to be Covered. capacitance inductance transmission lines

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING QUESTION BANK

Lecture 34: Portable Systems Technology Background Professor Randy H. Katz Computer Science 252 Fall 1995

CMPEN 411 VLSI Digital Circuits Spring 2012

PARADE: PARAmetric Delay Evaluation Under Process Variation *

EE141-Fall 2011 Digital Integrated Circuits

Topic 4. The CMOS Inverter

Lecture 7 Circuit Delay, Area and Power

Lecture 5 Fault Modeling

Interconnect s Role in Deep Submicron. Second class to first class

Lecture 4: CMOS Transistor Theory

The CMOS Inverter: A First Glance

EECS 141: FALL 05 MIDTERM 1

Design for Manufacturability and Power Estimation. Physical issues verification (DSM)

! Charge Leakage/Charge Sharing. " Domino Logic Design Considerations. ! Logic Comparisons. ! Memory. " Classification. " ROM Memories.

The Wire. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. July 30, 2002

ECE 342 Electronic Circuits. Lecture 35 CMOS Delay Model

ECE 342 Electronic Circuits. 3. MOS Transistors

EECS 151/251A Homework 5

Transcription:

Luis Manuel Santana Gallego 31 Investigation and simulation of the clock skew in modern egrated circuits 3. Clock skew 3.1. Definitions For two sequentially adjacent registers, as shown in figure.1, C i and C f are the clock signals that drive the local data path. Both clock signals are generated in the same clock source. The propagation delay of the clock signals from the source to the registers R i and R f, is T Ci and T Cj respectively. They define the timing reference of when the data signals leave each register. There is a clock distribution network designed to generate a specific signal waveform. Ideally, clock events occur at all registers simultaneously. Given this strategy of global clocking, the clock signal arrival time to each register is defined with respect to a universal time reference. The difference in clock signals arrival time between two register sequentially adjacent is the clock skew T skew. We can define the clock skew mathematical expression as: T skew = T Ci T Cf. If the signals C i and C f are in complete synchronism, it means they arrive at the exact same moment, the clock skew is zero. It is important to note that the clock skew between is only relevant to sequentially adjacent registers that make up a local data path. Thus, the clock skew, at system or chip level, between two registers non-sequentially adjacent has no ects on the performance and reliability of a synchronous digital system from an analysis viewpo. Different clock signal paths can have different delays due to several reasons. We can summarize them in the following three reasons: 1) Differences in the wire lengths from the clock source to the clocked registers. ) Differences in the delays of any active buffer in the clock distribution network. 3) Differences in the erconnection passive parameters. We can note that, for a well-designed balanced clock distribution network, distributed buffers are the primary clock skew source.

Luis Manuel Santana Gallego 3 Investigation and simulation of the clock skew in modern egrated circuits Clock skew magnitude and polarity have two different ects on system performance and reliability. Depending on which signal, C i or C f, arrive earlier and the magnitude of T skew with respect to data path time delay T PD, system reliability and performance can be degraded or improved. Both cases are discussed below. 1) Maximum data path/clock skew constra relationship If the clock signal arrival time to the final register, T Cf, is previous to the arrival time to the initial register, T Ci, the clock skew is positive (T Ci > T Cf ). Under this condition the maximum operation reachable frequency is decreased. A positive clock skew is the additional time amount that must be added to the minimum clock period to apply a new clock signal edge to the final register without any problem. Figure 3.1: Positive clock skew. For a specific design, the greatest propagation delay T PD (max) of any local data path between two sequentially adjacent registers must be less than the minimum clock period T CP (min). ( log (max) ) T T T (max) = T T T T T, where TCi > TCf (3.1) skew CP PD CP C Q ic setup This situation is the typical analysis of the critical data path in a synchronous system. If this constra is not satisfied, the system will not operate correctly with this specific clock period. Therefore, T CP must be increased if we want the circuit operates without any problem. In a circuit where the clock skew tolerance is small, data and clock signals should run in the same direction, thereby forcing that C i leads C f and making the clock skew negative.

Luis Manuel Santana Gallego 33 Investigation and simulation of the clock skew in modern egrated circuits ) Minimum data path/clock skew constra relationship If the clock signal arrival time to the final register, T Cf, is later than arrival time to the initial register, T Ci, the clock skew is negative (T Ci < T Cf ). It can be used to improve the maximum performance of a synchronous system by the reduction of the critical data path. However, there is a minimum constra to avoid race conditions. Figure 3.: Negative clock skew. When C f follows to C i, clock skew must be less than the required time for the data signal to leave the initial register, propagate through the combinatorial logic and erconnections and setup in the final register input. If this condition is not met, the data stored in the final register is overwritten with the data that was stored in the initial register because it arrives to the R f input earlier than the clock signal (race condition). Furthermore, a circuit operating close to this restriction could not work correctly at unpredictable times due to environmental temperature or power supply voltage fluctuations: Tskew T T T T T PD (min) = C Q logic (min) setup, where TCi TCf < (3.) where T PD (min) is the minimum data path delay between two sequentially adjacent registers. 3.. Clock skew sources Clock skew appears due to differences in clock paths from the source to each destination register. These differences can be unequal wire lengths or different resistive and/or capacitive wire parameters. In a balanced clock tree, the nominal value for clock

Luis Manuel Santana Gallego 34 Investigation and simulation of the clock skew in modern egrated circuits skew is zero, since clock paths are designed to be equal. However, clock skew appearance is still possible due to variations in the clock paths caused by process and circuit parameter tolerances. We can classify them in the following way: Transistor parameter variations In the egrated circuit fabrication process, all transistor parameters are subject to deviations from their nominal values. Statistical models have been developed for transistor parameters such as threshold voltage (V T ), gate oxide thickness (t ox ), charge carrier mobility (µ), transistor width (W) and ective channel length ( L ). Figure 3.3: Vertical section of a MOS transistor. In table 3.1, typical values for this parameter are presented according to different technologies. In table 3., standard deviations are shown. Parameter 13 nm 1 nm 7 nm 45 nm V T.19 V.15 V.6 V.1 V T ox 3.3 nm.5 nm 1.6 nm 1.4 nm L 13 nm 1 nm 7 nm 45 nm W (min) 13 nm 1 nm 7 nm 45 nm Table 3.1: Typical values for different technologies [ITRS].

Luis Manuel Santana Gallego 35 Investigation and simulation of the clock skew in modern egrated circuits Parameter Description Standard Deviation σ VT Threshold voltage 4. % σ µ Charge carrier mobility % σ tox Gate oxide thickness 1.3 % σ W Transistor width 5% σ L Transistor ective channel length 5 % Table 3.: Typical values for standard deviations [KAH-1]. Interconnect Parameter Variations Interconnect width (W ) and thickness (t ) and erlevel dielectric thickness (T ILD ) variations are the main parameters of erest. As technology advances, the number of erconnect layers increases, and the erconnect lines become more non-uniform. This non-uniformity, which is caused by manufacturing processes, produces large variations of erconnect parameter values. Figure 3.4: Interconnect segment main parameters. In table 3.3, some typical values for this parameter are presented according to different technologies. In table 3.4, standard deviations are shown.

Luis Manuel Santana Gallego 36 Investigation and simulation of the clock skew in modern egrated circuits Parameter 13 nm 1 nm 7 nm 45 nm W (min) 335 nm 37 nm 16 nm 13 nm t (min) 67 nm 5 nm 35 nm 35 nm Table 3.3: Typical values for different technologies [ITRS]. Parameter Description Standard Deviation σ W Wire width 3 % σ t Wire thickness 3 % σ TILD ILD thickness 3 % Table 3.4: Typical values for standard deviations [FAN-98]. System Parameter Variations Besides process parameter variations, which are mainly the tolerances of device and erconnect physical parameters, system level fluctuations may create clock skew. Power supply voltage fluctuation (V DD ) and temperature variations (T) are considered as system level parameter variations. In table 3.5, some typical values V DD are presented according to different technologies. In table 3.6, standard deviations are shown. Parameter 13 nm 1 nm 7 nm 45 nm V DD 1. V 1 V.9 V.6 V Table 3.5: Typical values for different technologies [ITRS]. Parameter Description Standard Deviation σ VDD Power supply voltage 3.3 % [KAH-1] σ T Temperature 8 % [GRO-98] Table 3.6: Typical values for standard deviations. The thermal image of the Alpha 164 microprocessor, presented in section.4.1, shows a 3º C temperature gradient over the entire chip that gives a temperature variation of about 8 % [GRO-98]. In figure 3.6, this image is depicted.

Luis Manuel Santana Gallego 37 Investigation and simulation of the clock skew in modern egrated circuits Figure 3.6: 164 thermal image [GRO-98]. 3.3. Clock skew models An important research area in VLSI circuits is timing analysis, where simplified models are used to estimate the delay through a CMOS circuit according to process and circuit parameter variations. At first, a probabilistic model for the accumulation of clock skew in synchronous systems is described. Using this model, upper bounds for expected skew and its variance in tree distribution systems are derived. Thereafter, a model for tapered H-Tree is described, where no buffers are placed at branching pos and the wires are widened to avoid reflections. The clock skew is calculated as function of device, system and erconnect parameter variations. The first statistical model (upper bounds model) is too conservative for estimating the clock skew of a well-balanced H- tree clock distribution network because correlation between overlapped parts of paths are not considered. Finally, a new approach to estimate the mean value and variance of clock skew is described taking o account this correlation. 3.3.1. MODEL 1: Statistical model to estimate upper bounds of clock skew This model is described in depth in Appendix 1. Kugelmass and Steiglitz [KUG-88] present a probabilistic model for the accumulation of clock skew in synchronous systems. Using this model, it s possible to

Luis Manuel Santana Gallego 38 Investigation and simulation of the clock skew in modern egrated circuits estimate upper bounds for expected clock skew between processing elements (and its variance) in symmetric tree distribution systems with N synchronously clocked processing elements. The first assumption in this model is the topology of the clock distribution network. It must be a symmetric tree-like structure with a single source and N end pos called processing elements (PE). There must be only one path from the source to the PEs. Figure 3.7: Model 1 tree structure. Each clock path is composed of delay elements: buffers and erconnection wires. It is possible to associate a random variable to each element that gives the delay contribution of it. The total delay from the clock source to each PE is the sum of all the random variables along the path. By the Central Limit Theorem, the sum converges to a normal distribution. According to these authors, it is possible to define a random variable that characterizes the clock skew of the clock distribution network. It is R = A max - A min, where A max and A min are the maximal and the minimal arrival time to any of the N PEs. Random variables that compose R are dependent in a clock tree because they are sums of overlapping variables. However, thanks to a demonstrated theorem, the

Luis Manuel Santana Gallego 39 Investigation and simulation of the clock skew in modern egrated circuits expected mean value of R is smaller than the same case but with independent random variables. Another theorem says that if R is composed of N independent identically distributed random variables (it is the case for a symmetric clock distribution network), then, the asymptotically expected value of R is: [ ] E R 4ln N ln ln N ln 4π C 1 = σ O ( ) 1 ln N log N (3.3) where C.577 is Euler constant and σ is the standard deviation of the path delay. The variance of R is given by: [ ] Var R σ π 1 = O ln N 6 log N (3.4) Equation (3.3) is therefore asymptotic upper bound on the expected skew in a clock distribution tree with N leaves. To apply these model equations to the proposed H-tree depicted in figure 3.1, it is necessary to know the value of clock path standard deviation, σ. It has two different components (and independent according to the model assumption): b w ( ( N )) σ = σ log N σ 1 (3.5) where σ b is the standard deviation of buffer delays and σ w is standard deviation of the wire in the lowest level. The next step to calculate E[R] it is necessary to determine the delay variance σ b through a buffer of the clock distribution tree and delay variance σ w through a wire of the clock distribution tree.

Luis Manuel Santana Gallego 4 Investigation and simulation of the clock skew in modern egrated circuits Using Sakurai s model for erconnection delay, described in section.3.1, and the possible clock skew sources considered by the authors (V T, t ox, L, V DD, T ILD, W, t ), the value of σ b and σ w is determined, in terms of variances of the independent random variables that compose them, by the following expressions: T T T T σ = σ σ σ σ Delay Delay Delay Delay b VT t ox L VDD VT t ox L VDD T T T σ = σ σ σ Delay Delay Delay w TILD W t TILD W t (3.6) (3.7) where: T T R R = =.3( C C ) Delay Delay VT R VT VDD VT = =.3( C C ).3( R R ) Delay Delay Delay tox R tox C tox tox tox C R = =.3( C C ).3 L R L C L L Delay Delay Delay T T R R = =.3( C C ) V R V V V Delay Delay DD DD DD T ( R R ) C L (3.8) T T C C ( 1.R.3R ) T C T T Delay Delay = = ILD ILD ILD ( 1.C.3C ) ( 1.R.3R ) W R W C W W W Delay Delay Delay = = R = t R t R ( 1..3 ) = C C t 3.3.. MODEL : Statistical model for clock skew in tapered H-trees This model is described in depth in Appendix. Zarkesh-Ha, Mule and Meindl [ZAR-98] described a compact model to enable first-order estimation for clock skew in tapered H-trees. In this kind of structure, there

Luis Manuel Santana Gallego 41 Investigation and simulation of the clock skew in modern egrated circuits are not ermediate buffers at the split pos and the wires must be widened to avoid reflections in those pos. Figure 3.8: Model tree structure [ZAR-98]. Authors propose that any H-tree circuit can be simplified in the following equivalent circuit shown in figure 3.9. Figure 3.9: Equivalent circuit of clock H-tree network [ZAR-98]. Using the equivalent circuit, the delay of the entire clock network of figure 3.9 is divided o two parts: Interconnect delay from the clock source to clock the driver: If the H-tree network is driven by a single driver, then the delay expression for a distributed RC line using Sakurai s model (5% of time delay) is: ρ ε 1 ε r r 1 TH tree =.4 D 1 D 1 n n t TILD c (3.9)

Luis Manuel Santana Gallego 4 Investigation and simulation of the clock skew in modern egrated circuits where ε r is the relative dielectric constant of the ILD material, ρ the line resistivity, c o the speed of light in free space, D the die size, and n the number of H-tree levels. Transistor delay of the sub-block clock driver: the delay expression is according to Sakurai s Model (5% of time delay): T driver L / W =.7 C µ Cox ( VDD VT ) L (3.1) where C L is the capacitive load of the sub-block clock driver. The overall delay of the entire clock distribution network is the sum of the previous components: T Delay = T H-tree T Driver. This model gives first order estimation of the clock skew: TCSK ( x) = TDelay x x (3.11) where x is any variation of clock skew components such as V T, t ox, L, H, T ILD, V DD, T and C L. Table 3.1 shows the closed form equations for each individual clock skew component by using (3.11): Physical parameter and derivation used Threshold voltage fluctuation Gate oxide thickness tolerance Transistor channel length tolerance Wire thickness variation Clock skew component V T VT TCSK ( VT ) =.7 R CL VDD VT VT tox T ( t ) =.7 R C t CSK ox L ox L TCSK ( L ) =.7 R CL L 1 t TCSK ( t ) =.4 ( r c ) D 1 n t

Luis Manuel Santana Gallego 43 Investigation and simulation of the clock skew in modern egrated circuits ILD thickness variation IR drop Non uniform register distribution Temperature gradient 1 T TCSK ( TILD ) =.4 ( r c ) D 1 n T ILD ILD V DD VDD TCSK ( VDD ) =.7 R CL VDD VT VDD CL T ( C ) =.7 R C C CSK L L Eg / q VT Τ TCSK ( Τ ) =.7 R CL VDD VT Τ L Table 3.1: Clock skew components. It is important to note that the model equations can be easily modified to be more similar to other models, where the Sakurai s expressions are used with 9% of time delay. It only supposes to change the coicients de T H-tree and T Driver. r - TH tree :.4 1. TH tree = 1. ( r c ) l l c - Tdriver :.7.3 Tdriver =.3 R CL ε 3.3.3. MODEL 3: Statistical model for clock skew considering path correlations This model is described in depth in Appendix 3. Jiang and Horiguchi [JIA-1] propose a new approach to estimate the mean value and variance of clock skew for general clock distribution networks. The novelty is that clock paths can be not identical and path delay correlation caused by the overlapped parts of path lengths is considered. In this way, clock skew mean and variance is accurately estimated for general clock distribution networks. Clock paths of a clock distribution network usually have some common branches over their length. These common branches cause correlation among the delays of these paths. Only the overlapped parts of two paths determine the correlation between them.

Luis Manuel Santana Gallego 44 Investigation and simulation of the clock skew in modern egrated circuits If ξ is the maximum value of the propagation delay and η the minimum value, then the mean value and the variance of the clock skew, χ, are given by: E( χ) = E( ξ ) E( η) (3.1) D( χ) = D( ξ ) D( η) ρ D( ξ ) D( η) (3.13) Here, E( ) and D( ) represent the mean value and the variance of a random variable, respectively, and ρ is the correlation coicient of ξ and η. Author propose a recursive approach for evaluating the parameters E(ξ), E(η), D(ξ), D(η) and ρ. Based on this algorithm, an expression can be derived for the clock skew of a well-balanced H-tree clock distribution networks. Clock skew estimation for H-tree clock distribution networks Before developing the models, the H-tree itself must first be defined. The H-tree presents ermediate buffers at each branching po and, without loss of generality, it has n hierarchical levels, where n denotes the tree depth. The level branch corresponds to the root branch, and level n branches to the branches that support sinks. A level i branch begins in a level split i po and ends in a level i1 split po. The H-tree illustrated in figure 3.1 is drawn for n=4 and it is used to distribute the clock signals to 16 processors. Figure 3.1: A well-balanced H-tree clock distribution network for 16 processors.

Luis Manuel Santana Gallego 45 Investigation and simulation of the clock skew in modern egrated circuits For a n level well-balanced H-tree, let d i, i=,, n be the actual delay of branch i of a clock path. The clock skew E(χ) and skew variance D(χ) of the n level well-balanced H-tree are given by: E D 1 n i 1 k π n i k π i= 1 k = 1 π (3.14) ( χ ) = D ( d ) n π 1 i i= π (3.15) ( χ ) = ( 1 ρ ) D( d ) i The closed-form expressions (3.14) (3.15) indicate clearly how the clock skew is accumulated along the clock paths and with the increase of H-tree size. Clock skew calculation in function of its components The delay of a branch may then be obtained by averaging the rise and fall times using Sakurai s model for erconnection delays (9 % of time delay), described before in section.3.1. One approach to calculating the delay variance of a branch due to the variations of process parameters is express the parameter relations in terms of independent variables. Authors consider the following variables to calculate the variance of the delay of any branch D(d n-ik ): V T, µ, t ox, L, W, T ILD, W, t. The variance of the delay in a branch is the following: T T T T T σ = σ σ σ σ σ Delay Delay Delay Delay Delay TDelay VT µ t ox L W VT µ t ox L W T T T σ σ σ T W t Delay Delay Delay T ILD W t ILD (3.16) where:

Luis Manuel Santana Gallego 46 Investigation and simulation of the clock skew in modern egrated circuits T T R R = =.3( C C ) Delay Delay VT R VT VDD VT R R = =.3( C C ) V R µ µ T = =.3( C C ).3( R R ) Delay Delay Delay tox R tox C tox tox tox R = L R L =.3( C C ).3( R R ) C L L L = =.3( C C ).3( R R ) Delay Delay Delay W R W C W W W (3.18) T T C C ( 1.R.3R ) T C T T Delay Delay = = ILD ILD ILD ( 1.C.3C ) ( 1.R.3R ) W R W C W W W Delay Delay Delay = = R = t R t R ( 1..3 ) = C C t

Luis Manuel Santana Gallego 47 Investigation and simulation of the clock skew in modern egrated circuits 3.3.4. Summary of the models Parameters of the models Model 1 Interconnection resistance: R Interconnection capacitance: C Driving transistor on-resistance: R Parameters Parameter deviations in % Threshold voltage: σ VT Power supply voltage: σ VDD Driving inverter input capacitance: C Gate oxide thickness: σ tox Effective channel length: σ L Number of processing elements: N Lowest level branch length: L Power supply voltage: V DD ILD thickness: σ TILD Wire width: σ w Wire thickness: σ t Threshold voltage: V T Process parameters: Interconnection parameters: r c Threshold voltage of inverters: V T Power supply voltage: V DD Transistors energy gap: E g Design parameters: Buffer output resistance: R Die size: D H-tree levels: n Parameter deviations (in %): Threshold voltage: σ VT Gate oxide thickness: σ tox Effective channel length: σ L Wire thickness: σ t ILD thickness: σ TILD Power supply voltage: σ VDD Load capacitance: σ CL Temperature: σ T Capacitive load of sub-blocks: C L 3 Interconnection resistance: R Interconnection capacitance: C Driving transistor on-resistance: R Driving inverter input capacitance: C Parameter deviations in % Threshold voltage: σ VT Charge carrier mobility: σ µ Gate oxide thickness: σ tox Transistor width: σ W H-tree levels: n Lowest level branch length: L Power supply voltage: V DD Threshold voltage: V T Effective channel length: σ L Wire width: σ w Wire thickness: σ t ILD thickness: σ TILD

Luis Manuel Santana Gallego 48 Investigation and simulation of the clock skew in modern egrated circuits Equations of the models Model 1 Clock skew expression (mean): Equations 4ln N ln ln N ln 4π C 1 E[ Skew] = ( σ b log N σ w( N 1) ) O 1 ( ln N ) ln N Parameter calculation (using 9% time delay in Sakurai s model): ( ) T =1.R C.3 R C R C R C Delay T T T T σ = σ σ σ σ Delay Delay Delay Delay b VT t ox L VDD VT t ox L VDD T T T σ = σ σ σ Delay Delay Delay w TILD W t TILD W t T T R R ( C C ) Delay Delay = =.3 VT R VT VDD VT = =.3( C C ).3( R R ) Delay Delay Delay tox R tox C tox tox tox C R = =.3( C C ).3 Delay Delay Delay L R L C L L T T R R = =.3( C C ) V R V V V Delay Delay DD DD DD T T T C C ( 1.R.3R ) T C T T Delay Delay = = ILD ILD ILD C R ( 1.C.3C ) W R W C W Delay Delay Delay = = W T T R R ( 1.C.3C ) Delay Delay = = t R t t ( R R ) C L ( 1.R.3R ) C W

Luis Manuel Santana Gallego 49 Investigation and simulation of the clock skew in modern egrated circuits Model Equations TCSK = TDelay = TCSK ( x) x x T Delay = T H-tree T Driver Parameter calculation (using 5% time delay in Sakurai s model): ρ ε 1 εr r 1 TH tree =.4 D 1 D 1 n n t TILD c L / W Tdriver =.7 C L µ Cox ( VDD VT ) Physical parameter Clock skew component and derivation used Threshold voltage V T VT fluctuation TCSK ( VT ) =.7 R CL VDD VT VT Gate oxide tox thickness tolerance TCSK ( tox ) =.7 R CL t Transistor channel length tolerance Wire thickness ox L TCSK ( L ) =.7 R CL L 1 t TCSK ( t ) =.4 r c D 1 variation ( ) ILD thickness variation ( ) IR drop Non uniform register distribution Temperature n t 1 T TCSK ( TILD ) =.4 r c D 1 n T V DD VDD TCSK ( VDD ) =.7 R CL VDD VT VDD CL TCSK ( CL ) =.7 R CL C Eg / q VT Τ gradient TCSK ( Τ ) =.7 R CL VDD VT Τ L ILD ILD

Luis Manuel Santana Gallego 5 Investigation and simulation of the clock skew in modern egrated circuits Model 3 Clock skew expression (mean): Equations E n i π 1 k π i= 1 k = 1 π 1 ( χ ) = D( d ) n i k For a n level well-balanced H-tree (with buffers at each split po), D(d i ), i=,, n, is the delay variance of the branch i. T T T T T σ = σ σ σ σ σ Delay Delay Delay Delay Delay TDelay VT µ t ox L W VT µ t ox L W T T T σ σ σ T W t Delay Delay Delay T ILD W t ILD where: T T R R = =.3( C C ) Delay Delay VT R VT VDD VT R R = =.3( C C ) V R µ µ T = =.3( C C ).3( R R ) Delay Delay Delay tox R tox C tox tox tox R = L R L =.3( C C ).3( R R ) C L L L = =.3( C C ).3( R R ) Delay Delay Delay W R W C W W W T T C C ( 1.R.3R ) T C T T Delay Delay = = ILD ILD ILD ( 1.C.3C ) ( 1.R.3R ) W R W C W W W Delay Delay Delay = = R = t R t R ( 1..3 ) = C C t