9th International Symposium on Quality Electronic Design

Similar documents
Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

50.1. Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 27, NO. 5, MAY

AS IC TECHNOLOGIES are scaled to the deep submicrometer

Efficient Incremental Analysis of On-Chip Power Grid via Sparse Approximation

Fast Statistical Analysis of Rare Failure Events for Memory Circuits in High- Dimensional Variation Space

Recursive Statistical Blockade: An Enhanced Technique for Rare Event Simulation with Application to SRAM Circuit Design

Statistical Performance Modeling and Optimization

M.S. Project Report. Efficient Failure Rate Prediction for SRAM Cells via Gibbs Sampling. Yamei Feng 12/15/2011

A Robustness Optimization of SRAM Dynamic Stability by Sensitivity-based Reachability Analysis

An Automated Approach for Evaluating Spatial Correlation in Mixed Signal Designs Using Synopsys HSpice

PARADE: PARAmetric Delay Evaluation Under Process Variation * (Revised Version)

Design Based Analog Testing by Characteristic Observation Inference

PARADE: PARAmetric Delay Evaluation Under Process Variation *

Accounting for Non-linear Dependence Using Function Driven Component Analysis

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 34, NO. 3, MARCH

Hyperspherical Clustering and Sampling for Rare Event Analysis with Multiple Failure Region Coverage

Dynamic Adaptation for Resilient Integrated Circuits and Systems

ANALYSIS OF NONLINEAR PARTIAL LEAST SQUARES ALGORITHMS

Measurement and Modeling of MOS Transistor Current Mismatch in Analog IC s

Discriminative Direction for Kernel Classifiers

An Equivalent Circuit Formulation of the Power Flow Problem with Current and Voltage State Variables

Efficient Per-Nonlinearity Distortion Analysis for Analog and RF Circuits

Recursive Least Squares for an Entropy Regularized MSE Cost Function

Statistical Analysis of BTI in the Presence of Processinduced Voltage and Temperature Variations

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Predicting IC Defect Level using Diagnosis

Exploiting Multiple Mahalanobis Distance Metric to Screen Outliers from Analogue Product Manufacturing Test Responses

Parallel VLSI CAD Algorithms. Lecture 1 Introduction Zhuo Feng

Plan of Class 4. Radial Basis Functions with moving centers. Projection Pursuit Regression and ridge. Principal Component Analysis: basic ideas

Statistical Modeling for the Minimum Standby Supply Voltage of a Full SRAM Array

Pre and post-silicon techniques to deal with large-scale process variations

Fast Buffer Insertion Considering Process Variation

Impact of parametric mismatch and fluctuations on performance and yield of deep-submicron CMOS technologies. Philips Research, The Netherlands

Advanced Computational Methods for VLSI Systems. Lecture 5 Statistical VLSI modeling and analysis. Zhuo Feng

in a Rao-Blackwellised Unscented Kalman Filter

OPERA: OPtimization with Ellipsoidal uncertainty for Robust Analog IC design

Oil Field Production using Machine Learning. CS 229 Project Report

CS168: The Modern Algorithmic Toolbox Lecture #6: Regularization

Design for Manufacturability and Power Estimation. Physical issues verification (DSM)

Hyperbolic Polynomial Chaos Expansion (HPCE) and its Application to Statistical Analysis of Nonlinear Circuits

CHAPTER 2 AN OVERVIEW OF TCAD SIMULATOR AND SIMULATION METHODOLOGY

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Analysis for Dynamic of Analog Circuits by using HSPN

EECS150 - Digital Design Lecture 26 - Faults and Error Correction. Types of Faults in Digital Designs

488 Solutions to the XOR Problem

Incremental Latin Hypercube Sampling for Lifetime Stochastic Behavioral Modeling of Analog Circuits

Response Surface Methodology

Mustafa H. Tongarlak Bruce E. Ankenman Barry L. Nelson

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

Response surface methodology: advantages and challenges

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 35, NO. 12, DECEMBER

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Integer Least Squares: Sphere Decoding and the LLL Algorithm

CHAPTER 6 A STUDY ON DISC BRAKE SQUEAL USING DESIGN OF EXPERIMENTS

EE5900 Spring Lecture 5 IC interconnect model order reduction Zhuo Feng

A Geometric Programming-based Worst-Case Gate Sizing Method Incorporating Spatial Correlation

Unit 1A: Computational Complexity

MACHINE LEARNING. Methods for feature extraction and reduction of dimensionality: Probabilistic PCA and kernel PCA

Advanced Computational Methods for VLSI Systems. Lecture 4 RF Circuit Simulation Methods. Zhuo Feng

Arizona State University, Tempe, AZ 85287, USA 2 Department of Electrical Engineering. Arizona State University, Tempe, AZ 85287, USA ABSTRACT

CS281 Section 4: Factor Analysis and PCA

SVD, PCA & Preprocessing

Lecture 15: Scaling & Economics

From Blind Certainty to Informed Uncertainty

Thermal-reliable 3D Clock-tree Synthesis Considering Nonlinear Electrical-thermal-coupled TSV Model

NANO-CMOS DESIGN FOR MANUFACTURABILILTY

Computing Clock Skew Schedules Under Normal Process Variation

Post-Silicon Timing Characterization by Compressed Sensing

Statistical Analysis Techniques for Logic and Memory Circuits

Problems. Looks for literal term matches. Problems:

Technology Mapping for Reliability Enhancement in Logic Synthesis

Worst-Case Analysis and Optimization of VLSI. Abstract

Digital Predistortion Using Machine Learning Algorithms

Incremental Latin Hypercube Sampling

Automatic Differentiation Equipped Variable Elimination for Sensitivity Analysis on Probabilistic Inference Queries

Principal Component Analysis -- PCA (also called Karhunen-Loeve transformation)

Nonlinear Support Vector Machines through Iterative Majorization and I-Splines

A Low Power Bandgap Voltage Reference with Nonlinear Voltage Curvature Compensation

Behavioral Modeling for Analog System-Level Simulation by Wavelet Collocation Method

ON REPLICATION IN DESIGN OF EXPERIMENTS

Representative Path Selection for Post-Silicon Timing Prediction Under Variability

A Hierarchical Analysis Methodology for Chip-Level Power Delivery with Realizable Model Reduction

Problems in VLSI design

Chapter 2 Review of Linear and Nonlinear Controller Designs

Nonparametric Bayesian Methods (Gaussian Processes)

9. Two-Degrees-of-Freedom Design

Dept. Information Systems Engineering, Osaka Univ., Japan

EE115C Winter 2017 Digital Electronic Circuits. Lecture 6: Power Consumption

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006

Modelling Multivariate Data by Neuro-Fuzzy Systems

Closed Form Expressions for Delay to Ramp Inputs for On-Chip VLSI RC Interconnect

Statistical Timing Analysis with Path Reconvergence and Spatial Correlations

Reducing power in using different technologies using FSM architecture

RESPONSE SURFACE METHODS FOR STOCHASTIC STRUCTURAL OPTIMIZATION

Methodology to Achieve Higher Tolerance to Delay Variations in Synchronous Circuits

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 25, NO. 7, JULY

Stochastic Analogues to Deterministic Optimizers

Toward More Accurate Scaling Estimates of CMOS Circuits from 180 nm to 22 nm

Use of Design Sensitivity Information in Response Surface and Kriging Metamodels

Transcription:

9th International Symposium on Quality Electronic Design Projection-Based Piecewise-Linear Response Surface Modeling for Strongly Nonlinear VLSI Performance Variations Xin Li Department of Electrical and Computer Engineering Carnegie Mellon University, Pittsburgh, PA 3 xinli@ece.cmu.edu Yu Cao Department of Electrical Engineering Arizona State University, empe, AZ 887 ycao@asu.edu ABSRAC Large-scale process fluctuations (particularly random device mismatches) at nanoscale technologies bring about highdimensional strongly nonlinear performance variations that cannot be accurately captured by linear or quadratic response surface models. In this paper, we propose a novel projection-based piecewise linear modeling technique, PM, to address such a modeling challenge with affordable computational cost. PM borrows the projection pursuit idea from mathematics to convert a high-dimensional modeling problem to a low-dimensional one. In addition, a new piecewise-linear model template is proposed and tuned for strongly nonlinear performance variations. By exploiting the unique piecewise-linear nature of the model template, a robust numerical algorithm is further developed to determine all model coefficients by solving a sequence of over-determined linear equations. Several circuit examples designed in a commercial 6nm CMOS process demonstrate that compared with the traditional quadratic modeling, PM achieves x error reduction with negligible computational overhead.. INRODUCION As IC technologies are scaled to nanoscale regime, it becomes increasingly difficult to control the variations in manufacturing process []-[]. Process variations can be classified into two broad categories: inter-die variations and intra-die variations. Inter-die variations model the common/average variations across the die, while intra-die variations model the individual, but spatially correlated, local variations (e.g., random device mismatches) within the same die. Both inter-die and intra-die variations introduce substantial uncertainties in circuit performance and significantly impact parametric yield. Hence, accurately modeling and analyzing process variations to ensure manufacturability has been identified as a top priority for today s IC design. o address this issue, response surface modeling has been widely applied to solve various statistical circuit analysis problems [], [3]-[7], []. he objective of response surface modeling is to approximate the circuit performance (e.g., delay, gain) as an analytical function of process parameters (e.g., V H, OX ). Most existing response surface models are either linear or quadratic, assuming that the approximated performance functions are weakly nonlinear. However, two recent changes in advanced IC technologies suggest a need to revisit this assumption. Firstly, among all sources of variations, random mismatches become dominant at 4nm process and beyond []. As a result, any two transistors on the same die can have significantly different electrical performance (e.g., mobility, V H, etc.). o accurately model this effect, a large number of random variables must be utilized, rendering a high-dimensional variation space. Even for a small-size circuit block (e.g., an analog amplifier), the total number of random process parameters can easily reach ~ [6], [8]. Secondly, process variations become relatively larger, as IC technologies are scaled to finer feature size. For example, the 3- sigma V H variation is expected to reach 3% in 8 and it continuously increases in future technology generations []. Such large-scale variations yield strongly nonlinear performance variations that cannot be accurately captured by linear or quadratic models. his nonlinearity issue is especially critical for analog and mixed-signal circuits. As will be demonstrated by the numerical examples in Section, the error of a quadratic model can reach 3.6% for a commercial 6nm SRAM cell. o improve accuracy, high-order (e.g., cubic) polynomial models can be used. Directly applying existing response surface modeling techniques to high order, however, results in expensive computational cost. For example, if the total number of random process parameters reaches [6], [8], a cubic polynomial will contain 76,8 unknown model coefficients! he authors of [8] propose a new projection-based nonlinear modeling technique based on neural network. While it has been successfully applied to various circuit problems, a neural network must be trained by nonlinear optimization where global convergence is difficult to achieve. In other words, the quality of the extracted model heavily depends on the initial guess that is provided to the optimizer. he challenging problem here is how to solve such a high-dimensional strongly nonlinear modeling problem both robustly and efficiently. In this paper, we propose a novel projection-based piecewiselinear modeling (PM) algorithm that is especially tuned for highdimensional strongly nonlinear fitting problems. PM borrows the projection pursuit idea that was initially developed by mathematicians in 98s [9]. It converts a high-dimensional modeling problem to a low-dimensional problem that is easy to solve. In addition to dimension reduction, we propose to capture strongly nonlinear performance variations by piecewise-linear functions. Compared with the traditional quadratic response surface modeling, the proposed PM approach reduces modeling error by x without substantially increasing computational cost, as will be demonstrated by the numerical examples in Section. An important contribution of this paper is to propose a robust numerical algorithm to determine all unknown model coefficients. Our proposed algorithm recursively approximates a highdimensional performance function by a number of onedimensional piecewise-linear functions. Furthermore, by exploiting the unique piecewise-linear nature of the model template, PM formulates the modeling problem in a special form for which all model coefficients can be solved from a sequence of over-determined linear equations. herefore, unlike the existing optimization-based techniques that suffer from several numerical issues such as local convergence, our proposed PM approach offers robust convergence with low computational cost. he remainder of this paper is organized as follows. In Section, we review the background on response surface modeling and projection pursuit. hen, we propose our PM approach in Section 3 and describe the numerical algorithms in Section 4. he efficacy of PM is demonstrated by several -769-37-/8 $. 8 IEEE DOI.9/ISQED.8.8 8

numerical examples in Section, followed by the conclusions in Section 6.. BACGROUND. Response Surface Modeling Given a circuit design, the circuit performance (e.g., delay, gain) is a function of process parameters (e.g., V H, OX ). hese process parameters must be modeled as random variables to account for uncertain manufacturing fluctuations. A circuit performance f can be approximated as a linear response surface model of process parameters [], []: f ( X ) = B X + C () where X = [x x... x N ] represents the random variables to model process variations, B R N and C R stand for the model coefficients, and N is the total number of random variables. he linear approximation in () is efficient and accurate when process variations are sufficiently small. As manufacturing variations become relatively large in nanoscale technologies, quadratic response surface models are required to improve modeling accuracy [4], [6], []: f ( X ) = X AX + B X + C () where C R is the constant term, B R N contains the linear coefficients, and A R N N contains the quadratic coefficients. he unknown model coefficients in () and () can be determined by solving the over-determined linear equations at a number of sampling points []: ~ 3 B X i + C = fi ( i =,,, S) (3) ~ 4 X i AX i + B X i + C = fi ( i =,,, S ) (4) where X i and f i are the value of X and the exact value of f for the i- th sampling point respectively, and S is the total number of sampling points. Even if quadratic response surface models are utilized, however, large modeling error can still be observed in some cases [8]. For instance, as will be demonstrated by the numerical examples in Section, the quadratic modeling error can reach 3.6% for an SRAM cell designed in a commercial 6nm CMOS process. It, in turn, motivates us to develop a new piecewise-linear modeling technique to accurately capture strongly nonlinear performance variations.. Projection Pursuit One major technical difficulty of fitting high-dimensional nonlinear response surface models stems from the large number of unknown model coefficients. o address this issue, projection pursuit was proposed by mathematicians in 98s [9] and it has been recently applied to several circuit modeling problems [6]-[8]. he key idea of projection pursuit is to approximate a highdimensional nonlinear function by the sum of several lowdimensional functions. In particular, a one-dimensional projection has the form of [9]: f ( X ) = g ( P X ) + g( P X ) + + g ( P X ) () where f(x) is the approximated high-dimensional nonlinear function, {g i ( ); i =,,...,} contains one-dimensional nonlinear functions, {P i R N ; i =,,...,} defines onedimensional projection vectors, and is referred to as the rank of the model. was developed in [6] to handle the special case where all nonlinear functions {g i ( ); i =,,...,} in () are quadratic. A quadratic function defined in () can be re-written as [6]: N = λ i i + (6) 6 f ( X ) ( Q X ) + B X C i= where λ i and Q i R N are the i-th dominant eigenvalue and eigenvector of the quadratic coefficient matrix A, respectively. In this case, the optimal projection vectors are determined by the eigenvectors and they can be extracted by the implicit power iteration algorithm proposed in [6]. he idea of projection pursuit has been further applied to strongly nonlinear circuit problems in [8]. he SiLVR algorithm developed in [8] determines the optimal projection vectors by nonlinear optimization. Such an optimization-based approach, however, suffers from several numerical issues such as local convergence. In this paper, we propose a new projection-based piecewise linear modeling algorithm, PM, that aims to robustly solve the high-dimensional nonlinear modeling problem. By exploiting the unique piecewise-linear nature of the model template, PM determines all unknown model coefficients by solving a sequence of over-determined linear equations, thereby offering low computational complexity and robust convergence. 3. PIECEWISE-LINEAR MODELING Our proposed PM approach utilizes one-dimensional piecewise-linear functions to approximate {g i ( ); i =,,...,} in (). Such a piecewise-linear model template allows us to approximate strongly nonlinear performance functions both accurately and efficiently, which is one of the major advantages of the PM method. A one-dimensional M-segment piecewise-linear function g i (P i X) is uniquely specified by the projection vector P i and the M+ grid points {(α j, β j ); j =,,...,M}: β j β 7 gi ( Pi X ) = β + ( Pi X α ). (7) α α where j 8 α j P i X α j. (8) In other words, the value of g i (P i X) is determined by the linear interpolation of the M+ grid points. Fig shows a simple piecewise-linear function example with four segments. g i (P i X) β β β β 3 α α α α 3 α 4 P i X Fig. A one-dimensional 4-segment piecewise-linear function. Given the one-dimensional M-segment piecewise linear functions {g i ( ); i =,,...,}, the nonlinear model f(x) in () is the sum of all {g i ( ); i =,,...,}. heoretically, this is equivalent to partitioning the variation space X into M polytopes: α, j P X j ( j,,, M ) α, = α, j P X, j ( j,,, M ) α = 9 (9) α P X α j =,,, M, j, j β 4 ( ) and approximating f(x) as a linear function within each polytope. 9

Fig shows a two-dimensional space that is partitioned by the projection vectors P = [ ] and P = [ ]. Note that the polytopes in Fig are not rectangular, because P and P are not orthogonal. x P = [ ] P = [ ] x Fig. Partitions of a two-dimensional space [x x ]. he proposed piecewise-linear model f(x) defined in (), (7)- (8) has two important properties: Continuous: Since all {g i ( ); i =,,...,} are continuous, it is straightforward to verify that f(x) = g ( ) + g ( ) +... + g ( ) is also continuous. Low-rank: For many high-dimensional problems, the rank is substantially smaller than the variation space dimension N. In other words, the performance of interest is only affected by several dominant directions of process variations and the variations on other non-dominant directions can be ignored [6]-[8]. his rank-deficient property allows us to accurately extract a compact low-rank model with low computational cost. o determine a rank- piecewise-linear model f(x), we must determine the projection vectors {P i ; i =,,...,} in () and the grid points {(α j, β j ); i =,,...,, j =,,...,M} in (7)-(8). In what follows, we propose an efficient numerical algorithm to solve these unknown model coefficients. 4. IMPLEMENAION OF PM he proposed PM algorithm is facilitated by two key techniques, including: () a nonlinear sensitivity analysis to determine the projection vectors; and () an iterative algorithm to decompose a rank- modeling problem into multiple rank-one problems. In this section, we first develop the numerical algorithm for rank-one PM approximation, and then extend it to rank- approximation. 4. Rank-One Approximation Given a rank-one M-segment PM model, the unknown model coefficients include the projection vector P and the M+ grid points {(α,j, β,j ); j =,,...,M}. Ideally, P and {(α,j, β,j ); j =,,...,M} should be optimized concurrently to extract the optimal model. However, such a co-optimization can be computationally expensive and does not guarantee global convergence. For this reason, we propose a heuristic algorithm to decompose the modeling process into two separate steps. We first apply a nonlinear sensitivity analysis to find the projection vector P and then perform a least-square fitting to determine the grid points {(α,j, β,j ); j =,,...,M}. Our numerical examples in Section demonstrate that the proposed heuristic algorithm yields excellent results for most circuit problems. o determine the projection vector P, we need to find out the dominant direction along which the performance function f(x) varies significantly. If f(x) is linear, the projection vector P is determined by linear sensitivities and it can be easily found by fitting a linear response surface model. However, such a linearsensitivity-based approach does not work well if f(x) is strongly nonlinear. Motivated by this observation, we propose to borrow the algorithm [6] to fit a rank-one quadratic model from which the projection directions for both linear and quadratic terms are determined. Such an approach is referred to as nonlinear sensitivity analysis in this paper. A rank-one model contains the constant term, the linear term, and the first dominant quadratic term [6]: f ( X ) = λ ( Q X ) + B X + C. () From (), we obtain two projection directions: B for the linear term and Q for the quadratic term. For rank-one PM approximation, we need to select one of them as the dominant projection vector P. oward this goal, we define the following expected energies and use them as a criterion to compare the significance of the linear and quadratic terms: Energy Linear = E ( B X ) = E[ f L ] () 4 4 EnergyQuadratic = E λ ( Q X ) = E[ λ fq ] () where E( ) denotes the expected value [4] and 3 f L = B X (3) Q 4 f = Q X. (4) o compute the statistical measures in ()-(), we assume that all random variables X = [x x... x N ] are mutually independent and standard Normal (i.e., zero mean and unit variance). If {x i ; i =,,...,N} are correlated Normal distributions, they can be converted into independent Normal distributions by principal component analysis (PCA) []. Given the definitions in (3)-(4), both f L and f Q are Normal, since they are the linear combinations of multiple Normal distributions. In addition, the first-order and second-order moments of f L and f Q can be determined by []: E[ f ] = and E[ f ] B () L L = 6 E[ f ] and E[ f ] Q = (6) Q Q = where denotes the -norm of a vector. Substituting () into () yields the expected energy for the linear term B X: 7 [ ] Energy Linear = E f L = B. (7) he expected energy for the quadratic term λ (Q X) is determined by the eigenvalue λ and the fourth order moment of f Q [4]: 4 4 8 Energy = E[ f ] = λ Quadratic λ. (8) Q 3 Q Based on (7) and (8), we make P equal the linear projection direction B (or the quadratic projection direction Q ) if the expected linear energy Energy Linear is greater (or smaller) than the expected quadratic energy Energy Quadratic. his heuristic rule is summarized by the following equation: 4 B if B 3λ Q 9 P = 4 Q ( if B < 3λ Q ). (9) Next, given the projection vector P, we need to determine the grid points {(α,j, β,j ); j =,,...,M}. We evenly partition the axis P X into M segments, resulting in M+ equally-spaced grid

points {α,j ; j =,,...,M}, as shown in Fig. hen, using a set of sampling points, we list the following linear equations for {β,j ; j =,,...,M}: β, j β, ~ β, + ( P X i α, ) = fi α α () ( α P X < α i =,,, S),, j, i, j where X i and f i are the value of X and the exact value of f for the i- th sampling point respectively, and S is the total number of sampling points. Solving the over-determined linear equations in () yields the optimal values of {β,j ; j =,,...,M}. It, in turn, determines the approximated one-dimensional piecewise-linear model. Algorithm : Rank-one piecewise-linear approximation. Start from a set of sampling points {(X i, f ĩ ); i =,,...,S}.. Fit the rank-one quadratic model in () using [6]. 3. Calculate Energy Linear and Energy Quadratic using (7)-(8). 4. Determine the projection vector P using (9).. Evenly partition the axis P X into M segments, resulting in M+ equally-spaced grid points {α,j ; j =,,...,M}. 6. Solve the linear equations () for {β,j ; j =,,...,M}. 7. he rank-one piecewise-linear model is determined by substituting the solved model coefficients P and {(α,j, β,j ); j =,,...,M} into (7)-(8). Algorithm summarizes the major steps of the proposed rankone piecewise-linear modeling. Note that both the algorithm in Step and the piecewise-linear fitting in Step 6 only require solving a sequence of over-determined linear equations. No further nonlinear optimization is involved in Algorithm. herefore, the proposed piecewise-linear modeling algorithm completely eliminates the local convergence issue incurred by nonlinear optimization. 4. Rank- Approximation Algorithm : Rank- piecewise-linear approximation. Start from a set of sampling points {(X i, f ĩ ); i =,,...,S}.. For k =,,..., 3. Apply Algorithm to the sampling points {(X i, f i); i =,,...,S} and extract the rank-one model g k (X). 4. Update the sampling points: ~ ~ fi = fi gk ( Xi ) ( i =,,, S). (). End For 6. he rank- piecewise-linear model is: f ( X ) = g ( X ) + g ( X ) + + g ( X ). () Algorithm shows the proposed PM algorithm for rank- piecewise-linear approximation. Starting from a set of sampling points, PM first extracts a rank-one piecewise-linear model g (X). hen, the sampling points are updated in () to calculate the residue which is further approximated as a new rank-one piecewise-linear model in the next iteration. he rank-one piecewise-linear fitting and the residue update are repeatedly applied for times until the rank- model f (X) in () is achieved. Algorithm assumes a given approximation rank. In practical applications, the value of can be iteratively determined based on the approximation error. For example, starting from a low-rank approximation, should be iteratively increased if the modeling error remains large.. NUMERICAL EXAMPLES In this section we demonstrate the efficacy of PM using two circuit examples. For each example, two independent sampling sets, called training set and testing set respectively, are generated. he training set contains sampling points that are created by Latin hypercube sampling []; it is used for coefficient fitting. For testing and comparison, we collect random samples as the testing set and use them to measure the modeling error. All numerical experiments are performed on a.8ghz Linux server.. SRAM Cell Fig 3. Circuit schematic of a 6 SRAM cell. Fig 3 shows the circuit schematic of a 6 SRAM cell designed in a commercial 6nm CMOS process. We consider three important performance metrics for this SRAM cell: static noise margin (SNM), read margin (RM) and write margin (WM). hese performance metrics are functions of both inter-die variations and device mismatches. he probability distribution and the correlation information of all variations are specified in the process design kit provided by the foundry. We first apply fractional factorial experiment [6] to identify a subset of important process parameters that have significant influence on the performances of interest. After such a variable screening, 3 random variables are left to model process variations in this example. SNM Error (%) RM Error (%) WM Error (%) 4 6 8 4 6 8 8 6 4 4 6 8 Approximation Rank () PM PM PM Fig 4. and PM modeling error of SRAM cell. We create performance models over the -sigma variation range using two different techniques: the traditional quadratic modeling ( [6]) and the proposed piecewise-linear modeling (PM). Fig 4 compares the modeling error for and PM. As shown in Fig 4, the error of both and PM decreases, as the approximation rank increases. However, after

6, further increases in do not have a significant impact on reducing the error. It, in turn, implies that selecting the rank = 6, instead of the full rank = 3, is sufficient in this example. Studying Fig 4, one would notice that the proposed PM is substantially more accurate than when modeling the static noise margin and the read margin in this example. For instance, the modeling error of read margin is reduced by.x, from 3.6% () to 6.6% (PM). It should be noted that as IC technologies are scaled to finer feature sizes, process variations are expected to become increasingly larger. It, in turn, would make the performance nonlinearities even more pronounced. o intuitively understand the strongly nonlinear performance variations, we plot all training samples of read margin over the first dominant projection direction P X, as shown in Fig. Note that the strongly nonlinear relation between read margin and process variations cannot be predicted by a simple linear model. Namely, the first dominant projection vector P of read margin cannot be extracted by a simple linear sensitivity analysis. his observation demonstrates the importance of the nonlinear sensitivity analysis proposed in Section 4.. On the other hand, although applying quadratic sensitivity analysis yields the correct projection vector P, a simple quadratic model fails to accurately approximate the performance function. In this example, the proposed piecewise-linear model is required to achieve sufficiently small modeling error. RM Value (Normalized).8.6.4. -. -.4 Samples PWL -.6 -. - -... P X (Normalized) Fig. Sampling points of read margin over the first dominant projection direction P X. able. Computational cost of and PM for SRAM cell performance modeling Rank Simulation Fitting Cost (Sec.) otal Cost (Sec.) Cost (Sec.) PM PM.3.4 98.3 99.4 7.63 3. 987.63 993. 3.74 6.79 99.74 996.79 4.33.69 99.33.69 9.33 8.8 999.33 8.8 98 6 3. 37.46 3. 7.46 7 6.93 47.6 6.93 7.6 8 9.4 7.48 9.4 37.48 9 3.9 67.97.9 47.97 34.89 8.4 4.89 6.4 able compares the computational cost for and PM. he overall computational cost consists of two portions: simulation cost and fitting cost. he simulation cost is the computational time to run a numerical simulator (e.g., SPICE) to generate a number of sampling points. he fitting cost is the computational time to solve all unknown model coefficients from a sequence of over-determined linear equations. Studying able, one would find that PM has higher fitting cost than. However, since the simulation cost is dominant, the overall computational overhead of PM is negligible (within %) in this example.. Operational Amplifier Fig 6. Circuit schematic of a two-stage operational amplifier. V OS Error (%) Gain Error (%) BW Error (%) PM Error (%) PSRR Error (%) Power Error (%) 4 3 4 6 8 4 6 8 4 6 8 4 6 8 8 6 4 6 8 3 4 6 8 Approximation Rank () PM PM PM PM PM PM Fig 7. and PM modeling error of operational amplifier. Shown in Fig 6 is the circuit schematic of a two-stage operational amplifier designed in a commercial 6nm CMOS process. We consider six performance metrics in this example:

offset voltage (VOS), gain, bandwidth (BW), phase margin (PM), power supply rejection ratio (PSRR), and power. hese performance metrics depend on both inter-die variations and device mismatches. he probability distribution and the correlation information of all variations are specified in the process design kit provided by the foundry. Similar to the SRAM cell example, we first apply fractional factorial experiment [6] to identify a subset of important process variations. After such a variable screening, 47 random variables are left to model process variations in this example. We create performance models over the 4-sigma variation range using two different techniques: the traditional quadratic modeling ( [6]) and the proposed piecewise-linear modeling (PM). Fig 7 compares the modeling error for and PM. wo important observations can be made from Fig 7. Firstly, for both and PM, selecting the rank = 6, instead of the full rank = 47, is sufficient in this example. Secondly, compared with, PM achieves significant error reduction for most performance metrics. aking bandwidth (BW) as an example, the error is 4.%, while the PM error is 6.6% (.x difference). able. Computational cost of and PM for operational amplifier performance modeling Rank Simulation Fitting Cost (Sec.) otal Cost (Sec.) Cost (Sec.) PM PM.4 3.3 777.4 7783.3 33.48 3.3 7793.48 779.3 3 4.4 46.86 784.4 786.86 4 68.43 6.3 788.43 78.3 86.96 8.84 7846.96 784.84 776 6 4.97 4.4 7864.97 7864.4 7.8 4.8 788.8 7884.8 8 4.6 4.4 79.6 79.4 9 9.48 84.8 799.48 7944.8 86. 37.6 7946. 7997.6 able shows the computational cost for and PM. In this example, PM has slightly higher fitting cost than. However, since the simulation cost is dominant, the overall computational overhead of PM is negligible (within %) in this example. 6. CONCLUSIONS In this paper, we propose a novel projection-based piecewiselinear modeling (PM) algorithm to capture high-dimensional strongly nonlinear performance variations that are observed in nanoscale technologies. PM borrows the projection pursuit idea from mathematics to achieve dimension reduction. As such, a high-dimensional modeling problem can be converted to a lowdimensional problem that is tractable. In addition, piecewiselinear functions are utilized by PM to model strongly nonlinear performance variations. By exploiting the unique piecewise-linear nature of the model template, a robust numerical algorithm is proposed to determine all model coefficients by solving a sequence of over-determined linear equations. Our numerical examples demonstrate that compared with the traditional quadratic modeling, PM achieves x error reduction without substantially increasing the computational complexity. he response surface models created by PM can be further incorporated into a statistical analysis/optimization environment for accurate and efficient parametric yield analysis/optimization. 7. ACNOWLEDGEMEN his work has been supported in part by the Semiconductor Research Corporation (SRC) and the National Science Foundation (NSF). 8. REFERENCES [] S. Nassif, Modeling and analysis of manufacturing variations, IEEE CICC, pp. 3-8,. [] Semiconductor Industry Associate, International echnology Roadmap for Semiconductors,. [3] Z. Wang and S. Director, An efficient yield optimization method using a two step linear approximation of circuit performance, IEEE EDAC, pp. 67-7, 994. [4] A. Dharchoudhury and S. ang, Worse-case analysis and optimization of VLSI circuit performance, IEEE rans. CAD, vol. 4, no. 4, pp. 48-49, Apr. 99. [] F. Schenkel, M. Pronath, S. Zizala, R. Schwencker, H. Graeb and. Antreich, Mismatch analysis and direct yield optimization by spec-wise linearization and feasibilityguided search, IEEE DAC, pp. 88-863,. [6] X. L J. Le, L. Pileggi and A. Strojwas, Projection-based performance modeling for inter/intra-die variations, IEEE ICCAD, pp. 7-77,. [7] Z. Feng and P. L Performance-oriented statistical parameter reduction of parameterized systems via reduced rank regression, IEEE ICCAD, pp. 868-87, 6. [8] A. Singhee and R. Rutenbar, Beyond low-order statistical response surfaces: latent variable regression for efficient, highly nonlinear fitting, IEEE DAC, pp. 6-6, 7. [9] J. Friedman and W. Stuetzle, Projection pursuit regression, Journal of the American Statistical Association, vol. 76, no. 376, pp. 87-83, 98. [] X. L J. Le and L. Pilegg Projection-based statistical analysis of full-chip leakage power with non-log-normal distributions, IEEE DAC, pp. 3-8, 6. [] M. Mckay, R. Beckman and W. Conover, A comparison of three methods for selecting values of input variables in the analysis of output from a computer code, echnometrics, vol., no., pp. 39-4, May. 979. [] G. Seber, Multivariate Observations, Wiley Series, 984. [3] G. Golub and C. Loan, Matrix Computations, he Johns Hopkins Univ. Press, 996. [4] A. Papoulis and S. Pilla Probability, Random Variables and Stochastic Processes, McGraw-Hill,. [] R. Myers and D. Montgomery, Response Surface Methodology: Process and Product Optimization Using Designed Experiments, Wiley-Interscience,. [6] D. Montgomery, Design and Analysis of Experiments, John Wiley & Sons,. 3