On Potential Design Impacts of Electromigration Awareness Andrew B. Kahng, Siddhartha Nath and Tajana S. Rosing VLSI CAD LABORATORY, UC San Diego UC San Diego / VLSI CAD Laboratory -1-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -2-
Electromigration in Interconnects Electromigration (EM) is the gradual displacement of metal atoms in an interconnect I avg causes DC EM and affects power delivery networks I rms causes AC EM and affects clock and logic signals -3-
EM Lifetime EM degrades interconnect lifetime Black s Equation calculates lifetime of interconnect segment due to EM degradation t 50 = A J n ee a kt t 50 median time to failure (= log e 2 x MTTF) A* geometry-dependent constant J current density in interconnect segment n constant ( = 2) E a activation energy of metal atoms k Boltzmann s constant T temperature of the interconnect -4-
Parameters Affecting EM MTTF W wire Driver size Fanout J rms V dd MTTF Freq α Temp A A B Inverse relation; if A increases then B decreases B Direct relation; if A increases then B increases Design parameters Runtime parameters -5-
Current Density (MA/cm 2 ) Current Density (MA/cm 2 ) Why Is EM Important Now? ITRS 2011 data shows that EM will be a significant reliability issue Physical 3 design teams trade off 2.5 performance and/or resources to meet EM MTTF 2 2.5 1.5 What values of MTTF do we really need? 2 1 In the US, people replace MTTF 1.5 0.5 10 Cell phones every 2 years 2004 2008 2012 2016 2020 0.5 Laptops every 3 5 years Year 0 Servers every 3 7 years 10 9 8 7 6 5 4 3 2 1 Devices can be designed with small EM lifetimes MTTF (years) J rms -6-
Examples of EM Guardband W wire Driver size Fanout J rms V dd MTTF Freq α Temp To meet EM MTTF margin at given wire width upper bound Reduce J rms reduce driver size slower circuit To meet EM MTTF margin at given performance requirement Increase W wire increase capacitance, dynamic power -7-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -8-
To Meet EM Lifetime Requirements Three major categories of prior work EM MTTF modeling Black69 (Black s Equation) Liew89 (AC lifetime models) Lu07 and Wu12 (Joule heating) Architecture changes to mitigate EM Srinivasan04 (RAMP) Romanescu08 (core cannibalization) Synthesis and physical design (PD) techniques to reduce current density violations Dasgupta96 (limit J rms violation at synthesis) Jerke04 (limit J rms violation at PD) Lienig03 (post-route J rms fixes) -9-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -10-
Key Idea We quantify impact of EM guardband on performance (F max ), area and power [Black s Equation] MTTF = A WH 2 e E a kt I rms 2 (I rms,limit ) 2 = (I rms,default ) 2 x MTTF default /MTTF reduced F max Area / Power Decrease MTTF reduced increase I rms,limit We study impacts on F max, area and power -11-
Approach We conduct two studies 1. MTTF vs. F max tradeoffs with fixed resource budget 2. MTTF vs. resources tradeoffs with fixed performance requirement Assumptions 10 years = example default EM MTTF Six testcases Report three representative (AES, DMA, JPEG) -12-
Key Contributions We are the first to quantify impacts of EM guardband on performance and resources by using PD flows We introduce EM slack as an accurate measure of potential performance improvements in different circuits at reduced MTTF requirements Black s Equation cannot accurately quantify the impacts of EM-awareness in circuits We study how tightness vs. looseness of timing constraints determine area and power trends at reduced MTTF Our study flow/methodology can potentially be used by architects and front-end designers to improve performance at no area cost physical designers whose levers are conventional SI and EM fixing methods -13-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -14-
EM Slack When EM violations occur I rms,net = C load V dd α F max 1 + 1 t rise t fall > I rms,limit Black s Equation MTTF = A WH 2 I rms 2 e E a kt Theoretical limit of I rms,net I rms,net I rms,limit MTTF default MTTFreduced Basic Concept: EM slack of a net (units: ma) EM slack,net = I rms,net I rms,limit I rms,limit MTTF default MTTF reduced 1-15-
Significance of EM Slack Positive EM slack potential for improved F max If EM slack > 0, a part of it can be used to increase I rms,limit by reducing MTTF (from Black s Equation), and improve F max by using SP&R knobs (e.g., gate sizing) without causing EM violations I rms,net = C load V dd α F max 1 + 1 t rise t fall > I rms,limit -16-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -17-
Study 1: MTTF vs. F max Study MTTF vs. F max tradeoffs given upper bounds on area, temperature and #EM violations Setup Three testcases: AES, DMA and JPEG Two technology libraries: TSMC 45GS and 65GPLUS Upper bounds temperature = 378 K area = 66% utilization #EM violations = 25 Synopsys DesignCompiler and Cadence SOC Encounter flows Thermal analysis using Hotspot -18-
Automated Flow to Determine F max RTL LIB SDC Synthesis (DC) NETLIST MTTF reduced AR Tech. LEF α PI LEF Place and Route (SOCE) Check constraints UTIL tar -- timing slack -- peak temperature -- UTIL eff -- #EM violations Met all constraints? Y Increase frequency N Decrease frequency Frequency tried before? N Y EXIT -19-
Automated Flow to Determine F max RTL LIB SDC Synthesis (DC) NETLIST MTTF reduced AR Tech. LEF α PI LEF Place and Route (SOCE) Check constraints UTIL tar -- timing slack -- peak temperature -- UTIL eff -- #EM violations Met all constraints? Y Increase frequency N Decrease frequency Frequency tried before? N Y EXIT -20-
Derating LEF We derate current density limits in technology Library Exchange Format (LEF) file Reduced lifetime (ratio > 1) increases J rms limit Reduced toggle rate (ratio > 1) increases J rms limit Increased maximum temperature increases J rms limit -21-
Automated Flow to Determine F max RTL LIB SDC Synthesis (DC) NETLIST MTTF reduced AR Tech. LEF α PI LEF Place and Route (SOCE) Check constraints UTIL tar -- timing slack -- peak temperature -- UTIL eff -- #EM violations Met all constraints? Y Increase frequency N Decrease frequency Frequency tried before? N Y EXIT -22-
Binary Search for F max Increase frequency by step until some constraint is violated Perform binary search between the current F and the last feasible F to find F max -23-
Automated Flow to Determine F max RTL LIB SDC Synthesis (DC) NETLIST MTTF reduced AR Tech. LEF α PI LEF Place and Route (SOCE) Check constraints UTIL tar -- timing slack -- peak temperature -- UTIL eff -- #EM violations Met all constraints? Y Increase frequency N Decrease frequency Frequency tried before? N Y EXIT -24-
Flow to Fix EM Violations Netlist from F max determination flow Group nets depending on the extent of I rms,limit violations Y return N N Timing met? Create nondefault rules (NDR) for each net group Perform timing analysis Decrease fanout Perform ECO route All EM violations fixed? N Downsize drivers Y return Y -25-
Automated Flow to Determine F max RTL LIB SDC Synthesis (DC) NETLIST MTTF reduced AR Tech. LEF α PI LEF Place and Route (SOCE) Check constraints UTIL tar -- timing slack -- peak temperature -- UTIL eff -- #EM violations Met all constraints? Y Increase frequency N Decrease frequency Frequency tried before? N Y EXIT -26-
% Increase in Performance Observation 1 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% DMA AES JPEG 45nm 10 9 8 7 6 5 4 3 2 1 F max scaling is not uniform across designs and at reduced MTTF as suggested by Black s Equation F max scaling is determined by the EM slack in each design at each MTTF requirement Large F max improvements may be setup artifacts -27-
% of EM violations % EM slack Observation 2 90% 120% 80% 100% 70% 60% 80% 50% 60% 40% 30% 40% 20% 20% 10% DMA AES JPEG DMA AES JPEG EM slack (not timing slack) limits performance scaling 0% 10 10 9 9due 8 8 to 7 7AC 6 EM 6 5 54 43 23 12 1 EM slack determines F max at fixed resources % of positive EM slack is usable to improve F max by reducing MTTF requirement EM violations in critical paths lead to positive EM slack -28-
% of target utilization EM AREA TEMP Observation 3 105% DMA AES JPEG 100% 95% 90% 85% 80% 10 9 8 7 6 5 4 3 2 1 Area and temperature can be dominating constraints 75% at lower MTTF requirements 10 9 8 7 6 5 4 3 2 1 Area limits F max scaling for MTTF 7 years (DMA) Area upper bounds are violated for MTTF 6 years; Temperature upper bounds are violated for MTTF 3 years -29-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -30-
Study 2: MTTF vs. Area, Power Study MTTF vs. area and power tradeoffs at a fixed performance requirement Setup DMA at 2000 MHz (2ps slack after SP&R at 45nm) AES at 1100 MHz (1.6ps slack after SP&R at 45nm) JPEG at 850 MHz (93ps slack after SP&R at 45nm) Two technology libraries: TSMC 45GS and 65GPLUS -31-
Power Area (µm (mw) 2 ) Observation 4 48980 36.5 48960 36.4 36.3 48940 36.2 48920 36.1 48900 36 48880 35.9 35.8 48860 35.7 48840 35.6 48820 35.5 48800 35.4 0 0 1 1 22 33 44 5 6 7 8 9 10 Large positive timing slack at MTTF = 10 years can lead to smaller area when MTTF requirement is reduced Large positive timing slack at MTTF = 10 years can lead to smaller power when MTTF requirement is reduced -32-
Power Area (µm (mw) 2 ) Observation 5 16.8 13740 16.6 13720 16.4 13700 16.2 13680 16 15.8 13660 15.6 13640 15.4 13620 15.2 13600 15 0 0 1 1 2 2 3 3 44 55 6 7 8 9 10 10 Area and power can decrease as MTTF requirement is reduced for designs with loose timing constraints Small positive timing slack at MTTF = 10 years can lead to increase in area as MTTF requirement is reduced Small positive timing slack at MTTF = 10 years can lead to increase in power as MTTF requirement is reduced -33-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -34-
Conventional EM Fixes and MTTF Study how conventional SI and EM fixing methods affect area and performance at reduced MTTF requirements. Setup Sweep MTTF from 10 years down to 1 year Apply per-net NDRs, driver downsizing and fanout reduction fixes Study using AES, JPEG and DMA testcases Two technology libraries: TSMC 45GS and 65GPLUS Insights are very instance-, technology/libraryand flow-specific -35-
Observation 6 3% 2.5% 2% 1.5% 1% 0.5% Fmax Area 0% AES JPEG DMA Fixing EM violations using NDRs can be effective in improving F max only till MTTF = 7 years % increase in F max is less than 5% % increase in area is ~2% -36-
Observation 7 3% 3.5% 3% 2.5% 2.5% 2% 2% 1.5% 1.5% 1% 1% 0.5% 0.5% 0% 0% Fmax Area 8 11 14 17 20 23 26 29 Fanout 12 16 20 24 28 32 Driver size NDRs can be more effective knobs to increase F max with less increase in area Fanout reductions to fix EM can increase F max by 3% at the cost of 1.86% increase in area Drive downsizing to fix EM can increase F max by 2.5% at the cost of 2% increase in area -37-
Outline Motivation Previous Work Our Work Preliminaries Study 1: MTTF vs. F max Study 2: MTTF vs. Area, Power Insights on Conventional EM Fixes Conclusions -38-
Conclusions We study and quantify potential impacts of improved EM-awareness in designs through two basic studies Our key observations Study 1: Available performance scaling (up to 80%) from MTTF reduction is dependent on EM slack Study 2: Area and power can decrease when MTTF is reduced in designs with loose timing constraints Additional studies: NDRs can be more effective in increasing performance ~5% at the cost of 2% increase in area for MTTF up to 7 years Ongoing work EM reliability requirements in multiple operating modes Combined impacts of EM and other back end of the line reliability mechanisms on interconnect lifetime -39-
Acknowledgments Work supported by IMPACT, SRC, NSF, Qualcomm Inc. and NXP Semiconductors -40-
Thank You! -41-
Backup -42-
Hotspot Setup We use Hotspot5.0 calibrated with thermal package from Qualcomm Inc. We perform two kinds of modeling Without heat spread and heat sink when profiling single block of AES, JPEG or DMA (area in µm 2 ) With heat spreader and heat sink when profiling 50x50 blocks of AES, JPEG, or DMA in an area of ~5mm 2 We get same values of temperature for a single block from both these methods -43-