GMU, ECE 680 Physical VLSI Design 1

Similar documents
Digital Integrated Circuits A Design Perspective

Lecture 25. Semiconductor Memories. Issues in Memory

Semiconductor Memories

Semiconductor Memory Classification

Digital Integrated Circuits A Design Perspective. Semiconductor. Memories. Memories

Semiconductor Memories

SEMICONDUCTOR MEMORIES

Magnetic core memory (1951) cm 2 ( bit)

Hw 6 and 7 Graded and available Project Phase 2 Graded Project Phase 3 Launch Today

EE141- Fall 2002 Lecture 27. Memory EE141. Announcements. We finished all the labs No homework this week Projects are due next Tuesday 9am EE141

Semiconductor memories

Semiconductor Memories

Memory Trend. Memory Architectures The Memory Core Periphery

Chapter Overview. Memory Classification. Memory Architectures. The Memory Core. Periphery. Reliability. Memory

EE241 - Spring 2000 Advanced Digital Integrated Circuits. References

Semiconductor Memories

EE141. EE141-Spring 2006 Digital Integrated Circuits. Administrative Stuff. Class Material. Flash Memory. Read-Only Memory Cells MOS OR ROM

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Semiconductor Memories. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Paolo Spirito

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Topics. Dynamic CMOS Sequential Design Memory and Control. John A. Chandy Dept. of Electrical and Computer Engineering University of Connecticut

! Charge Leakage/Charge Sharing. " Domino Logic Design Considerations. ! Logic Comparisons. ! Memory. " Classification. " ROM Memories.

Semiconductor Memories

Administrative Stuff

CMOS Digital Integrated Circuits Lec 13 Semiconductor Memories

ECE520 VLSI Design. Lecture 23: SRAM & DRAM Memories. Payman Zarkesh-Ha

! Memory. " RAM Memory. ! Cell size accounts for most of memory array size. ! 6T SRAM Cell. " Used in most commercial chips

EE141-Fall 2011 Digital Integrated Circuits

Thin Film Transistors (TFT)

9/18/2008 GMU, ECE 680 Physical VLSI Design

Topics to be Covered. capacitance inductance transmission lines

S No. Questions Bloom s Taxonomy Level UNIT-I

University of Toronto. Final Exam

MOSFET: Introduction

ECE251. VLSI System Design

EE141Microelettronica. CMOS Logic

Moores Law for DRAM. 2x increase in capacity every 18 months 2006: 4GB

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

Flash Memory Cell Compact Modeling Using PSP Model

MTJ-Based Nonvolatile Logic-in-Memory Architecture and Its Application

Name: Answers. Mean: 83, Standard Deviation: 12 Q1 Q2 Q3 Q4 Q5 Q6 Total. ESE370 Fall 2015

EECS 312: Digital Integrated Circuits Final Exam Solutions 23 April 2009

L ECE 4211 UConn F. Jain Scaling Laws for NanoFETs Chapter 10 Logic Gate Scaling

Lecture 34: Portable Systems Technology Background Professor Randy H. Katz Computer Science 252 Fall 1995

Random Access Memory. DRAM & SRAM Design DRAM SRAM MS635. Dynamic Random Access Memory. Static Random Access Memory. Cell Structure. 6 Tr.

CS 152 Computer Architecture and Engineering

C.K. Ken Yang UCLA Courtesy of MAH EE 215B

Fig. 1 CMOS Transistor Circuits (a) Inverter Out = NOT In, (b) NOR-gate C = NOT (A or B)

Spiral 2 7. Capacitance, Delay and Sizing. Mark Redekopp

F14 Memory Circuits. Lars Ohlsson

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

VLSI GATE LEVEL DESIGN UNIT - III P.VIDYA SAGAR ( ASSOCIATE PROFESSOR) Department of Electronics and Communication Engineering, VBIT

CMOS Inverter (static view)

ELEN0037 Microelectronic IC Design. Prof. Dr. Michael Kraft

Memory, Latches, & Registers

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Dynamic Combinational Circuits. Dynamic Logic

Lecture 12 CMOS Delay & Transient Response

ESE570 Spring University of Pennsylvania Department of Electrical and System Engineering Digital Integrated Cicruits AND VLSI Fundamentals

AE74 VLSI DESIGN JUN 2015

Lecture 23. Dealing with Interconnect. Impact of Interconnect Parasitics

CMOS Inverter. Performance Scaling

EE115C Winter 2017 Digital Electronic Circuits. Lecture 3: MOS RC Model, CMOS Manufacturing

CMPEN 411 VLSI Digital Circuits Spring Lecture 21: Shifters, Decoders, Muxes

VLSI VLSI CIRCUIT DESIGN PROCESSES P.VIDYA SAGAR ( ASSOCIATE PROFESSOR) Department of Electronics and Communication Engineering, VBIT

ENGR890 Digital VLSI Design Fall Lecture 4: CMOS Inverter (static view)

Dynamic Combinational Circuits. Dynamic Logic

Digital Integrated Circuits. The Wire * Fuyuzhuo. *Thanks for Dr.Guoyong.SHI for his slides contributed for the talk. Digital IC.

The Physical Structure (NMOS)

MOS Transistor Theory

Objective and Outline. Acknowledgement. Objective: Power Components. Outline: 1) Acknowledgements. Section 4: Power Components

CMPEN 411 VLSI Digital Circuits. Lecture 04: CMOS Inverter (static view)

Introduction to Computer Engineering. CS/ECE 252, Fall 2012 Prof. Guri Sohi Computer Sciences Department University of Wisconsin Madison

Lecture 6 Power Zhuo Feng. Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 2010

Single Event Effects: SRAM

WARM SRAM: A Novel Scheme to Reduce Static Leakage Energy in SRAM Arrays

Floating Point Representation and Digital Logic. Lecture 11 CS301

VLSI. Faculty. Srikanth

MOSFET and CMOS Gate. Copy Right by Wentai Liu

Lecture 16: Circuit Pitfalls

Digital Integrated Circuits A Design Perspective

3. Design a stick diagram for the PMOS logic shown below [16] Y = (A + B).C. 4. Design a layout diagram for the CMOS logic shown below [16]

Chapter 2 CMOS Transistor Theory. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

Chapter 7. VLSI System Components

CMOS Transistors, Gates, and Wires

COMBINATIONAL LOGIC. Combinational Logic

Power Dissipation. Where Does Power Go in CMOS?

CS 152 Computer Architecture and Engineering. Lecture 11 VLSI

MOS Transistor Theory

Interconnects. Wire Resistance Wire Capacitance Wire RC Delay Crosstalk Wire Engineering Repeaters. ECE 261 James Morizio 1

CMPEN 411 VLSI Digital Circuits. Lecture 03: MOS Transistor

and V DS V GS V T (the saturation region) I DS = k 2 (V GS V T )2 (1+ V DS )

Chapter 3 Basics Semiconductor Devices and Processing

ENEE 359a Digital VLSI Design

CMPEN 411 VLSI Digital Circuits Spring 2012

ECE 438: Digital Integrated Circuits Assignment #4 Solution The Inverter

Static CMOS Circuits. Example 1

Introduction to Computer Engineering. CS/ECE 252, Spring 2017 Rahul Nayar Computer Sciences Department University of Wisconsin Madison

MOS Transistor Properties Review

High-to-Low Propagation Delay t PHL

Transcription:

ECE680: Physical VLSI Design Chapter VIII Semiconductor Memory (chapter 12 in textbook) 1

Chapter Overview Memory Classification Memory Architectures The Memory Core Periphery Reliability Case Studies 2

Semiconductor Memory Classification Read-Write Memory Non-Volatile Read-Write Memory Read-Only Memory Random Access Non-Random Access EPROM E 2 PROM Mask-Programmed Programmable (PROM) SRAM FIFO FLASH DRAM LIFO Shift Register CAM 3

Memory Timing: Definitions Read cycle READ Read access Read access Write cycle WRITE Data valid Write access DATA Data written 4

Memory Architecture: Decoders M bits M bits S 0 S 0 Word 0 S 1 Word 1 A 0 S 2 Storage Word 2 cell A 1 Word 0 Word 1 Word 2 Storage cell N words S N 2 2 S N 2 1 Word N2 2 Word N2 1 A K 2 1 K 5 log 2 N Decoder Word N2 2 Word N2 1 Input-Output (M bits) Input-Output (M bits) Intuitive architecture for N x M memory Too many select signals: N words == N select signals Decoder reduces the number of select signals K = log 2 N 5

Array Structured Memory Architecture Problem: ASPECT RATIO or HEIGHT >> WIDTH Bit line 2 L 2 K Storage cell A K A K11 A L 21 Decoder Row Word line Sense amplifiers / Drivers M.2 K Amplify swing to rail-to-rail amplitude A 0 A K21 Column decoder Selects appropriate word Input-Output p (M bits) 6

Hierarchical Memory Architecture Block 0 Block i Block P 2 1 Row address Column address Block address Control circuitry Block selector Global amplifier/driver Global data bus Advantages: 1. Shorter wires within blocks 2. Block address activates only 1 block => power savings I/O 7

Block Diagram of 4 Mbit SRAM Clock generator Z address buffer Predecoder and block selector Bit line load X address buffer Block 30 Subglobal row decoder Block 31 Global row Block 1 decoderd Transfer gate Column decoder Sense amplifier and write driver 128 K Array Block 0 Subglobal row decoder Sense amplifier and write driver Local row decoder CS, WE buffer I/O buffer x1/x4 controller Y address buffer X address buffer GMU, ECE [Hirose90] 680 Physical VLSI Design 8

Contents Addressable Memory Buffers I/O B Data (64 bits) mands Com Comparand Mask Control Logic R/W Address (9 bits) Ad ddress Decod der CAM Array 2 9 words 3 64 bits 2 9 Validity Bit ts er riority Encod P 9

Memory Timing: Approaches Address bus Row Address Column Address RAS CAS Address Bus Address Address transition initiates memory operation RAS-CAS timing DRAM Timing SRAM Timing i Multiplexed Adressing Self timed 10

Read Only Memory Cells 1 WL BL WL V DD BL WL BL BL BL BL 0 WL WL WL GND Diode ROM MOS ROM 1 MOS ROM 2 11

MOS OR ROM BL [0] BL [1] BL [2] BL [3] WL [0] V DD WL [1] WL [2] V DD WL [3] V bias Pull down loads 12

MOS NOR ROM V DD Pull up up devices WL [0] WL [1] GND WL [2] GND WL [3] BL [0] BL [1] BL [2] BL [3] 13

MOS NOR ROM Layout Cell (9.5 x 7 ) Programmming using the Active Layer Only Polysilicon Metal1 Diffusion Metal1 on Diffusion 14

MOS NOR ROM Layout Cell (11 x 7 ) Programmming using the Contact Layer Only Polysilicon Metal1 Diffusion Metal1 on Diffusion 15

MOS NAND ROM V DD Pull up devices BL [0] BL [1] BL [2] BL [3] WL [0] WL [1] WL [2] WL [3] All word lines high by default with exception of selected row 16

MOS NAND ROM Layout Cell (8 x 7 ) Programmming using the Metal-1 Layer Only No contact to VDD or GND necessary; drastically reduced cell size Loss in performance compared to NOR ROM Polysilicon Diffusion Metal1 on Diffusion 17

NAND ROM Layout Cell (5 x 6 ) Programmming using Implants Only Polysilicon Threshold altering implant Metal1 on Diffusion 18

Equivalent Transient Model for MOS NOR ROM Model for NOR ROM V DD WL r word BL C bit c word Word line parasitics Wire capacitance and gate capacitance Wire resistance (polysilicon) Bit line parasitics Resistance not dominant (metal) Drain and Gate Drain capacitance Page 642

Equivalent Transient Model for MOS NAND ROM Model for NAND ROM V DD BL rbit C L WL r word c bit c word Word line parasitics Similar to NOR ROM Bit line parasitics Resistance of cascaded transistors dominates Drain/Source and complete gate capacitance 20

Decreasing Word Line Delay WL Driver Polysilicon word line Metal word line (a) Driving the word line from both sides Metal bypass WL K cells (b) Using a metal bypass Polysilicon word line (c) () Use silicides 21

Precharged MOS NOR ROM f pre V DD Precharge devices WL [0] WL [1] GND WL [2] GND WL [3] BL [0] BL [1] BL [2] BL [3] PMOS precharge device can be made as large as necessary, but clock driver becomes harder to design. 22

Non Volatile Memories The Floating gate transistor (FAMOS) Source Floating gate Gate Drain D t ox G n + Substrate p t ox n +_ S Device cross section Schematic symbol 23

Floating Gate Transistor Programming 20 V 0 V 5 V 10 V 5 V 20 V 2 5 V 0 V 2 2.5 V 5 V S D S D S D Avalanche injection Removing programming voltage leaves charge trapped Programming results in higher V T. 24

A Programmable Threshold Transistor I 0 -state t 1 -state t I D ON DV T OFF V WL V GS 25

FLOTOX EEPROM Source Floating gate Gate Drain I 20 30 nm 10 V n1 Substrate n1 p 10 nm 10 V V GD FLOTOX transistor Fowler Nordheim I Vcharacteristic 26

EEPROM Cell BL WL V DD Absolute threshold control is hard Unprogrammed transistor might be depletion 2 transistor cell 27

Flash EEPROM Control gate Floating gate erasure Thin tunneling oxide n 1 source n 1 drain programming p-substrate Many other options 28

Cross sections of NVM cells Flash EPROM GMU, Courtesy ECE 680 Physical Intel VLSI Design 29

Basic Operations in a NOR Flash Memory Erase cell array BL 0 BL 1 12 V G 0 V WL 0 S D 12 V 0 V WL 1 open open 30

Basic Operations in a NOR Flash Memory Write 12 V BL 0 BL 1 G 6 V 12 V WL 0 S D 0 V 0 V WL 1 6 V 0 V 31

Basic Operations in a NOR Flash Memory Read 5 V G 1 V 5V BL 0 BL 1 WL 0 S D 0 V 0 V WL 1 1 V 0 V 32

NAND Flash Memory Word line(poly) Unit Cell Gate ONO Gate Oxide FG Source line (Diff. Layer) Courtesy Toshiba 33

NAND Flash Memory Select transistor Word lines Active area STI Bit line contact Source line contact GMU, Courtesy ECE 680 Physical Toshiba VLSI Design 34

Characteristics of State of the art NVM 35

Read Write Memories (RAM) STATIC (SRAM) Dt Data stored as long as supply is applied Large (6 transistors/cell) Fast Differential DYNAMIC (DRAM) Periodic refresh required Small (1 3 transistors/cell) Slower Single Ended 36

6 transistor CMOS SRAM Cell M 2 WL V DD M 4 Q M Q M 5 6 M 1 M 3 BL BL 37

CMOS SRAM Analysis (Read) WL BL V DD M 4 Q = 0 Q = 1 M 6 M 5 BL V DD M 1 V DD V DD C bit C bit 38

CMOS SRAM Analysis (Read) 1.2 1 ge Rise (V V) Volta 0.8 0.6 0.4 0.2 Voltage rise [V] 0 0 0.5 1 1.2 1.5 2 Cell Ratio (CR) 2.5 3 39

CMOS SRAM Analysis (Write) V DD WL M 4 Q = 0 M 6 M 5 Q = 1 M 1 V DD BL = 1 BL = 0 40

CMOS SRAM Analysis (Write) 41

6T SRAM Layout V DD M2 M4 Q Q M1 M3 M5 M6 GND WL BL BL 42

Resistance load SRAM Cell V DD WL R L R L M 3 Q Q M 4 BL M 1 M 2 BL Static power dissipation -- Want R L large Bit lines precharged to V DD to address t p problem 43

SRAM Characteristics Page 663 44

3 Transistor DRAM Cell BL 1 BL 2 WWL RWL WWL M 3 RWL M 1 X M 2 X V DD - V T C S BL 1 V DD BL 2 V DD -V T ΔV No constraints on device ratios Reads are non destructive Value stored at node X when writing a 1 = V WWL VV Tn 45

3T DRAM Layout BL2 BL1 GND RWL M3 M2 WWL M1 Area: 50% of 6 T SRAM 46

1 Transistor DRAM Cell WL BL WL Write 1 Read 1 M 1 X GND V DD 2 V T C S BL V DD V DD /2 V sensing DD /2 C BL Write: C S is charged or discharged by asserting WL and BL. Read: Charge redistribution takes places between bit line and storage capacitance V = VBL V PRE = V BIT V PRE C S ------------ C S + C BL Voltage swing is small; typically around 250 mv. 47

DRAM Cell Observations 1T DRAM requires a sense amplifier for each bit line, due to charge redistribution readout. DRAM memory cells are single ended in contrast to SRAM cells. The read out of the 1T DRAM cell is destructive; read and refresh operations are necessary for correct operation. Unlike 3T cell, 1T cell requires presence of an extra capacitance that must be explicitly included in the design. When writing a 1 into a DRAM cell, a threshold voltage is lost. This charge loss can be circumvented by bootstrapping the word lines to a higher value than V DD 48

Sense Amplifiers t p make V as small C V = ---------------- as possible I av large small Idea: Use Sense Amplifer small transition s.a. input output 49

Differential Sense Amplifier V DD M 3 M 4 y Out bit M 1 M 2 bit SE M 5 Directly applicable to SRAMs 50

Sense Amp Operation V BL V (1) V PRE D V (1) V(0) Sense amp activated t Word line activated 51

1 T DRAM Cell Capacitor Metal word line Poly n + n + Inversion layer Poly induced by plate bias Cross section SiO 2 Field Oxide Diffused bit line Polysilicon gate Layout Polysilicon plate M 1 word line Uses Polysilicon Diffusion Capacitance Expensive in Area 52

SEM of poly diffusion capacitor 1T DRAM 53

Advanced 1T DRAM Cells Word line Insulating Layer Cell plate Capacitor dielectric layer Cell Plate Si Capacitor Insulator Refilling Poly Transfer gate Isolation Storage electrode Storage Node Poly Si Substrate 2nd Field Oxide Trench Cell Stacked capacitor Cell 54

Static CAM Memory Cell Word Bit Bit Bit Bit CAM CAM Bit M4 M8 M6 M9 M7 M5 Bit Word CAM CAM Word Match S M3 int M2 S M1 Wired NOR Match Line 55

CAM in Cache Memory CAM ARRAY Hit Logic SRAM ARRAY Address Decoder Input Drivers Sense Amps / Input Drivers Address Tag Hit R/W Data 56

Periphery Decoders Sense Amplifiers Input/Output Buffers Control l/ Timing i Circuitry it 57

Row Decoders Collection of 2 M complex logic gates Organized in regular and dense fashion (N)AND Decoder NOR Decoder 58

Hierarchical Decoders Multi stage implementation improves performance WL 1 WL 0 A 0 A 1 A 0 A 1 A 0 A 1 A 0 A 1 A 2 A 3 A 2 A 3 A 2 A 3 A 2 A 3 NAND decoder using 2 input pre decoders A 1 A 0 A 0 A 1 A 3 A 2 A 2 A 3 59

Dynamic Decoders Precharge devices GND GND V DD WL 3 V WL DD 3 WL 2 V DD WL 2 WL 1 WL 0 V DD WL 1 WL 0 V DD A 0 A 0 A 1 A 1 A 0 A 0 A 1 A 1 2 input NOR decoder 2 input NAND decoder 60

4 input pass transistor based column decoder BL 0 BL 1 BL 2 BL 3 S 0 A 0 S 1 S 2 A 1 S 3 2-input NOR decoder Advantages: speed (t pd does not add to overall memory access time) Only one extra transistor in signal path Disadvantage: Large transistor count D 61

4 to 1 tree based column decoder BL 0 BL 1 BL 2 BL 3 A 0 A 0 A 1 A 1 D Number of devices drastically reduced Delay increases quadratically with # of sections; prohibitive for large decoders Solutions: buffers progressive sizing combination of tree and pass transistor approaches 62