Semiconductor memories
Semiconductor Memories Data in Write Memory cell Read Data out Some design issues : How many cells? Function? Power consuption? Access type? How fast are read/write operations?
Semiconductor Memories RWM NVRWM ROM Random Access Non-Random Access EPROM E 2 PROM Mask-Programmed Programmable (PROM) SRAM FIFO FLASH DRAM LIFO Shift Register CAM
Architecture Problem: ASPECT RATIO or HEIGHT >> WIDTH 2 L-K Bit Line Storage Cell A K A K+1 A L-1 Row Decoder Word Line M.2 K Sense Amplifiers / Drivers Amplify swing to rail-to-rail amplitude A 0 A K-1 Column Decoder Selects appropriate word Input-Output (M bits)
Architecture: example 256 cells = 32 bytes 16 columns L=5 L-K 16 rows 2 4 M = 8 K = 1 L-K=4 M K 2 bytes 8 x 2 1
Blocks ( 1Mbit) Row Address Column Address Block Address Control Circuitry Block Selector Global Amplifier/Driver I/O Global Data Bus Advantages: 1. Shorter wires within blocks 2. Block address activates only 1 block => power savings
Architecture: example 256 cells = 32 bytes = 8 blocks (32 bits) 4 columns 8 rows.. Block 0 Block 1 Block 7
Area occupation INTEL 2102 1Kx1bit SRAM 1972
Area occupation Micron 1 Gbit DRAM Today
Write/read cycles Read Cycle READ Read Access Read Access Write Cycle WRITE Data Valid Write Access DATA Data Written
Semiconductor Memories Write cycle Read cycle
Read-Write Memories (RAM) STATIC (SRAM) Data stored as long as supply is applied Large (6 transistors/cell) Fast Differential DYNAMIC (DRAM) Periodic refresh required Small (1-3 transistors/cell) Slower Single Ended
6-transistor CMOS SRAM Cell WL M2 M4 M5 Q Q M6 M2 M4 Q Q M1 M3 M6 M1 M3 M8 S M5 M7 R SR Flip-Flop
CMOS SRAM Analysis (Read) WL precharge sensing M4 WL Q = 0 M6 M5 Q = 1 M1 V DD /2 /2 C bit C bit /2 Out 0V k n M5 --------------- V V DD DD ----------- V 2 2 Tn ----------- 2 V 2 = k 2 n M1 V Tn ----------- ----------- DD 2 8 Differential sense amplifier (W/L) n,m5 10 (W/L) n,m1 (supercedes read constraint) Out pin
CMOS SRAM Analysis (Read)
CMOS SRAM Analysis (Read) WL M4 M5 Q = 0 Q = 1 M6 M1 V DD /2 /2 C bit C bit k n M5 --------------- V V DD DD ----------- V 2 2 Tn ----------- 2 V 2 = k 2 n M1 V Tn ----------- ----------- DD 2 8 (W/L) n,m5 10 (W/L) n,m1 (supercedes read constraint)
CMOS SRAM Analysis (Write) WL M4 M5 Q = 0 Q = 1 M6 M1 = 1 = 0 k n M6 V Tn V 2 DD ---------- V ---------- DD = k p M4 2 8 V Tp ---------- 2 2 V ---------- DD 8 (W/L) n,m6 0.33 (W/L) p,m4 k n M5 2 ------------- V V DD DD ---------- V Tn ---------- 2 k 2 2 n M1 V Tn ---------- ----------- 2 8 2 = (W/L) n,m5 10 (W/L) n,m1 in ( =0)
CMOS SRAM Analysis (Write) WL M4 M5 Q = 0 Q = 1 M6 M1 = 1 = 0 k n M6 V Tn V 2 DD ---------- V ---------- DD = k p M4 2 8 V Tp ---------- 2 2 V ---------- DD 8 (W/L) n,m6 0.33 (W/L) p,m4 k n M5 2 2 ------------- V V DD DD ---------- V Tn ---------- 2 k 2 2 n M1 V Tn ---------- ----------- 2 8 = (W/L) n,m5 10 (W/L) n,m1
6 transistor SRAM : features 6 transistors cell (large number!) nmos and PMOS in the cell wells 5 lines :,, WL, VDD, GND Critical read driven by small cell transistors increase access time
Resistance-load SRAM Cell WL M3 Q M1 R L M2 Q R L M4 Reduced cell size : No well Possible vertical integration Static power dissipation Very high value of R L not suitable for embedded Static power dissipation -- Want R L large Bit lines precharged to to address t p problem
1-Transistor DRAM Cell WL M1 x C S WL X GND Write "1" Read "1" V T C C S 30fF /2 sensing /2 C /C S 10100 Write: C S is charged or discharged by asserting WL and. Read: Charge redistribution takes places between bit line and storage capacitance C S V = V V PRE = V BIT V PRE ----------------------- C S + C Voltage swing is small; typically around 250 mv.
DRAM Cell Observations 1T DRAM requires a sense amplifier for each bit line, due to charge redistribution read-out. DRAM memory cells are single ended in contrast to SRAM cells. The read-out of the 1T DRAM cell is destructive; read and refresh operations are necessary for correct operation. Unlike 3T cell, 1T cell requires presence of an extra capacitance that must be explicitly included in the design. When writing a 1 into a DRAM cell, a threshold voltage is lost. This charge loss can be circumvented by bootstrapping the word lines to a higher value than.
1-T DRAM Cell Capacitor Metal word line poly SiO 2 M1 word line n + n + Field Oxide poly Inversion layer induced by plate bias (a) Cross-section Diffused bit line Polysilicon gate Polysilicon plate (b) Layout Used Polysilicon-Diffusion Capacitance Expensive in Area
Advanced 1T DRAM Cells Word line Insulating Layer Cell plate Capacitor dielectric layer Cell Plate Si Capacitor Insulator Refilling Poly Transfer gate Storage electrode Isolation Storage Node Poly 2nd Field Oxide Si Substrate Trench Cell Stacked-capacitor Cell
Periphery D e c o d e rs S e n s e A m p lifie rs In p u t/o u tp u t B u ffe rs C o n tro l / T im in g C ir c u itry
Row Decoders Collection of 2 M complex logic gates Organized in regular and dense fashion (N)AND Decoder NOR Decoder
Dynamic Decoders Precharge devices GND GND WL 3 WL 3 WL 2 WL 2 WL 1 WL 1 WL 0 WL 0 A 0 A 0 A 1 A 1 A 0 A 0 A 1 A 1 Dynamic 2-to-4 NOR decoder 2-to-4 MOS dynamic NAND Decoder Propagation delay is primary concern
4 input pass-transistor based column decoder 0 1 2 3 A 0 A 1 2 input NOR decoder S 0 S 1 S 2 S 3 D dvantage: speed (t pd does not add to overall memory access time) only 1 extra transistor in signal path sadvantage: large transistor count
4-to-1 tree based column decoder 0 1 2 3 A 0 A 0 A 1 A 1 D Number of devices drastically reduced Delay increases quadratically with # of sections; prohibitive for large decoders Solutions: buffers progressive sizing combination of tree and pass transistor approaches
Sense Amplifiers t p = C ---------------- V I av make V as small as possible large small Idea: Use Sense Amplifer small transition s.a. input output
Differential Sensing - SRAM PC y M3 M4 y EQ x M1 M2 SE M5 x x SE x WL i (b) Doubled-ended Current Mirror Amplifier SRAM cell i Diff. x Sense x Amp y y D D x y SE y x (a) SRAM sensing scheme. (c) Cross-Coupled Amplifier
Differential Sensing - SRAM P WL S
Latch-Based Sense Amplifier EQ SE SE Initialized in its meta-stable point with EQ Once adequate voltage gap created, sense amp enabled with SE Positive feedback quickly forces output to a stable operating point.
Open bitline architecture EQ R L 1 L 0 R 0 R 1 SE L L R C S... C S C S SE... C S C S C S dummy cell dummy cell
DRAM Read Process with Dummy Cell 6.0 V (Volt) V (Volt) 4.0 2.0 0.00 1 2 3 4 5 t (nsec) (a) reading a zero 6.0 4.0 2.0 V (Volt) 5.0 4.0 WL 3.0 SE 2.0 EQ 1.0 0.0 0 1 2 3 4 5 (c) control signals 0.00 1 2 3 4 5 t (nsec) (b) reading a one
Open Bit-line Architecture Cross Coupling EQ WL 1 WL 0 WL D WL D WL 0 WL 1 C W C W C C C C Sense Amplifier C C C C
Alpha-particles -particle WL n + SiO 2 1 particle ~ 1 million carriers
Yield Yield curves at different stages of process maturity (from [Veendrick92])
Redundancy Redundant columns Redundant rows Memory Array Row Address Row Decoder : Fuse Bank Column Decoder Column Address
Redundancy and Error Correction