Beruflich Dokumente
Kultur Dokumente
2018
SESSION
30
Emerging
Memories
A N40 256KX44 Embedded RRAM Macro with SL-
Pre-charged SA and Low Voltage Write Current
Limiter to Improve Read and Write Performance
TSMC
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 1 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-Charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 2 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 3 of 32
Basic RRAM Operations
Set Reset
Fresh Sample V-forming Low Resistance State High Resistance State
BL 2.4V 1.4V 0V
TE TE TE TE
O2- Oxygen
Capping
vacancies
Hi-K
BE BE BE BE
WL 1V 1.3V 2.4V
SL 0V 1.4V
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 4 of 32
Cell Current Distribution
RESET SET
Variation: Process, Bias, etc
Tighter Distribution
Bit Count
Better endurance
Better retention
More read window
IR0 IR1
Cell Current
∆
Read Window
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 5 of 32
Self-Adaptive Write Scheme
FB
VDDH VDIO
VREF -
Verdict COMP Write Vref WR_VOL
+
DATA + OP2
Module Generator - IWRITE
RS SET
WEN
VWRITE
IWRITE
Fast Cell Slow Cell
1T1R
Prevent Fast cells from Over-Write,
[X.Xue, et al., JSSC ,2013] Save Write Power
Area, Power Penalty
Voltage Overhead = Vdsat+VDS
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 6 of 32
Current Limiter
VWRITE Specified Filament Size
Protect Fast Cells by Current Limiting
IWRITE Area Efficient
Icomp Voltage Overhead = Vdsat+VDS
1T1R
IWRITE
+
VDD Icomp
VD (300uA)
- VD
0.5V
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 7 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-Charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 8 of 32
LV-WCLS: 2-Step Operation of Current Limiter
VWRITE SET_B
SET_B SW1
= <
Icomp SW2
1T1R
700
0.1V + 600
VD
N1 N3
- 500
+ 1 1-X X N4
ID (uA)
OP2 400
0.1V -
SW1 SW2 300
OP1 EN N2 N5 N6
200 SW1+SW2
100
0
Post-Delay, Td 0 0.1 0.2
VD (V)
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 9 of 32
LV-WCLS: Full Conduction Phase
VWRITE SET_B
SET_B SW1
= <
Icomp SW2
1T1R
700
0.1V + 600
VD
N1 N3
- 500
+ 1 1-X X N4
ID (uA)
OP2 400
0.1V -
SW1 SW2 300
OP1 EN N2 N5 N6
200 SW1+SW2
100
0
Post-Delay, Td 0 0.1 0.2
VD (V)
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 10 of 32
LV-WCLS: Partial Conduction Phase
VWRITE SET_B
SET_B SW1
= <
Icomp SW2
1T1R
700
0.1V + 600
VD
N1 N3
- 500
+ 1 1-X X N4
ID (uA)
OP2 400
0.1V -
SW1 SW2 300 SW1+SW2
OP1 EN N2 N5 N6
200
SW1
100
0
Post-Delay, Td 0 0.1 0.2
VD (V)
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 11 of 32
LV-WCLS: Write Termination
Td
VWRITE SET_B
SET_B SW1
= <
Icomp SW2
1T1R
700
0.1V + 600
VD
N1 N3
- 500
+ 1 1-X X N4
ID (uA)
OP2 400
0.1V -
SW1 SW2 300
OP1 EN N2 N5 N6
200 SW1
100
0
Post-Delay, Td 0 0.1 0.2
VD (V)
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 12 of 32
OP2: Latch-Type Voltage Comparator
Reuse Read SA
Area Saving VDD
CLK
0.1V CLK
VD
SW2
VD
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 13 of 32
Performance of LV-WCLS
10000
Only 0.1V of voltage overhead
1000
LV_WCLS OFF
LV_WCLS ON
100
10
1
40 45 50 55 60 65 70 75
Cell Current (a.u.)
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 14 of 32
Voltage Overhead Comparison
VDDH VWRITE
+ VWRITE
IWRITE
IWRITE
SET_B
>800mV Icomp
+
- 1T1R 1T1R
-
VWRITE
IWRITE + +
VBIAS
100mV >500mV
1T1R
- -
Self-Adaptive Write Scheme This Work Conv. Current Limiter
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 15 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 16 of 32
Common-SL Architecture
SL-Read: Better immunity from read disturb than BL-read
Common SL
Pros: Reduce SL resistance, feasibility to further shrink bit-cell
Cons: Incur more capacitive load at SL node
SA
SA
SL Iref
SL Iref WL
WL BL
BL EQ1
EQ0
Icell Icell
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 18 of 32
Read SA Operation (1/3)
VDD
Standby VDD VDD Pre-charger
P3
Read Path is Off (SE=“0”) P0 P1
SW1 On, SW2 Off SW2 SW1
SL & BL are Pre-Charged at VRSL RDI
RDI/RDREF = VDD RDREF DO
N0
VCL N1
SE=0
0.3V (VRSL)
WL C_SL
VRD
0.3V (VRSL)
C_BL
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 19 of 32
Read SA Operation (2/3)
VDD
Pre-charge VDD VDD Pre-charger
P3
Reference branch is on (SE=“1”) P0 P1
SW1 on, SW2 off SW2 SW1
Cell branch connects to RDI Pre-Charger RDI
RDI = VDD-VTH(P3) ~ RDREF RDREF DO
N0
VCL N1
~0.3V ~0.3V
SE=1
0.3V (VRSL)
WL C_SL
VRD
0.3V (VRSL)
Iref C_BL
Icell
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 20 of 32
Read SA Operation (3/3)
VDD
Signal Development VDD VDD Pre-charger
P3
Reference branch is on (SE=“1”) P0 P1
SW1 off, SW2 on SW2 SW1
Signal = RDI – RDREF RDI
Latch-SA amplifies signal RDREF DO
N0
VCL N1
~0.3V ~0.3V
SE=1
0.3V (VRSL)
WL C_SL
VRD
0.3V (VRSL)
Iref C_BL
Icell
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 21 of 32
Waveform of the SA Operation
READ “1” This work READ “0”
Grounded SL
Tacc Tacc
Grounded-SL SE
Favor read “1”
Charge SL by cell current WL
Un-balanced read 58% improved
SW1
Precharged-SL
SL has been pre-charged
SW2
Discharge BL by VSS RDI
Balanced read RDREF
SL/BL SL SL
(This work) BL BL
SL/BL BL SL BL SL
(Grounded SL)
DOUT
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 22 of 32
Shmoo Plot: Tacc v.s. VRSL
VRSL: Pre-Charged SL/BL Voltage Level
Optimum VRSL Can Achieve Balanced Read 0 and Read 1
VRSL(V) V.S Tacc(ns) 5 6 7 8 9 10 11 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40
0.28 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Read 0 0.26 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
HB=24uA 0.24 . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P
Iref=25uA 0.22 . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P
∆ =1uA 0.2 . . . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P
0.18 . . . . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P
0.28 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
0.26 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Read 1 0.24 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Iref=27.6uA
Ir1 LB=30uA 0.22 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
∆ =2.4uA 0.2 . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
0.18 . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 23 of 32
Shmoo Plot: Tacc v.s. VRSL
9nS of Access Time can be achieved for both Read”0” and “1”, at
VRSL=0.26~0.28V, at VDD= 1.1V / 25C.
VRSL(V) V.S Tacc(ns) 5 6 7 8 9 10 11 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40
0.28 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Read 0 0.26 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
HB=24uA 0.24 . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P
Iref=25uA 0.22 . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P
∆ =1uA 0.2 . . . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P
0.18 . . . . . . . . . . . . . . . . . P P P P P P P P P P P P P P P P P P P P
0.28 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
0.26 . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Read 1 0.24 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
Iref=27.6uA
Ir1 LB=30uA 0.22 . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
∆ =2.4uA 0.2 . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P P
0.18 . . . . . . . . . P P P P P P P P P P P P P P P P P P P P P P P P P P P P
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 24 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 25 of 32
Ir0/Ir1 Distribution after 1KC RAC at 25C
Endurance Test: Do SET/RESET cycling and check failure bit count for every Cycle
RAC (Retention After Cycles) : After Endurance Test, then bake the samples at 175C
for 11 hours, equivalent to 85C for 10 years.
1K cycles RAC @ 25゚C
Ir0>30uA FBC: 22
Ir1<42uA FBC: 0
BER: 0.008 PPM
14uA
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 26 of 32
Ir0/Ir1 Distribution After 1KC RAC at -40C
Ir0>30uA FBC: 0
Ir1<42uA FBC: 0
BER: 0 PPM
12uA
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 27 of 32
Ir0/Ir1 distribution after 1KC RAC at 125C
14uA
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 28 of 32
Die Photo and Key Parameters
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 29 of 32
Outline
1.Basics of RRAM Operations
3.SL Pre-charged SA
4.Silicon Data
5.Summary
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 30 of 32
Summary
LV-WCLS
Tighter distribution of "1" current
Reduce write operation voltage overhead by 400mV
Area efficient
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 31 of 32
Thank You !
© 2018 IEEE 30.1: A N40 256KX44 Embedded RRAM macro with SL precharged SA and low voltage write current
International Solid-State Circuits Conference limiter to improve read and write performance 32 of 32
A 1Mb 28nm STT-MRAM with 2.8ns Read
Access Time at 1.2V VDD Using Single-
Cap Offset-Cancelled Sense Amplifier
and In-situ Self-Write-Termination
Qing Dong1,2, Zhehong Wang1, Jongyup Lim1, Yiqun Zhang1, Yi-Chun
Shih3, Yu-Der Chih3, Jonathan Chang3, David Blaauw1, Dennis Sylvester1
1University
of Michigan, Ann Arbor, MI
2TSMC, San Jose, CA
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 1 of 57
Outline
Motivation
Read margin improvement
Self-generated Vref
Offset-cancelled sense amplifier
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 2 of 57
Outline
Motivation
Read margin improvement
Self-generated Vref
Offset-cancelled sense amplifier
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 3 of 57
Memory Applications and Requirements
Many-Core Systems Mobile Systems IoT Systems
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 4 of 57
Challenges of Conv. Embedded Memories
SRAM DRAM Flash
SL RP RAP
Compact area RAP
Good scalability
High performance
RP
Good endurance
Low write energy IW- IW+
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 6 of 57
STT-MRAM Read Challenge
Array0 at 20˚C
Frequency
RP RAP
Limited read margin (RAP/RP≈2)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 7 of 57
STT-MRAM Read Challenge
Array0 at 20˚C
Array0 at 120˚C
Margin degraded with
Frequency
temperature variation
RP RAP
Limited read margin (RAP/RP≈2)
Degraded read margin at high temperature
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 8 of 57
STT-MRAM Read Challenge
Array0 at 20˚C Array1 at 20˚C
Margin degraded with
Frequency
array-to-array variation
RP RAP
Limited read margin (RAP/RP≈2)
Degraded read margin at high temperature
Array to array process variations
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 9 of 57
STT-MRAM Write Challenge
Write
Pulse
20˚C
RAP
RP
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 10 of 57
STT-MRAM Write Challenge
Write
Pulse
Write completion could change
20˚C with state and variation
RAP
RP
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 13 of 57
Variation Sources in Conventional SA VDD
VDD
OUT
Vclamp Vclamp
BL
Icell Iref
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 14 of 57
Variation Sources in Conventional SA VDD
VDD
OUT
Vclamp Vclamp
BL
Icell Iref Reference
Reference PVT
PVT
Generation Variation
Generation Variation
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 15 of 57
Variation Sources in Conventional SA VDD
VDD
OUT
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 16 of 57
Variation Sources in Conventional SA VDD
VDD
PMOS
PMOS
Header Vtmismatch
Vt Mismatch
Header
OUT
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 17 of 57
Variation Sources in Conventional SA VDD
VDD
PMOS
PMOS
Header Vtmismatch
Vt Mismatch
Header
Latch-based
Latch-based
InputOffset
Input Offset
SenseAmp
Sense Amp
OUT
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 18 of 57
Conventional Reference Generation
WL
Array0 at 20˚C Array1 at 20˚C
Array0 at 120˚C
Array
Frequency
BL
MUX 1/32
Sense Amplifiers
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 20 of 57
Previous SA Works
Iread
S4
Using PMOS headers as
S3R S3R current clamper instead of
S1 C0 S1 voltage clamper
S2R
S2R
S1B
Ref BLs
WL
as PMOS headers to alleviate
SL variation
RP RAP
Vref ≈ 2Iread*(RP//RAP)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 23 of 57
Proposed Sensing Scheme
Iref WL
Gen.
Vbias_R S1
S1B
2Iread
Iread
S2R
S4
S3R
S3R S3R S4
Offset
S1 VL C0 VR S1
S2R
S2R
Cancelling
S1B
Ref BLs
BL
S4
WL
SL
RP RAP
Vref ≈ 2Iread*(RP//RAP)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 24 of 57
Proposed Sensing Scheme
Iref WL
Gen.
Vbias_R S1
S1B
2Iread
Iread
S2R
S4
S3R
S3R VR-VL S3R S4
0 C0
Offset
S1 S1 Equalizing
S2R
S2R
Cancelling
S1B
Ref BLs
BL
S4
WL
SL
RP RAP
Vref ≈ 2Iread*(RP//RAP)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 25 of 57
Proposed Sensing Scheme
Iref WL
Gen.
Vbias_R S1
S1B
2Iread
Iread
S2R
S4
S3R
S3R Vref Vref +VR-VL VBL S3R S4
C0 Offset BL
S1 S1 Equalizing
S2R
S2R
Cancelling Sampling
S1B
Ref BLs
BL
S4
WL
SL
RP RAP
Vref ≈ 2Iread*(RP//RAP)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 26 of 57
Proposed Sensing Scheme
Iref WL
Gen.
Vbias_R S1
S1B
2Iread
Iread
S2R
S4
S3R
S3R S3R S4
C0 Offset BL
S1 S1 Equalizing Evaluating
S2R
S2R
Cancelling Sampling
S1B
Ref BLs
BL
S4
WL
SL
RP RAP
Vref ≈ 2Iread*(RP//RAP)
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 27 of 57
Proposed Sensing Scheme
Iref WL
Gen.
Vbias_R S1
S1B
2Iread
Iread
S2R
S4
S3R
S3R S3R S4
C0 Offset BL
S1 S1 Equalizing Evaluating
S2R
S2R
Cancelling Sampling
S1B
Overlaps with decoding
Ref BLs
BL
S4
Standard Deviation of
40
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 30 of 57
Read-Compare-Write & Write-Verify-Write
Read
Hundreds of μAs current continuously applied
A Complete Write Cycle
Write Pros:
Only write necessary bits, avoid redundant
Read write
Verifying end of write avoids overwrite
End Cons:
[ISSCC’16] Multiple cycles are used for write
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 31 of 57
Proposed in-situ self-write-termination
Write
Read
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 32 of 57
Detection of Write Completion
Write 1 Write 0
VDD VSS
Current flows from BL to SL to write
1; from SL to BL to write 0
Iwrite1 Iwrite0
VH1 VH0
VL1 VL0
BL BL
RAP RP RP RAP
BL voltage drops abruptly once write
WL WL
is completed in both write 0/1 cases
SL SL
VSS VDD
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 33 of 57
Proposed Write Scheme
Iref Vbias_W0 Shared sense amplifier
Gen.
Vbias_W1
Self-termination STOP Data
with read
S4
Iwrite
S3W S3W
Vref_W
S0
S0
S1 C0 S1
S2W
S2W
S1B
BL
S4
WL
SL
Data
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 34 of 57
Proposed Write Scheme
Iref Vbias_W0
Gen.
Vbias_W1
Self-termination STOP Data
S4
Iwrite
Vref_W
S3W S3W WL
S0
S0
S0
S1 C0 S1
S2W
S2W
S1
S1B
S2W
BL
S4
S3W
WL
SL
STOP
Data BL
Offset
Cancelling
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 35 of 57
Proposed Write Scheme
Iref Vbias_W0
Gen.
Vbias_W1
Self-termination STOP Data
S4
Iwrite
Vref_W
S3W S3W WL
S0
S0
VDD S0 VSS
S1 C0 S1
S2W
S2W
S1
S1B
S2W
BL
S4
S3W
WL
SL
STOP
Data BL
Offset Precharge
Cancelling
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 36 of 57
Proposed Write Scheme
Iref Vbias_W0
Gen.
Vbias_W1
Self-termination STOP Data
S4
Iwrite
Vref_W
S3W S3W WL
S0
S0
VDD S0 VSS
S1 C0 S1
S2W
S2W
S1
S1B
S2W
BL
S4
S3W
WL
SL
STOP
Data BL
Offset Precharge BL
Cancelling Developing
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 37 of 57
Proposed Write Scheme
Iref Vbias_W0 Ends redundant write
Gen.
Vbias_W1
Self-termination STOP Data
S4
Iwrite
Vref_W
S3W S3W WL
S0
S0
VSS S0 VDD
S1 C0 S1
S2W
S2W
S1
S1B
S2W
BL
S4
S3W
WL
SL
STOP
Data BL
Offset
Precharge BL In-situ Already
Cancelling Written
Developing Evaluating
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 38 of 57
Proposed Write Scheme
Iref Vbias_W0 Ends redundant write
Gen.
Vbias_W1
Self-termination STOP Data Avoid overwrite
Overlaps with decoding
S4
Iwrite
Vref_W
S3W S3W WL
S0
S0
VSS S0 VDD
S1 C0 S1
S2W
S2W
S1
S1B
S2W
BL
S4
S3W
WL
SL
STOP
Write is
Data BL Done
Offset Precharge BL In-situ Already
Cancelling Written
Developing Evaluating
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 39 of 57
Outline
Motivation
Read margin improvement
Self-generated Vref
Offset-cancelled sense amplifier
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 40 of 57
Chip Photos
600μm
28μm
Bank Bank Bank Bank
Iref Ctrl.
350μm
0 2 4 6
145μm
BIST
Gen.
Bank Bank Bank Bank
1 3 5 7
1E-1
@120˚C
1E-5
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 43 of 57
Measured Read Results across 8 arrays
Array[1:8] @25˚C w/ External Vref
1E-3
Optimal Fixed Vref
1E-4
1E-5
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 44 of 57
Measured Read Results across 8 arrays
Array[1:8] @25˚C w/ External Vref
1E-3
Optimal Fixed Vref
1E-4
1E-5
2.2X
0
0.55 0.56 0.57 0.58 0.59 0.60
Measured SA VDD_min (V)
With offset cancellation, the average VDD_min of SA is 0.57V with
19mV standard deviation across 10 dies
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 47 of 57
Measured Write Results for 1Mb Chip
80
(%)
Write (%)
68%
w/
1E-1 70
Ratiow/
Self-TerminationWrite
SavingRatio
61% 60
1E-2
Self-Termination
47%
PowerSaving
1E-3
40
1E-4 30
Power
1E-5 20
Technology (nm) 28 40 65 90 65 65
Cell Type 1T1MTJ 1T1MTJ 1T1MTJ 2T1MTJ 2T2MTJ 2T2MTJ
Target Application NVM NVM NVM LLC1) LLC1) LLC1)
Cell Area (F2) 75 85 169 107
Capacity 1Mb 1Mb 64Mb 32Mb 1Mb 1Mb
Macro Area (mm2) 0.214 0.57 47.124 91.02 0.8196 0.628
Power Supply (V) 1.2/1.8 1.1/2.5 1.2 1.5 1.2/0.9/0.4 1.2/0.9/0.4
Word Length (bit) 16 32 16 32 256 256
Read Speed (ns) 2.8 10 30 12 3.3 4
Write Speed (ns) 20 30 12 3 4
Read Power (mW) 3.9 7.8 60 21.6 17.8
Write Power (mW) 3.6 9.3 91 55.4 46.5
1) MTJ cell in LLC applications do not require high retention, allowing low switching current MTJs
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 50 of 57
Comparison Table
This Work ISSCC’13 ISSCC’10 ISSCC’09 ISSCC’15 VLSI’13
Technology (nm) 28 40 65 90 65 65
Cell Type 1T1MTJ 1T1MTJ 1T1MTJ 2T1MTJ 2T2MTJ 2T2MTJ
Target Application NVM NVM NVM LLC1) LLC1) LLC1)
Cell Area (F2) 75 85 169 107
Capacity 1Mb 1Mb 64Mb 32Mb 1Mb 1Mb
Macro Area (mm2) 0.214 0.57 47.124 91.02 0.8196 0.628
Power Supply (V) 1.2/1.8 1.1/2.5 1.2 1.5 1.2/0.9/0.4 1.2/0.9/0.4
Word Length (bit) 16 32 16 32 256 256
Read Speed (ns) 2.8 10 30 12 3.3 4
Write Speed (ns) 20 30 12 3 4
Read Power (mW) 3.9 7.8 60 21.6 17.8
Write Power (mW) 3.6 9.3 91 55.4 46.5
1) MTJ cell in LLC applications do not require high retention, allowing low switching current MTJs
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 51 of 57
Comparison Table
This Work ISSCC’13 ISSCC’10 ISSCC’09 ISSCC’15 VLSI’13
Technology (nm) 28 40 65 90 65 65
Cell Type 1T1MTJ 1T1MTJ 1T1MTJ 2T1MTJ 2T2MTJ 2T2MTJ
Target Application NVM NVM NVM LLC1) LLC1) LLC1)
Cell Area (F2) 75 85 169 107
Capacity 1Mb 1Mb 64Mb 32Mb 1Mb 1Mb
Macro Area (mm2) 0.214 0.57 47.124 91.02 0.8196 0.628
Power Supply (V) 1.2/1.8 1.1/2.5 1.2 1.5 1.2/0.9/0.4 1.2/0.9/0.4
Word Length (bit) 16 32 16 32 256 256
Read Speed (ns) 2.8 10 30 12 3.3 4
Write Speed (ns) 20 30 12 3 4
Read Power (mW) 3.9 7.8 60 21.6 17.8
Write Power (mW) 3.6 9.3 91 55.4 46.5
1) MTJ cell in LLC applications do not require high retention, allowing low switching current MTJs
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 52 of 57
Comparison Table
This Work ISSCC’13 ISSCC’10 ISSCC’09 ISSCC’15 VLSI’13
Technology (nm) 28 40 65 90 65 65
Cell Type 1T1MTJ 1T1MTJ 1T1MTJ 2T1MTJ 2T2MTJ 2T2MTJ
Target Application NVM NVM NVM LLC1) LLC1) LLC1)
Cell Area (F2) 75 85 169 107
Capacity 1Mb 1Mb 64Mb 32Mb 1Mb 1Mb
Macro Area (mm2) 0.214 0.57 47.124 91.02 0.8196 0.628
Power Supply (V) 1.2/1.8 1.1/2.5 1.2 1.5 1.2/0.9/0.4 1.2/0.9/0.4
Word Length (bit) 16 32 16 32 256 256
Read Speed (ns) 2.8 10 30 12 3.3 4
Write Speed (ns) 20 30 12 3 4
Read Power (mW) 3.9 7.8 60 21.6 17.8
Write Power (mW) 3.6 9.3 91 55.4 46.5
1) MTJ cell in LLC applications do not require high retention, allowing low switching current MTJs
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 53 of 57
Outline
Motivation
Read margin improvement
Self-generated Vref
Offset-cancelled sense amplifier
Measurement Results
Conclusion
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 54 of 57
Conclusions
A 1Mb 28nm 1T1MTJ embedded STT-MRAM is presented
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 55 of 57
Conclusions
A 1Mb 28nm 1T1MTJ embedded STT-MRAM is presented
© 2018 IEEE 30.02 : A 1Mb 28nm STT-MRAM with 2.8ns Read Access Time at 1.2V VDD Using
International Solid-State Circuits Conference Single-Cap Offset-Cancelled Sense Amplifier and In-situ Self-Write-Termination 57 of 57
A 28nm 32Kb Embedded 2T2MTJ
STT-MRAM Macro with 1.3ns
Read-Access Time for Fast and
Reliable Read Applications
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 5 of 21
Outline
Introduction to STT-MRAM
Challenges of Conventional(CNV) Read Scheme
Proposed Continuous-Recording-and-Enhancement VSA
(CRE-VSA) Scheme
• Circuit structure
• Read operations
• Analysis
Measurement Results
Conclusion
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 6 of 21
Proposed CRE-VSA Scheme
Pre-charge/ Developing/ VBL_RD
Sensing
VTH Sampling CRE CNV-VSA ΔVIN Cell variation
- IN1/ IN2
Concept: VSS
VDD
Continuously enhance ∆VIN over TBL, despite
VRSM is decreasing after reaching its peak Proposed ΔVIN ΔVIN Continuous
CRE-VSA Enhancement
- IN1/ IN2
Feature: VSS
All Pass (Proposed)
• Continuous-Margin enhancement
1. Out of TSMW_CNV: Fail
• Variation tolerance in the input stage ΔVIN 2. Tail bit: Fail (CNV)
Peak:VRSM_MAX
(IN1-IN2)
VSS
• No timing overhead
TBL
Improve sensing yield and read speed TSMW_CNV
Improve Access time
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 7 of 21
CRE-VSA Operations- Standby Phase
Standby phase
• Reset all nodes to zero
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 8 of 21
Phase 1(P1)- VTH Sampling/ BL Pre-charge
P1 phase: VTH sampling
• VX1=VDD-VTH3 VTH is sampled on Cap.
• VX2=VDD-VTH4
VDD VDD
VBL_RD VBL_RD
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 9 of 21
Phase 2(P2)- CRE/ BL Developing-(T1)
P2 phase: Continuous-Recording and P1 P2 P3
VDD
CLK
Enhancement (CRE) (Time=T1) VSS
VDD
WL
• M3: VOD3=VS_BL VOD: Over-drive voltage (VGS-VTH) VSS
VBL_RD
BL/BLB
• M4: VOD4=VS_BLB VSS
VDD
Variation tolerance INI VSS
VDD
PRE VSS
VDD VDD VDD
SAEN VSS
X1/X2
VDD VSS
VDD-VTH3-VS_BL VDD-VTH4-VS_BLB
IN1/IN2
ΔVIN > VS VSS
VDD
Q/QB
VS_BL VS_BLB VSS
T1 T2
30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
Time
© 2018 IEEE
International Solid-State Circuits Conference 10 of 21
Phase 2(P2)- CRE/ BL Developing-(T2)
P2 phase: CRE (Time=T2)
• M3: VOD3=VS_BL (Saturation region)
• M4: VOD4=Negative (Sub-VTH region)
VDD-k*VS_BL ~VDD
VS_BL VS_BLB
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 11 of 21
Phase 3(P3)- Sensing Phase
P3 phase: Sensing
• VIN1=VDD-k*VS_BL
Margin enhancement
• VIN2=~VDD
VDD-k*VS_BL ~VDD
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 12 of 21
Analysis: Tolerate Low TMR Ratio
CRE-VSA achieves 5.85x margin enhancement
TMRMIN of CRE-VSA is 5.6x~8x lower than that of CNV-VSA
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 13 of 21
Analysis: Tolerate Low Read Voltage
VBL_RD of CRE-VSA is 2x lower than that of CNV-VSA
Reduce read disturb rate by 106x and 34% lower I/O read energy
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 14 of 21
Analysis: Improve Read Speed
This work achieves 1.5x~1.86x read speed improvement in different
BL length This work achieved the fastest read speed among
the reported STT-MRAM works
This work
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 15 of 21
Outline
Introduction to STT-MRAM
Challenges of Conventional(CNV) Read Scheme
Proposed Continuous-Recording-and-Enhancement VSA
(CRE-VSA) Scheme
• Circuit structure
• Read operations
• Analysis
Measurement Results
Conclusion
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 16 of 21
Test-chip Summary
Technology 28nm CMOS Process
A 28nm 32kb STT-MRAM chip Device STT-MRAM
was fabricated Cell type 2T-2MTJ
0.18mm2 (including test-
Macro area
We implement CRE-VSA and modes and conventional
Capacity 32kb
CNV-VSA for comparison purpose Sub-array 256 columns x 64 rows
I/O widths 64 bits
Supply Voltage VDD=0.9V
Read Voltage V BL_RD=0.14V~0.3V
CRE-VSA (Proposed) and
Sense Amp options
CNV-VSA (Conventional)
Read speed CRE-VSA: 1.3 ns
@V BL_RD=0.3V Conventional: 1.86 ns
Read speed CRE-VSA: 2 ns
@V BL_RD=0.15V Conventional: Fail
Measured Read 209.6 fJ (@ V BL_RD=0.3V)
Energy (fJ/bit) 172.8 fJ (@ V BL_RD=0.15V)
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 17 of 21
Measurement Results-1
Use DFF methodology to measure the read latency (TAC)
This work achieved 1.3ns macro-level read access time
TAC
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 18 of 21
Measurement Results-2
Achieved 1.3ns read access time (1.43x faster than CNV-VSA)
Minimum VBL_RD: Improve 140mV @ 25oC, Improve 190mV @ 75oC
VBL_RD [mV]
Acknowledgements
NTHU-MDL
TSMC
MOST-Taiwan
© 2018 IEEE 30.3 : A 28nm 32Kb Embedded 2T2MTJ STT-MRAM Macro with 1.3ns Read-Access Time for Fast and Reliable Read Applications
International Solid-State Circuits Conference 21 of 21
A 20ns-Write 45ns-Read and
1014-Cycle Endurance Memory Module
Composed of 60nm Crystalline Oxide
Semiconductor Transistors
S. Maeda, S. Ohshita, K. Furutani, Y. Yakubo,
T. Ishizu, T. Atsumi, Y. Ando, D. Matsubayashi,
K. Kato, T. Okuda, M. Fujita*, S. Yamazaki
Semiconductor Energy Laboratory, Kanagawa, Japan
*The University of Tokyo, Tokyo, Japan
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 1 of 22
Motivation
Performance required in LSI and memory for AI
–Low power consumption
• Low-power memory capable of long-term data retention
• Low energy for driving IOs
–High-performance processing
• Memory and architecture reducing data delay time
–High endurance
• Memory with high durability for more learning time
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 2 of 22
Motivation
Local memory architecture
Global memory architecture CMOS +
CMOS
Oxide Semiconductor
Chen, et al., Bong, et al., Logic
ISSCC2016 ISSCC2017 Memory
Memory
Processing
Engine (PE)
High data delay Low data delay Data retention w/o power
High energy for driving IOs Low energy for driving IOs Low area overhead
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 3 of 22
Outline
• Oxide Semiconductor (OS) Memory
• Design of OS Memory Module
• Measurement Results
• Summary
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 4 of 22
What Is OS Memory?
• Oxide semiconductor (OS): Novel channel material for FET
C-Axis-Aligned Crystalline In-Ga-Zn Oxide
• Memory using OSFET’s ultralow off-state current
Arrhenius plot of off-state current
1T1C cell 2T1C cell
OSFET OSFET
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 5 of 22
What Is OS Memory?
• OS process is compatible with CMOS process.
• OS memory can be embedded in BEOL including access Tr.
1T1C Cell 2T1C Cell MIM
OS FET
OS FET
Si FET Si FET
ID (A)
10-7
1.E-07
10-9
1.E-09
10-11
1.E-11
10-13
1.E-13
-1 -0.5 0 0.5 1 1.5 2 2.5
VG (V)
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 8 of 22
OS Dynamic Logic Circuit
Normal operation is possible even applying power gating.
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 9 of 22
Test Chip Configuration
• 1 kb memory cell (3T1C) array with 32b access
• OS dynamic logic for drivers
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 10 of 22
Test Chip Configuration
• 1 kb memory cell (3T1C) array with 32b access
• OS dynamic logic for drivers
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 11 of 22
Test Chip Configuration
• 1 kb memory cell (3T1C) array with 32b access
• OS dynamic logic for drivers
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 12 of 22
Test Chip Configuration
• 1 kb memory cell (3T1C) array with 32b access
• OS dynamic logic for drivers
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 13 of 22
Write Sequence of Memory Cell
• T1: Reset
– Fix WWL and WBL to 0V
• T2: Write
– Control access Tr. (M1) (0 V)
logic-1
logic-0
VDD = 3.3 V
VH (control signal) = 5.0 V Write time
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 14 of 22
Write Sequence of Memory Cell
• T1: Reset
– Fix WWL and WBL to 0V
• T2: Write
– Control access Tr. (M1) (0 V)
logic-1
logic-0
VDD = 3.3 V
VH (control signal) = 5.0 V Write time
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 15 of 22
Read Sequence of Memory Cell
• T1: Reset
– Fix RBL to 0V
– Fix OUT to 3.3V
• T2: Read
– Control selection Tr. (M3) (3.3 V)
– Control read Tr. (M2) with
SN voltage
logic-1
– Determine data of read logic-0
circuit with RBL voltage
VDD = 3.3V
VH(Control signal) = 5.0V Read time
© 2018 IEEE
(0 V)
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 16 of 22
Read Sequence of Memory Cell
• T1: Reset
– Fix RBL to 0V
– Fix OUT to 3.3V
• T2: Read
– Control selection Tr. (M3) (3.3 V)
– Control read Tr. (M2) with
SN voltage
logic-1
– Determine data of read logic-0
circuit with RBL voltage
VDD = 3.3V
VH(Control signal) = 5.0V Read time
© 2018 IEEE
(0 V)
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 17 of 22
Reduction of Read Time
• Lowered Vt of read transistors by back gate
• 33% reduction in read time by bootstrapping RBL
Typical, RT
w/o bootstrap w/ bootstrap 80
70
-33%
60
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 19 of 22
Endurance
>1014-cycle endurance is achieved.
10-4
1.E-04 2.50
10-5
1.E-05
Logic-1
10-6
1.E-06 1.50
10-7
1.E-07 Logic-0
0.50
ISL (A)
Vt (V)
10-8
1.E-08
Logic-0
10-9
1.E-09
-0.50
10 -10
1.E-10
10 -11
1.E-11
-1.50
Logic-1
10 -12
1.E-12
10 -13
1.E-13 -2.50
1.E+00
0 1.E+02
2 1.E+04
4 1.E+06
6 1.E+08
8 1.E+10
10 1.E+12
12 1.E+14
-3 -2 -1 01 2 3 4 10 10 10 10 10 10 1410 10
VRWL (V) Write cycles
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 20 of 22
Die Photo and Comparison Table
• Standby power (RT) : 9.9 nW
• Write power (RT) : 97.9 μW/MHz
• Read power (RT) : 123.6 μW/MHz (CL=10fF)
© 2018 IEEE
International Solid-State Circuits Conference 30.4: A 20ns-Write 45ns-Read and 1014-Cycle Endurance Memory Module Composed of 60nm Crystalline Oxide Semiconductor Transistors 22 of 22