Beruflich Dokumente
Kultur Dokumente
for Resistive
Memory Technology
A broad perspective on ReRAM technology
for future storage and computing.
©iStockphoto.com/sarawuth702
R
esistive random-access memory (ReRAM) is semiconductor (CMOS) technology. In addition to the data stor-
regarded as one of the most promising alternative non- age, ReRAM can also be used for logic operation and
volatile memory technologies for its advantages in computation, demonstrating a great potential in developing a
very-high-storage density, simple structure, low power non–von Neumann computing system. Extensive studies on
consumption, and long endurance, as well as good ReRAM technology, including material, device process, cell
compatibility with traditional complimentary metal–oxide– and array structure, circuit, and architecture, have been conduct-
ed in recent years. In this article, we provide a broad perspec-
Digital Object Identifier 10.1109/MCE.2016.2614523
tive on ReRAM technology for future storage and computing.
Date of publication: 14 December 2016 The models, challenges, and applications are also summarized.
Manufacturability
Scalability BEOL Standard CMOS Fab
Sub 10 nm Easy 3-D Stacking
eFlash Perf and Rel Degrades < 25 nm Existing Tools and Process
Density
Low Cost Process
1F2 with MLC and 3-D Terabytes
2 Additional Backend Masking Steps
NAND 4.8F2 32 GBs
3~4% Incremental Cost Over Standard CMOS
Read Performance
Temperature
<30 ns Initial Latency for Code Memory
10-Year Retention at 125 °C
2 µs Initial Latency for Data Storage
Flash < 1 Year Below 55 °C
NAND ~75 µs Random Access
Bipolar
Metallization Effects
taminates the real information of the target cell during a read.
Valency Change The performance of a crosspoint array thereby depends on
Memory Effects the amount of sneak path leakage current, which is deter-
mined by the number of memory devices as well as the resis-
Thermochemical
Memory Effects tance value of these cells. A popular solution at device level
Unipolar is integrating each ReRAM device with a selective device
Phase Change such as a transistor or a bipolar nonlinear selector, construct-
Memory Effects
ing 1T1R or 1S1R cell structure, respectively.
Improvements in reading procedure and peripheral circuit
FIGURE 2. A general category of resistive switching memory. designs have also been widely explored. For example, multi-
stage reading was introduced by HP Labs [17], which is a
self-referencing technique. The unfolded architecture [18]
inserts a separate column for each ReRAM device to cancel
BL out the impact of sneak paths. Intel proposed to utilize an
WL Selected operational amplifier for data detecting [19]. The scheme
(Vdd)
can suppress the sneak path leakage efficiently, but the large
Unselected
(0) area of operational amplifier severely constrains the design
Unselected
(0) scalability. The issue can be addressed by the weighted sens-
Unselected ing scheme that assigns different weights to different cells
(0)
and reads them simultaneously with one shared operational
amplifier [20].
FIGURE 4. The BEOL fabrication process for ReRAM (memristor) and NMOS transistors integrated arrays. (a) An NMOS substrate from
the FEOL process with exposed vias, (b) a metal deposition and patterning of bottom electrodes, (c) an active layer deposition, (d) an
insulator layer deposition and memristor bit definition (nm or μm), and (e) a metal deposition for top electrode and patterning. (f) A
cross-section diagram of the integrated array.
Bottom Electrodes
Metal Oxide Pillar Electrode
D
Access Electrodes
Transistor Gate
(WL)
F
ReRAM Cell
Hi H
m
z
GND BL Access SL
100 mm
Terminal Transistor
y x
FIGURE 5. An SEM image of memristor-NMOS fabricated arrays
with a bit size of 2 μm in diameter. FIGURE 6. A schematic view of a 3-D-VRAM array [21].
Embedded Stand-Alone
Dense Array
Selector
Nonvolatile Whereas DRAM requires a transistor
3-D XPoint Technology is at each memory cell—making it big
nonvolatile—which means your data and expensive—the amount of
doesn’t go away when your power goes voltage sent to each 3-D XPoint
away—making it a great choice Technology selector enables its
for storage. memory cell to be written to or read
without requiring a transistor.
High Endurance
Unlike other storage memory technologies,
3-D XPoint Technology is not significantly 1 Memory Cell
impacted by the number of write cycles it can Each memory cell can
endure, making it more durable. store a single bit of data.
Transforming the Memory Hierarchy ~8x to 10x Greater Density than DRAM
For the first time, there is a fast, inexpensive and 3-D XPoint Technology’s simple, stackable, transistor-less
nonvolatile memory technology that can serve as design packs more memory into less space, which is
system memory and storage. critical to reducing cost.
1 Gb 1 Gb 1 Gb
1 Gb 1 Gb 1 Gb 1 Gb
1 Gb 1 Gb 1 Gb
(a) (b)
FIGURE 11. A ReRAM crosspoint array for matrix-vector computing implementation. (a) The mathematical matrix-
vector computation and (b) the physical mapping to a crosspoint ReRAM array.
v0 g
WL
x0 w0 v1
w1 v2
x1 Σwi xi
w2
x2 v3
w3
x3 BL
Σgi vi
(a) (b)
FIGURE 12. (a) and (b) The implementation of sums-of-product computation in a neural network on a ReRAM
crosspoint array [34].
medical, the Internet of Things, mobile computing, and wear- joining together to bring ReRAM (called memristor by
ables, as indicated in Figure 8. them) technology to the storage memory market. The appli-
Crossbar Inc. has developed stand-alone 3-D ReRAM cations of ReRAM in storage memory can be classified into
products targeting data storage solutions by integrating the following areas [32].
thousands of ReRAM cells in a crosspoint [29]. As the
milestone research directed by Intel Corp. and Micron Tech- USE AS ON-CHIP CACHES
nology, the 3-D XPoint technology with initial 128-Gb stor- As the fabrication technology node shrinks down to 28 nm or
age per die using the resistive switching technology was below, the high standby power and die size become critical
released in 2015 [30]. The 3-D structure and wafer view can issues. Compared to conventional static random-access mem-
be seen in Figures 9 and 10, respectively. It is claimed that ory (SRAM) technology, ReRAM with nonvolatile data stor-
up to 1,000 times lower latency and exponentially greater age and simple array structure can eliminate the standby
endurance than nand flash memory can be obtained by power and significantly reduce the die size. Particularly, the
the 3-D XPoint technology, and high-speed and high-capac- easy 3-D integration of ReRAM on top of a CMOS processor
ity data storage can be delivered. It is worthwhile to men- will further reduce the final die area and accelerate the com-
tion the Machine project, a long-term HP Labs research munication between logic units and caches. The utilization of
project that is memory-driven computing leveraging nonvolatile ReRAM caches will also enable an instant on/off
ReRAM cells for storage. Notably, HP and SanDisk are feature and make data restore and system recovery easier and
Image Inputs
W
larger cache size and fewer off-chip requests. (ReRAM Periphery Digital
Crosspoint Circuits Outputs
SERVE AS MEMORY REPLACEMENT Array)
Output
The chief value of ReRAM as a memory extension is its Weight Matrix
Image
lower cost compared to dynamic random-access memory
(DRAM) and other characteristics. Supplying more memory
FIGURE 13. The basic computation in neural network implemen-
increases data locality and therefore minimizes the impact of tation with a ReRAM crosspoint array.
interserver messaging over low-speed networks. At the
application level, this facilitates faster information retrieval
and more efficient data processing. The following are some ReRAM FOR LOGIC OPERATION
potential applications of resistive memory. ReRAM technology has been used as new logic operation
▼▼ Big data analytics: Data is currently being generated at media, for example, for field-programmable gate arrays
unprecedented rates and from a growing variety of sources. (FPGAs). Many new FPGA architectures that use ReRAM to
The requirement for accurate and real-time analysis, how- replace conventional SRAM have been proposed, demonstrat-
ever, makes the data process and management very chal- ing significant improvements on the density of integration,
lenging. With large-capacity ReRAM at lower costs, a accessing performance, and energy consumption, and so forth.
server can be equipped with much more memory than For example, Chen et al. presented an FPGA architecture that
DRAM-only solutions, enabling high operation parallelism can completely substitute ReRAM for SRAM in all the major
and high utilization of the computation resources. components, including the logic blocks, switch blocks, and
▼▼ In-memory database: In-memory databases leverage sys- connection blocks [33]. In particular, the lookup table design
tem memory to access and process data very rapidly. The in logic blocks can be engineered in 3-D stacking to maximize
memory availability is critical to the real-world performance the benefit from the high density of ReRAM. It naturally sup-
and effectiveness of these databases. Proper u tilization of ports bit-addressable access and can be used as DRAM, which
ReRAM will fill the gap. usually has limited utilization in the conventional SRAM-
▼▼ Relational databases: When deploying a modern relational based FPGAs due to high design complexity and large area.
database management system, designers must balance con-
flicting requirements. A general goal is to minimize solu- ReRAM FOR COMPUTING
tion cost while maintaining consistent quality of service. People attempt to develop very-large-scale integration hard-
To cope with these demanding workloads, currently data- ware to mimic neurobiological architecture and therefore
bases implement in-memory caches called buffer pools. obtain advanced computing systems with high efficiency.
The introduction of ReRAM can increase the capacity of This approach is called neuromorphic computing. The advan-
memory, buffer, and caches of each server, offering a more tages of ReRAM in high density, fast operation, low power
scalable solution. consumption, and analog to biological synapse make it
extremely attractive in neuromorphic system implementa-
MASSIVE STORAGE DEVICE tion. It is worthwhile to mention that ReRAM crosspoint
A ReRAM-based storage device can be deployed in a distrib- can be naturally used to implement the matrix-vector
uted fashion throughout the server and provide a storage computation (or sum-of-product), as depicted in Figure 11
solution at the lowest latencies possible. It would be a good [8]. Neural networks in which sum-of-product computations
option when extreme performance on the storage is needed, are a fundamental mathematical component can be imple
such as with database and visualization applications. mented efficiently, as can be seen in Figure 12 [34].
▼▼
Database applications have evolved to make efficient use
of faster networking, computing, and memory. Storage has
not kept pace, with current solutions being a bottleneck on
database performance. ReRAM provides a high-perform-
Properly controlling active devices
ing, scalable, and predictable solution for databases. can significantly reduce the leakage
▼▼
Visualization applications require performance. Whether current and minimize the impacts
accelerating databases or virtual desktop infrastructure
instances, providing predictable response times and high of the sneak path.
throughput is essential. ReRAM-based storage helps with
predictable low response times across virtual machines, effi- For example, when implementing an artificial neural net-
cient scaling by consolidating servers with more virtual work for image recognition, digital images are encoded in float-
machines per physical host, the lowest latency, and the high- ing-point format and are then mapped to analog or digital
est bandwidth in storage. voltage signals. Figure 13 demonstrates the basic computation