A compute-in-memory chip based on resistive random-access memory

Weier Wan; Rajkumar Kubendran; Clemens Schaefer; Sukru Burc Eryilmaz; Wenqiang Zhang; Dabin Wu; Stephen Deiss; Priyanka Raina; He Qian; Bin Gao; Siddharth Joshi; Huaqiang Wu; H-S Philip Wong; Gert Cauwenberghs

doi:10.1038/s41586-022-04992-8

A compute-in-memory chip based on resistive random-access memory

Nature. 2022 Aug;608(7923):504-512. doi: 10.1038/s41586-022-04992-8. Epub 2022 Aug 17.

Authors

Weier Wan^{1

2}, Rajkumar Kubendran^{3

4}, Clemens Schaefer⁵, Sukru Burc Eryilmaz⁶, Wenqiang Zhang⁷, Dabin Wu⁷, Stephen Deiss³, Priyanka Raina⁶, He Qian⁷, Bin Gao⁸, Siddharth Joshi^{9

10}, Huaqiang Wu¹¹, H-S Philip Wong¹², Gert Cauwenberghs¹³

Affiliations

¹ Stanford University, Stanford, CA, USA. weierwan@stanford.edu.
² University of California San Diego, La Jolla, CA, USA. weierwan@stanford.edu.
³ University of California San Diego, La Jolla, CA, USA.
⁴ University of Pittsburgh, Pittsburgh, PA, USA.
⁵ University of Notre Dame, Notre Dame, IN, USA.
⁶ Stanford University, Stanford, CA, USA.
⁷ Tsinghua University, Beijing, China.
⁸ Tsinghua University, Beijing, China. gaob1@tsinghua.edu.cn.
⁹ University of California San Diego, La Jolla, CA, USA. sjoshi2@nd.edu.
¹⁰ University of Notre Dame, Notre Dame, IN, USA. sjoshi2@nd.edu.
¹¹ Tsinghua University, Beijing, China. wuhq@tsinghua.edu.cn.
¹² Stanford University, Stanford, CA, USA. hspwong@stanford.edu.
¹³ University of California San Diego, La Jolla, CA, USA. gert@ucsd.edu.

Abstract

Realizing increasingly complex artificial intelligence (AI) functionalities directly on edge devices calls for unprecedented energy efficiency of edge hardware. Compute-in-memory (CIM) based on resistive random-access memory (RRAM)¹ promises to meet such demand by storing AI model weights in dense, analogue and non-volatile RRAM devices, and by performing AI computation directly within RRAM, thus eliminating power-hungry data movement between separate compute and memory^2-5. Although recent studies have demonstrated in-memory matrix-vector multiplication on fully integrated RRAM-CIM hardware^6-17, it remains a goal for a RRAM-CIM chip to simultaneously deliver high energy efficiency, versatility to support diverse models and software-comparable accuracy. Although efficiency, versatility and accuracy are all indispensable for broad adoption of the technology, the inter-related trade-offs among them cannot be addressed by isolated improvements on any single abstraction level of the design. Here, by co-optimizing across all hierarchies of the design from algorithms and architecture to circuits and devices, we present NeuRRAM-a RRAM-based CIM chip that simultaneously delivers versatility in reconfiguring CIM cores for diverse model architectures, energy efficiency that is two-times better than previous state-of-the-art RRAM-CIM chips across various computational bit-precisions, and inference accuracy comparable to software models quantized to four-bit weights across various AI tasks, including accuracy of 99.0 percent on MNIST¹⁸ and 85.7 percent on CIFAR-10¹⁹ image classification, 84.7-percent accuracy on Google speech command recognition²⁰, and a 70-percent reduction in image-reconstruction error on a Bayesian image-recovery task.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.