A Universal RRAM-Based DNN Accelerator With Programmable Crossbars Beyond MVM Operator
-
Authors
Zhang, Zihan; Jiang, Jianfei; Zhu, Yongxin; Wang, Qin; Mao, Zhigang; Jing, Naifeng
-
Journal
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS
-
Year, Volume, ISSN
2022, 41, 0278-0070
-
Abstract
Resistive-RAM (RRAM)-based deep neural network (DNN) accelerators have shown great potential because they excel at the matrix-vector multiplication (MVM) operator. However, they do not benefit non-MVM operators, such as transcendental activations or elementwise operations, which often require customized CMOS circuits in conventional DNN accelerator designs. In this article, we propose a new RRAM-based DNN inference accelerator that leverages the proposed RRAM-CORDIC and RRAM-MLP algorithms to make transcendental and elementwise operators computable in the RRAM crossbar, just like MVM. Both algorithms exploit the high multiply-and-accumulate (MAC) parallelism that is traditionally expensive in CMOS but efficient in the RRAM crossbar. We further propose an intercrossbar pipelining scheme that balances the number of crossbars assigned to MVM and non-MVM operations and orchestrates them to pursue higher DNN computing throughput. The experimental results show that both algorithms sustain high arithmetic accuracy and deliver less than 1% DNN accuracy loss on typical inference workloads. In turn, eliminating the expensive CMOS circuits frees more crossbar resources within the same area, speeding up performance by 1.16x to 2.33x. With the extended operators, the RRAM-based DNN accelerator can switch crossbar functions at will and serve a diverse range of DNN models in a unified in-memory accelerator architecture.
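The key enabler behind an approach like RRAM-CORDIC is that CORDIC reduces transcendental functions to a fixed sequence of shift-and-add iterations, i.e., the same multiply-and-accumulate pattern a crossbar evaluates natively. As an illustrative sketch only (not the authors' implementation), a rotation-mode circular CORDIC computing sine and cosine in plain Python:

```python
import math

def cordic_sincos(theta, iters=32):
    """Rotation-mode circular CORDIC: return (cos(theta), sin(theta))
    for |theta| < pi/2 using only shift-and-add style updates -- the
    fixed MAC pattern that maps naturally onto crossbar hardware."""
    # Precomputed micro-rotation angles atan(2^-i) and the CORDIC gain K.
    angles = [math.atan(2.0 ** -i) for i in range(iters)]
    K = 1.0
    for i in range(iters):
        K /= math.sqrt(1.0 + 2.0 ** (-2 * i))
    x, y, z = 1.0, 0.0, theta
    for i in range(iters):
        d = 1.0 if z >= 0 else -1.0          # rotation direction
        x, y = x - d * y * 2.0 ** -i, y + d * x * 2.0 ** -i
        z -= d * angles[i]                   # drive residual angle to 0
    return x * K, y * K  # scale out the accumulated gain: (cos, sin)
```

Each iteration uses only a constant per-step coefficient (2^-i) times a running value, so all micro-rotations can be laid out as a fixed weight pattern and executed as MACs rather than with dedicated CMOS function units.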