With the development of convolutional neural networks (CNNs), deeper and wider networks improve accuracy but increase deployment difficulty. Designing lightweight algorithms and high-efficiency hardware accelerators has therefore become a research hotspot. In this article, we exploit the sparsity and quantized weights of CNNs to obtain lightweight models, reducing the number of weights to 28% and the weight bit width to 25% of the original models. To avoid unbalanced computation, we model sparse CNNs and propose a dataflow with a high processing-element (PE) utilization ratio and a low DRAM access count. We then design a sparse CNN accelerator based on shift units, in which zero weights are skipped to save execution time and energy. The design was taped out in a TSMC 28-nm process and packaged in a QFP144 package. The chip passes functional tests, achieving 256.1 GOPS of performance and 1.133 TOPS/W of energy efficiency. Compared with similar designs, ours achieves a 46.7% lower energy-delay product (EDP).
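To make the two key mechanisms concrete, the following is a minimal software sketch, not the authors' RTL, of what one PE's inner loop does: zero weights are skipped entirely, and because weights are quantized (here assumed to powers of two, which is what shift units imply), each multiply is replaced by a shift. The weight encoding (a nonzero flag, a sign bit, and a shift exponent) and all names are illustrative assumptions.

```c
/* Hedged sketch of a shift-based, zero-skipping PE inner loop.
 * Assumptions (not from the article): weights are power-of-two
 * quantized and stored as {nonzero flag, sign, shift exponent}. */
#include <stdint.h>
#include <stdio.h>

typedef struct {
    uint8_t nonzero; /* 0 => weight is zero; the PE skips the MAC */
    uint8_t sign;    /* 0 => positive, 1 => negative              */
    uint8_t shift;   /* weight magnitude is 2^shift               */
} qweight_t;

/* Accumulate activations against quantized weights. Zero weights
 * cost no shift/add, mirroring the zero-skip logic in hardware. */
static int32_t pe_dot(const int16_t *act, const qweight_t *w, int n)
{
    int32_t acc = 0;
    for (int i = 0; i < n; i++) {
        if (!w[i].nonzero)
            continue;                              /* skip zero weight      */
        int32_t p = (int32_t)act[i] << w[i].shift; /* shift replaces multiply */
        acc += w[i].sign ? -p : p;
    }
    return acc;
}

int main(void)
{
    int16_t   act[4] = { 3, -5, 7, 2 };
    qweight_t w[4]   = { {1,0,1}, {0,0,0}, {1,1,2}, {1,0,0} };
    /* weights are effectively {+2, 0, -4, +1}:
     * 3*2 + (skipped) + 7*(-4) + 2*1 = -20 */
    printf("%d\n", pe_dot(act, w, 4));
    return 0;
}
```

In hardware, the skip saves a cycle and the shift replaces a full multiplier; in this sketch both show up only as cheaper C operations, but the control flow is the same.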