Quantization is widely used for DNN compression and typically yields networks with multiple precisions, so the hardware must configure and support operations at different precisions across the entire network. Envision [1] and TinyVers [2] deploy 16-bit and 8-bit basic operation units, respectively, which are further split to perform lower-precision operations. UNPU [3], Bitblade [4], and Marsellus [5] use 1-bit×16-bit, 2-bit×2-bit, and 1-bit×1-bit basic operation units, respectively, which are combined to support higher-precision operations. Most of these works, however, cannot optimally select a precision for each layer of a network, which commonly results in low energy and area efficiency.
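To illustrate the combination scheme mentioned above, the sketch below shows how an 8-bit×8-bit unsigned multiplication can be assembled from 2-bit×2-bit partial products via shift-and-add. This is a generic software illustration of the principle, not the microarchitecture of any cited accelerator, and the function names are purely illustrative.

```python
def split_2bit(x, n_chunks=4):
    """Split an unsigned integer into 2-bit chunks, LSB first."""
    return [(x >> (2 * i)) & 0b11 for i in range(n_chunks)]

def mul_from_2bit_units(a, b):
    """Multiply two 8-bit unsigned values using only 2-bit x 2-bit products.

    a = sum_i a_i * 4^i and b = sum_j b_j * 4^j, so the full product is
    sum_{i,j} (a_i * b_j) << (2 * (i + j)): each term needs only a
    2-bit x 2-bit multiplier plus a shifter and an accumulator.
    """
    acc = 0
    for i, ai in enumerate(split_2bit(a)):
        for j, bj in enumerate(split_2bit(b)):
            acc += (ai * bj) << (2 * (i + j))  # shift-and-add combination
    return acc
```

The same idea works in reverse for the split-based designs: a wide multiplier can be gated into independent narrow lanes, trading one high-precision operation for several low-precision ones per cycle.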