Computing hardware such as field programmable gate arrays (FPGAs), microcontrollers, and microprocessors can have limited compute and on-chip storage resources. This is especially true for hardware in Internet of Things (IoT) devices and low-end embedded systems. With the growth of machine learning and deep learning, it is imperative to build intelligence into these devices. This paper therefore proposes exploiting weight statistics to compress the floating-point weights of neural networks without any loss of accuracy. The proposed method has been implemented as an optimization pass in the open-source N2D2 framework. Unlike approaches such as quantization or binary weights, the method does not assume that the application can tolerate some loss of accuracy; it can, however, also be applied as a further optimization step after existing quantization-based methods. The proposed method saves nearly 10% of the on-chip storage requirement, reducing the number of Block RAMs (BRAMs) on FPGAs and the size of on-chip memory (OCM) on microcontrollers and microprocessors. We show that layer-wise compression yields slightly better compression than global compression. This compression is traded off against an execution-time overhead on microcontrollers.
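
As a purely illustrative sketch (not the paper's actual N2D2 pass), the following shows one way skewed weight statistics can be exploited for lossless compression: the sign/exponent bytes of trained float32 weights cluster in a narrow range, so grouping bytes of equal significance before a general-purpose entropy coder (here zlib, used only as a stand-in) shrinks the stored size while remaining bit-exact. The Gaussian array below is a hypothetical stand-in for trained weights.

```python
# Illustrative only: a hypothetical lossless compression of float32 weights
# that exploits weight statistics; it does not reproduce the paper's method.
import zlib
import numpy as np

def compress_weights_lossless(weights: np.ndarray) -> bytes:
    """Losslessly compress float32 weights by grouping bytes of equal
    significance, so the skewed statistics of the sign/exponent bytes
    (trained weights cluster near zero) are exposed to the entropy coder."""
    raw = weights.astype(np.float32).tobytes()
    # Transpose the byte stream: all byte-0s, then all byte-1s, ... so that
    # statistically similar bytes are adjacent.
    planes = np.frombuffer(raw, dtype=np.uint8).reshape(-1, 4).T.copy()
    return zlib.compress(planes.tobytes(), level=9)

def decompress_weights(blob: bytes, count: int) -> np.ndarray:
    """Exact inverse: the original bit pattern is recovered, so model
    accuracy is unchanged."""
    planes = np.frombuffer(zlib.decompress(blob), dtype=np.uint8)
    interleaved = planes.reshape(4, count).T.copy()
    return np.frombuffer(interleaved.tobytes(), dtype=np.float32)

if __name__ == "__main__":
    w = np.random.normal(0.0, 0.05, size=100_000).astype(np.float32)  # stand-in weights
    blob = compress_weights_lossless(w)
    restored = decompress_weights(blob, w.size)
    assert np.array_equal(w, restored)            # bit-exact, no accuracy loss
    print(f"ratio: {len(blob) / w.nbytes:.2f}")   # < 1.0 when statistics are skewed
```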