Quad-multiplier packing based on customized floating point for convolutional neural networks on FPGA

Resource Type: Conference
Authors: Zhang, Zhifeng; Zhou, Dajiang; Wang, Shihao; Kimura, Shinji
Source: 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 184-189, Jan. 2018
Subject: Components, Circuits, Devices and Systems; Computing and Processing; Training; Field programmable gate arrays; Kernel; Hardware; Convolutional neural networks; Computer vision
Language
ISSN: 2153-697X

Abstract

Deep convolutional neural networks (CNNs) are widely used in many computer vision tasks. Since CNNs involve billions of computations, it is critical to reduce resource and power consumption and to improve parallelism. Compared with the extensive research on fixed-point conversion for cost reduction, floating-point customization has received little attention because of its higher cost relative to fixed point. This paper explores customized floating point for both the training and inference of CNNs. A 9-bit customized floating-point format is found sufficient for training ResNet-20 on the CIFAR-10 dataset with less than 1% accuracy loss, and it can also be applied to CNN inference. With the reduced bit-width, a computational unit (CU) based on Quad-Multiplier Packing is proposed to improve the resource efficiency of CNNs on FPGA. This design saves 87.5% of DSP slices and 62.5% of LUTs on the Xilinx Kintex-7 platform compared with a CU using 32-bit floating point. More CUs can therefore be placed on the FPGA, and higher throughput can be expected accordingly.
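
To make the idea of a reduced-width floating-point format concrete, the following is a minimal Python sketch of rounding a value to a small custom format. The abstract does not state how the 9 bits are divided, so the split used here (1 sign bit, 5 exponent bits, 3 mantissa bits) is purely an assumption, as are the function name `quantize_custom_float`, the IEEE-style bias and subnormal handling, and the saturating overflow behavior. It is not the authors' implementation.

```python
import math


def quantize_custom_float(x, exp_bits=5, man_bits=3):
    """Round x to the nearest value of a small float format.

    Hypothetical format: 1 sign bit + exp_bits + man_bits.
    The default 1 + 5 + 3 = 9 bits is an assumed split, not the paper's.
    """
    if x == 0.0 or math.isnan(x) or math.isinf(x):
        return x

    bias = 2 ** (exp_bits - 1) - 1
    e_min = 1 - bias                    # smallest normal exponent
    e_max = 2 ** exp_bits - 2 - bias    # largest exponent (top code reserved)

    # frexp gives abs(x) = m * 2**e with 0.5 <= m < 1; shift to 1 <= m < 2.
    _, e = math.frexp(abs(x))
    e -= 1
    e = max(e, e_min)                   # below e_min, fall into subnormal spacing

    # Spacing between representable values in this binade.
    step = 2.0 ** (e - man_bits)
    q = round(abs(x) / step) * step     # Python rounds ties to even

    # Saturate instead of overflowing to infinity (an assumption).
    max_val = (2.0 - 2.0 ** -man_bits) * 2.0 ** e_max
    q = min(q, max_val)
    return math.copysign(q, x)


if __name__ == "__main__":
    for v in (1.0, 0.3, -1.1, 1e6):
        print(v, "->", quantize_custom_float(v))
```

With only 3 mantissa bits, values between 1 and 2 are spaced 0.125 apart, which gives a feel for the precision that training at such a width would have to tolerate. Changing `exp_bits` and `man_bits` lets one trade dynamic range against precision within the same total width.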
