eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

JNPU: A 1.04TFLOPS Joint-DNN Training Processor with Speculative Cyclic Quantization and Triple Heterogeneity on Microarchitecture / Precision / Dataflow

Resource Type: Conference
Authors: Yang, Je; Lim, Sukbin; Lee, Sukjin; Kim, Jae-Young; Kim, Joo-Young
Source: ESSCIRC 2023- IEEE 49th European Solid State Circuits Conference (ESSCIRC) Solid State Circuits Conference (ESSCIRC), ESSCIRC 2023- IEEE 49th European. :349-352 Sep, 2023
Subject: Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Photonics and Electrooptics
Signal Processing and Analysis
Training
Quantization (signal)
Microarchitecture
Metaverse
Neural networks
Memory management
Energy efficiency
Deep neural network (DNN) training processor
Multiple DNN acceleration
Dataflow
Quantization
Language
ISSN: 2643-1319

Online Access

Full Text (IEEE)

초록

This paper presents JNPU, a 1. 04TFLOPS joint-DNN accelerator that can simultaneously run joint-DNN (MobileNet + GoogLeNet) models with 245FPS (inference) and 1. 26TFLOPS/W (training). It proposes speculative cyclic quantization that enables integer-dominant operations and reduces external memory access by 87.5%. Its tangram dataflow mapper provides optimized sets of heterogeneous stationary types for both forward and backward propagation, enhancing efficiency up to 71.6%. Lastly, its novel processing cluster leverages triple heterogeneity on INT8 arrays and FP16 vector processor, saving 56.3% and 26.9% of computing area and power, respectively.

공지

DAU Library

eArticles

요약정보

JNPU: A 1.04TFLOPS Joint-DNN Training Processor with Speculative Cyclic Quantization and Triple Heterogeneity on Microarchitecture / Precision / Dataflow

Online Access

초록