eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

Reinforcement Learning Accelerator using Q-Learning Algorithm with Optimized Bit Precision

Resource Type: Conference
Authors: Sutisna, Nana; Ilmy, Andi M. R.; Setiawan, Handi Nugroho; Syafalni, Infall; Mulyawan, Rahmat; Ahmadi, Nur; Adiono, Trio
Source: 2022 8th International Conference on Wireless and Telematics (ICWT) Wireless and Telematics (ICWT), 2022 8th International Conference on. :1-5 Jul, 2022
Subject: Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Wireless communication
Q-learning
Automation
Navigation
Computer architecture
Telematics
Generators
Reinforcement Learning
Q-Learning
Approximate Computing
Low Complexity
Bits Precision
Language

Online Access

Full Text (IEEE)

초록

This paper presents a Reinforcement Learning (RL) accelerator using Q-learning algorithm with optimized bit precision. In this work, we perform evaluation of the employed bit width of the data path subject to accuracy of the Q-values. The designed RL accelerator is implementing the Q-Learning algorithm that comprises several blocks: Q-Value memories, Q-Updater, Policy Generator and, Environment block. In addition, we also present the corresponding architecture and implement the design in the FPGA. Experimental results show that the number of bits can be reduced from 32 bits to 16 bits without sacrificing the accuracy. The accuracy can be maintained at around 88% when employing 16 bits data path with 10 bits fraction. Moreover, the designed 16 bits RL accelerator design size offers reduction of LUTs and FFs compared to 32 bits implementation by around 40% and 14 %, respectively. Hence, the optimized accelerator can be useful for low-complexity systems or limited resources such as in robot automation for smart navigation and smart mapping.

공지

DAU Library

eArticles

요약정보

Reinforcement Learning Accelerator using Q-Learning Algorithm with Optimized Bit Precision

Online Access

초록