학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Investigation of Effective Exploration Methods in Reinforcement Learning Control of a Flapping UAV / 羽ばたき型UAVの強化学習制御における効果的な探索法の検討

Resource Type: Journal Article
Authors: Kentaro HIRAI; Lifeng XIE; Miku SAITO; Shunto SASAZAKI; Takanobu WATANABE; Zhi LI; 平井健太郎; 李直; 渡邉孝信; 笹崎舜翔; 謝砺鋒; 齋藤未来
Source: The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec). 2023, :2-D11
Subject: Flapping UAV
Flight posture control
PID control
Reinforcement learning
Language: Japanese
ISSN: 2424-3124

Online Access

Find it @ DONGA

초록

We conducted investigation into an effective scheduling method of the exploration in a reinforcement learning algorithm, aiming at the control of a flapping unmanned aerial vehicle (UAV) we have developed. Deep Q Network (DQN) algorithm was employed to determine optimal gain parameters of PID control of the Yaw angle of the airframe. Although the Yaw angle can be stabilized by this PID-DQN hybrid method, we noticed that the gain parameters tend to be biased toward highly rated values in the early stages of the learning. In this study, we solved this problem by modifiying the scheduling of epsilon-greedy method in DQN.

공지

DAU Library

학술논문

요약정보

Investigation of Effective Exploration Methods in Reinforcement Learning Control of a Flapping UAV / 羽ばたき型UAVの強化学習制御における効果的な探索法の検討

Online Access

초록