학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Federated Reinforcement Learning for Automatic Control in SDN-based IoT Environments

Resource Type: Conference
Authors: Lim, Hyun-Kyo; Kim, Ju-Bong; Kim, Sang-Youn; Han, Youn-Hee
Source: 2020 International Conference on Information and Communication Technology Convergence (ICTC) Information and Communication Technology Convergence (ICTC), 2020 International Conference on. :1868-1873 Oct, 2020
Subject: Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Signal Processing and Analysis
Transportation
Performance evaluation
Optimal control
Reinforcement learning
Information and communication technology
Robots
Smart manufacturing
Convergence
Federated reinforcement learning
Multi-IoT device control
Software-Defined networking
Language

Online Access

Full Text (IEEE)

초록

Recently, reinforcement learning has been applied to various fields and shows better performance than humans. In particular, it is attracting attention in the fields of smart factories and robotics that require automatic control without human intervention. In this paper, we try to allow multiple reinforcement learning agents to learn optimal control policy on their own IoT devices of the same type. There is no guarantee that the reinforcement learning agent that has learned the optimal control policy using one IoT device will perform optimal control of other IoT devices. Therefore, since reinforcement learning must be performed individually for each IoT device, it takes a lot of time and cost. To solve this problem, we propose a new method of federated reinforcement learning. In the proposed federated reinforcement learning, multiple agents have independent IoT devices, perform learning at the same time, and federate with each other to improve learning performance. Therefore, we apply a new gradient sharing method and transfer learning to reinforcement learning. In addition, Actor-Critic PPO, which shows good performance in reinforcement learning algorithms, is used. And, for smooth learning in the IoT environment where numerous devices exist, we propose an architecture based on Software-Defined Networking. Using multiple rotary inverted pendulum devices interconnected via a SDN, we demonstrate that the proposed federated reinforcement learning scheme can effectively facilitate the learning process for multiple IoT devices.

공지

DAU Library

학술논문

요약정보

Federated Reinforcement Learning for Automatic Control in SDN-based IoT Environments

Online Access

초록