In this paper, the optimal containment control problem for a class of unknown nonlinear multi-agent systems (MASs) is studied via a time-aggregation (TA) based model-free reinforcement learning (RL) algorithm. By introducing TA-based event-states, event-controls, and an integrated reward, a model-free TA-based policy iteration (TA-PI) approach is synthesized in which the policy evaluation and policy improvement steps are executed only over a finite set of event-states, so the optimal control protocol is obtained with lower computational requirements. Moreover, the control input is updated intermittently, only when the event-set is visited, which greatly reduces the update frequency of the controller. The proposed learning algorithm therefore saves computational resources in both the learning process and the control updates. Furthermore, owing to the finite predefined event-set, the developed TA-PI algorithm requires neither a function approximator nor state discretization, which permits a strict convergence analysis via mathematical induction. Finally, simulation results are given to demonstrate the feasibility and effectiveness of the proposed algorithm.