Markov decision processes (MDPs) are probabilistic models widely used in areas such as control theory, game theory, machine learning, and robotics. Recent years have seen a surge of research interest in the control of MDPs under temporal logic specifications. Existing methods, such as abstraction-based and receding-horizon approaches, are hard to extend to MDPs with continuous states, require precise knowledge of the model, and are computationally demanding due to the curse of dimensionality. In this letter, we propose a randomized controller design algorithm for continuous-state MDPs with unknown transition probabilities under signal temporal logic (STL) specifications. Our basic idea is to convert the controller design into an optimization problem with the STL robustness index as the cost function, where the optimal control policy corresponds to an optimal solution that follows a probability distribution. A sampling approach is employed to asymptotically approximate this optimal distribution. The convergence of the algorithm is formally proved, together with an estimate of the convergence rate. A numerical example illustrates the effectiveness of the proposed method.
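The idea of optimizing a policy by sampling, with STL robustness as the objective, can be illustrated with a minimal sketch. The system (a 1-D random walk), the specification ("eventually x > 5"), the constant-gain policy, and the cross-entropy-style update below are all illustrative assumptions for exposition; they are not the algorithm or model from this letter.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 20  # horizon length (illustrative)

def rollout(theta):
    # Toy continuous-state stochastic system x_{t+1} = x_t + u_t + w_t
    # under a constant policy u_t = clip(theta, -1, 1). The true
    # transition noise is treated as unknown: we only sample from it.
    x = np.zeros(T + 1)
    for t in range(T):
        u = np.clip(theta, -1.0, 1.0)
        x[t + 1] = x[t] + u + 0.05 * rng.standard_normal()
    return x

def robustness(x):
    # STL robustness of "eventually x > 5 within the horizon":
    # rho = max_t (x_t - 5); positive iff the trajectory satisfies it.
    return np.max(x - 5.0)

# Sampling-based policy search: maintain a Gaussian distribution over
# the policy parameter and move it toward high-robustness samples
# (a cross-entropy-style stand-in for the paper's sampling scheme).
mu, sigma = 0.0, 1.0
for it in range(30):
    thetas = rng.normal(mu, sigma, size=100)
    scores = np.array([robustness(rollout(th)) for th in thetas])
    elites = thetas[np.argsort(scores)[-10:]]  # keep top 10%
    mu, sigma = elites.mean(), elites.std() + 1e-6

print(f"mean policy parameter: {mu:.2f}, best robustness: {scores.max():.2f}")
```

The distribution over policy parameters concentrates on values that drive the state past the threshold, so the best sampled robustness becomes positive, i.e., the specification is satisfied.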