Benchmarking Communicative Reinforcement Learning Frameworks on Multi-Robot Cooperative Tasks
- Resource Type
- Conference
- Authors
- Abbas, Muhammad Naveed; Liston, Paul; Lee, Brian; Qiao, Yuansong
- Source
- 2023 International Conference on Machine Learning and Applications (ICMLA) ICMLA Machine Learning and Applications (ICMLA), 2023 International Conference on. :988-993 Dec, 2023
- Subject
- Computing and Processing
Engineering Profession
Robotics and Control Systems
Signal Processing and Analysis
Training
Warehousing
Collaboration
Reinforcement learning
Benchmark testing
Digital communication
Fourth Industrial Revolution
communicative
cooperative
multi-agent reinforcement learning
multi-robots
non-communicative
warehouse
- Language
- ISSN
- 1946-0759
Industry 4.0 warehousing is characterised by autonomous multi-robot collaboration systems (MRSs) along with other technologies such as digital communication capabilities and the Internet of Things. These MRSs need to behave coherently for the efficient completion of the assigned cooperative tasks. Multi-agent reinforcement learning (MARL) frameworks are currently considered state-of-the-art to control the behaviour of autonomous MRSs. These MARL frameworks can be with learnable or predefined communication. Current works lack any worthwhile evaluation of communicative MARL frameworks on multi-robot cooperative tasks. This work empirically evaluates current state-of-the-art seminal learnable communicative MARL frameworks by comparing their performance against non-communicative MARL frameworks on multi-robot coop-erative tasks in the context of Industry 4.0 warehousing with the assumptions of partial observability and reward sparsity. The results demonstrate that communicative MARL frameworks outperform their counterparts by a fair margin in training (average returns between 11 and 6 against 8 and 4 for highest and lowest values respectively) and execution performances (average returns between 1.24 and 0.29 against 0.49 and 0.19 for highest and lowest values respectively). This leads to the conclusion that communicative MARL is better suited to multi-robot cooperative tasks under the above-mentioned assumptions.