dc.contributor.author | Abbas, Muhammad Naveed | |
dc.contributor.author | Liston, Paul | |
dc.contributor.author | Lee, Brian | |
dc.contributor.author | Qiao, Yuansong | |
dc.date.accessioned | 2024-12-18T16:08:08Z | |
dc.date.available | 2024-12-18T16:08:08Z | |
dc.date.copyright | 2023 | |
dc.date.issued | 2023-12-15 | |
dc.identifier.citation | Abbas, M., Liston, P., Lee, B., Qiao, Y. (2023). Benchmarking communicative reinforcement learning frameworks on multi-robot cooperative tasks. In 2023 International Conference on Machine Learning (ICMLA). 988-993. Jacksonville, Florida. 15-17 December 2023. DOI: 10.1109/ICMLA58977.2023.00146. | en_US |
dc.identifier.isbn | 1946-0740 | |
dc.identifier.isbn | 979-8-3503-4534-6 | |
dc.identifier.uri | https://research.thea.ie/handle/20.500.12065/4870 | |
dc.description.abstract | Industry 4.0 warehousing is characterised by autonomous multi-robot collaboration systems (MRSs) along with other technologies such as digital communication capabilities and the Internet of Things. These MRSs need to behave coherently for the efficient completion of the assigned cooperative tasks. Multi-agent reinforcement learning (MARL) frameworks are currently considered state-of-the-art to control the behaviour of autonomous MRSs. These MARL frameworks can be with learnable or predefined communication. Current works lack any worthwhile evaluation of communicative MARL frameworks on multi-robot cooperative tasks. This work empirically evaluates current state-of-the-art seminal learnable communicative MARL frameworks by comparing their performance against non-communicative MARL frameworks on multi-robot coop-erative tasks in the context of Industry 4.0 warehousing with the assumptions of partial observability and reward sparsity. The results demonstrate that communicative MARL frameworks outperform their counterparts by a fair margin in training (average returns between 11 and 6 against 8 and 4 for highest and lowest values respectively) and execution performances (average returns between 1.24 and 0.29 against 0.49 and 0.19 for highest and lowest values respectively). This leads to the conclusion that communicative MARL is better suited to multi-robot cooperative tasks under the above-mentioned assumptions. | en_US |
dc.format | PDF | en_US |
dc.language.iso | eng | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2023 International Conference on Machine Learning and Applications (ICMLA) | en_US |
dc.rights | Attribution-States | * |
dc.rights.uri | http://creativecommons.org/licenses/cc-by/4.0 | * |
dc.subject | Communicative | en_US |
dc.subject | Cooperative | en_US |
dc.subject | Multi-agent reinforcement learning | en_US |
dc.subject | Multi-robots | en_US |
dc.subject | Non-communicative | en_US |
dc.subject | Warehouse | en_US |
dc.title | Benchmarking communicative reinforcement learning frameworks on multi-robot cooperative tasks | en_US |
dc.conference.date | 2023-11-15 | |
dc.conference.host | IEEE | en_US |
dc.conference.location | Jacksonville, Florida | en_US |
dc.contributor.affiliation | Technological University of the Shannon: Midlands Midwest | en_US |
dc.description.peerreview | yes | en_US |
dc.identifier.doi | 10.1109/ICMLA58977.2023.00146 | en_US |
dc.identifier.eissn | 1946-0759 | |
dc.identifier.orcid | https://orcid.org/0000-0001-6820-3160 | en_US |
dc.identifier.orcid | https://orcid.org/0000-0003-2832-8975 | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-8475-4074 | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-1543-1589 | en_US |
dc.rights.accessrights | info:eu-repo/semantics/openAccess | en_US |
dc.subject.department | Software Research Institute: TUS Midlands | en_US |
dc.type.version | info:eu-repo/semantics/acceptedVersion | en_US |