Curriculum Learning for Tightly Coupled Multiagent Systems

Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

1 Citation (Scopus)

Abstract

In this paper, we leverage curriculum learning (CL) to improve the performance of multiagent systems (MAS) that are trained with the cooperative coevolution of artificial neural networks. We design curricula to progressively change two dimensions: scale (ie domain size) and coupling (ie the number of agents required to complete a subtask). We demonstrate that CL can successfully mitigate the challenge of learning on a sparse reward signal resulting from a high degree of coupling in complex MAS. We also show that, in most cases, the combination of difference reward shaping with CL can improve performance by up to 56%. We evaluate our CL methods on the tightly coupled multi-rover domain. CL increased converged system performance on all tasks presented. Furthermore, agents were only able to learn when trained with CL for most tasks.
Original languageEnglish (Ireland)
Title of host publication2019 International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)
PublisherInternational Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Pages2174-2176
Number of pages3
ISBN (Electronic)9781510892002
Publication statusPublished - 1 May 2019
Event18th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2019 - Montreal, Canada
Duration: 13 May 201917 May 2019

Publication series

Name1548-8403

Conference

Conference18th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2019
Country/TerritoryCanada
CityMontreal
Period13/05/1917/05/19

Keywords

  • Curriculum learning
  • Difference rewards
  • Multiagent coordination

Authors (Note for portal: view the doc link for the full list of authors)

  • Authors
  • Rockefeller, G; Mannion, P; Tumer, K

Fingerprint

Dive into the research topics of 'Curriculum Learning for Tightly Coupled Multiagent Systems'. Together they form a unique fingerprint.

Cite this