Opponent modelling for reinforcement learning in multi-objective normal form games

Yijie Zhang, Roxana Rădulescu, Patrick Mannion, Diederik M. Roijers, Ann Nowé

    Research output: Chapter in Book or Conference Publication/ProceedingConference Publicationpeer-review

    16 Citations (Scopus)

    Abstract

    In this paper, we investigate the effects of opponent modelling on multi-objective multi-agent interactions with non-linear utilities. Specifically, we consider multi-objective normal form games (MONFGs) with non-linear utility functions under the scalarised expected returns optimisation criterion. We contribute a novel actor-critic formulation to allow reinforcement learning of mixed strategies in this setting, along with an extension that incorporates opponent policy reconstruction using conditional action frequencies. Our empirical results demonstrate that opponent modelling can drastically alter the learning dynamics in this setting.

    Original languageEnglish
    Title of host publicationProceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
    EditorsBo An, Amal El Fallah Seghrouchni, Gita Sukthankar
    PublisherInternational Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
    Pages2080-2082
    Number of pages3
    ISBN (Electronic)9781450375184
    Publication statusPublished - 2020
    Event19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 - Virtual, Auckland, New Zealand
    Duration: 19 May 2020 → …

    Publication series

    NameProceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
    Volume2020-May
    ISSN (Print)1548-8403
    ISSN (Electronic)1558-2914

    Conference

    Conference19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
    Country/TerritoryNew Zealand
    CityVirtual, Auckland
    Period19/05/20 → …

    Keywords

    • Game theory
    • Multi-agent systems
    • Multi-objective decision making
    • Nash equilibrium
    • Opponent modelling
    • Reinforcement learning

    Fingerprint

    Dive into the research topics of 'Opponent modelling for reinforcement learning in multi-objective normal form games'. Together they form a unique fingerprint.

    Cite this