Experience-replay Innovative Dynamics

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Multi-agent reinforcement learning (MARL) has achieved groundbreaking success in recent years. Yet, several open problems remain, including nonstationarity and instability. Evolutionary game theory (EGT) provides a theoretical framework to tackle instability by leveraging the properties of its most well-known model, namely, the replicator dynamics, for theoretical guarantees of convergence to Nash equilibria. However, these guarantees do not hold true in certain settings, e.g., zero-sum games. In contrast, innovative dynamics, such as the Brown-von Neumann-Nash (BNN) or Smith, retain the convergence guarantees in these settings. We develop a novel MARL algorithm based on innovative dynamics with a sampling process that resembles experience replay. We show that our approach is theoretically grounded as other state-of-the-art MARL algorithms, but most importantly it outperforms other approaches in the case of nonstationary environments.

Original languageBritish English
Title of host publicationProceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
EditorsYevgeniy Vorobeychik, Sanmay Das, Ann Nowe
Pages2829-2831
Number of pages3
ISBN (Electronic)9798400714269
StatePublished - 2025
Event24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 - Detroit, United States
Duration: 19 May 202523 May 2025

Publication series

NameProceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
ISSN (Print)1548-8403
ISSN (Electronic)1558-2914

Conference

Conference24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
Country/TerritoryUnited States
CityDetroit
Period19/05/2523/05/25

Keywords

  • Evolutionary game theory
  • multi-agent systems
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Experience-replay Innovative Dynamics'. Together they form a unique fingerprint.

Cite this