Emergence of Cooperation in Heterogeneous Populations of Reinforcement-Learning Agents

Title

Emergence of Cooperation in Heterogeneous Populations of Reinforcement-Learning Agents

Subject

Mathematics

Creator

Daphne Lavi

Date

2025

Contributor

Paolo Turrini and Chin-wing Leung

Abstract

This project extends the framework of Leung and Turrini (AAMAS 2024) by introducing behavioural diversity into populations of reinforcement learning agents playing repeated Prisoner's Dilemma games with partner selection. Agents differ in their learning rate and exploration-exploitation balance, forming “curious” and “conservative” types. Simulations show that heterogeneity sustains long-term cooperation. These insights contribute to understanding how diversity influences collective learning dynamics in multi-agent systems.

Meta Tags

multi-agent systems, reinforcement learning, cooperation, social dilemmas, Prisoner’s Dilemma, partner selection, heterogeneity, exploration-exploitation, Q-learning, Boltzmann exploration, AI agents

Files

Collection

Citation

Daphne Lavi, “Emergence of Cooperation in Heterogeneous Populations of Reinforcement-Learning Agents,” URSS SHOWCASE, accessed November 4, 2025, https://urss.warwick.ac.uk/items/show/830.