This work was part-funded by UK Research and Innovation
(UKRI) and Loughborough University. We would like to
thank Dr. Joe Collenette and Dr. Katie Atkinson
(University of Liverpool) for their provision of the
code from their implementation of Moody SARSA, and for
their friendly support and communi-