Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication

Adrian Redder; Arunselvan Ramaswamy; Holger Karl

doi:10.5220/0010845400003116

Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication

Adrian Redder, Arunselvan Ramaswamy, Holger Karl

2022

Abstract

Distributed online learning over delaying communication networks is a fundamental problem in multi-agent learning, since the convergence behaviour of interacting agents is distorted by their delayed communication. It is a priori unclear, how much communication delay can be allowed, such that the joint policies of multiple agents can still converge to a solution of a multi-agent learning problem. In this work, we present the decentralization of the well known deep deterministic policy gradient algorithm using a communication network. We illustrate the convergence of the algorithm and the effect of lossy communication on the rate of convergence for a two-agent flow control problem, where the agents exchange their local information over a delaying wireless network. Finally, we discuss theoretical implications for this algorithm using recent advances in the theory of age of information and deep reinforcement learning.

Download

Paper Citation

in Harvard Style

Redder A., Ramaswamy A. and Karl H. (2022). Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-758-547-0, pages 282-289. DOI: 10.5220/0010845400003116

in Bibtex Style

@conference{icaart22,
author={Adrian Redder and Arunselvan Ramaswamy and Holger Karl},
title={Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2022},
pages={282-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010845400003116},
isbn={978-989-758-547-0},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - Multi-agent Policy Gradient Algorithms for Cyber-physical Systems with Lossy Communication
SN - 978-989-758-547-0
AU - Redder A.
AU - Ramaswamy A.
AU - Karl H.
PY - 2022
SP - 282
EP - 289
DO - 10.5220/0010845400003116