A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning

10/27/2021
by   Gugan Thoppe, et al.
0

In Multi-Agent Reinforcement Learning (MARL), multiple agents interact with a common environment, as also with each other, for solving a shared problem in sequential decision-making. It has wide-ranging applications in gaming, robotics, finance, etc. In this work, we derive a novel law of iterated logarithm for a family of distributed nonlinear stochastic approximation schemes that is useful in MARL. In particular, our result describes the convergence rate on almost every sample path where the algorithm converges. This result is the first of its kind in the distributed setup and provides deeper insights than the existing ones, which only discuss convergence rates in the expected or the CLT sense. Importantly, our result holds under significantly weaker assumptions: neither the gossip matrix needs to be doubly stochastic nor the stepsizes square summable. As an application, we show that, for the stepsize n^-γ with γ∈ (0, 1), the distributed TD(0) algorithm with linear function approximation has a convergence rate of O(√(n^-γln n )) a.s.; for the 1/n type stepsize, the same is O(√(n^-1lnln n)) a.s. These decay rates do not depend on the graph depicting the interactions among the different agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2019

Voting-Based Multi-Agent Reinforcement Learning

The recent success of single-agent reinforcement learning (RL) encourage...
research
08/07/2019

Fast multi-agent temporal-difference learning via homotopy stochastic primal-dual optimization

We consider a distributed multi-agent policy evaluation problem in reinf...
research
07/09/2020

Geometric Bounds for Convergence Rates of Averaging Algorithms

We develop a generic method for bounding the convergence rate of an aver...
research
09/23/2021

Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning

Cooperative multi-agent reinforcement learning is a decentralized paradi...
research
10/28/2020

Finite-Time Analysis of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning

Stochastic approximation, a data-driven approach for finding the fixed p...
research
11/24/2015

Performance Limits of Stochastic Sub-Gradient Learning, Part I: Single Agent Case

In this work and the supporting Part II, we examine the performance of s...
research
09/30/2020

Accelerating Optimization and Reinforcement Learning with Quasi-Stochastic Approximation

The ODE method has been a workhorse for algorithm design and analysis si...

Please sign up or login with your details

Forgot password? Click here to reset