CEM-RL: Combining evolutionary and gradient-based methods for policy search

10/02/2018
by   Aloïs Pourchot, et al.
0

Deep neuroevolution and deep reinforcement learning (deep RL) algorithms are two popular approaches to policy search. The former is widely applicable and rather stable, but suffers from low sample efficiency. By contrast, the latter is more sample efficient, but the most sample efficient variants are also rather unstable and highly sensitive to hyper-parameter setting. So far, these families of methods have mostly been compared as competing tools. However, an emerging approach consists in combining them so as to get the best of both worlds. Two previously existing combinations use either a standard evolutionary algorithm or a goal exploration process together with the DDPG algorithm, a sample efficient off-policy deep RL algorithm. In this paper, we propose a different combination scheme using the simple cross-entropy method (CEM) and TD3, another off-policy deep RL algorithm which improves over DDPG. We evaluate the resulting algorithm, CEM-RL, on a set of benchmarks classically used in deep RL. We show that CEM-RL benefits from several advantages over its competitors and offers a satisfactory trade-off between performance and sample efficiency.

READ FULL TEXT
research
06/15/2020

QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning

We propose a novel reinforcement learning algorithm,QD-RL, that incorpor...
research
08/17/2018

Importance mixing: Improving sample reuse in evolutionary policy search methods

Deep neuroevolution, that is evolutionary policy search methods based on...
research
10/26/2022

ERL-Re^2: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation

Deep Reinforcement Learning (Deep RL) and Evolutionary Algorithm (EA) ar...
research
07/01/2019

FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

In recent years significant progress has been made in dealing with chall...
research
02/23/2023

Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMs

Calibrating agent-based models (ABMs) in economics and finance typically...
research
03/26/2022

Combining Evolution and Deep Reinforcement Learning for Policy Search: a Survey

Deep neuroevolution and deep Reinforcement Learning have received a lot ...
research
05/25/2020

Formal Methods with a Touch of Magic

Machine learning and formal methods have complimentary benefits and draw...

Please sign up or login with your details

Forgot password? Click here to reset