Hybrid LMC: Hybrid Learning and Model-based Control for Wheeled Humanoid Robot via Ensemble Deep Reinforcement Learning

by   Donghoon Baek, et al.

Control of wheeled humanoid locomotion is a challenging problem due to the nonlinear dynamics and under-actuated characteristics of these robots. Traditionally, feedback controllers have been utilized for stabilization and locomotion. However, these methods are often limited by the fidelity of the underlying model used, choice of controller, and environmental variables considered (surface type, ground inclination, etc). Recent advances in reinforcement learning (RL) offer promising methods to tackle some of these conventional feedback controller issues, but require large amounts of interaction data to learn. Here, we propose a hybrid learning and model-based controller Hybrid LMC that combines the strengths of a classical linear quadratic regulator (LQR) and ensemble deep reinforcement learning. Ensemble deep reinforcement learning is composed of multiple Soft Actor-Critic (SAC) and is utilized in reducing the variance of RL networks. By using a feedback controller in tandem the network exhibits stable performance in the early stages of training. As a preliminary step, we explore the viability of Hybrid LMC in controlling wheeled locomotion of a humanoid robot over a set of different physical parameters in MuJoCo simulator. Our results show that Hybrid LMC achieves better performance compared to other existing techniques and has increased sample efficiency


page 1

page 4


Learning Ball-balancing Robot Through Deep Reinforcement Learning

The ball-balancing robot (ballbot) is a good platform to test the effect...

Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for RABBIT

The design of feedback controllers for bipedal robots is challenging due...

Memory-based Controllers for Efficient Data-driven Control of Soft Robots

Controller design for soft robots is challenging due to nonlinear deform...

CPG-ACTOR: Reinforcement Learning for Central Pattern Generators

Central Pattern Generators (CPGs) have several properties desirable for ...

Learning to Locomote with Deep Neural-Network and CPG-based Control in a Soft Snake Robot

In this paper, we present a new locomotion control method for soft robot...

Learning Bipedal Walking for Humanoids with Current Feedback

Recent advances in deep reinforcement learning (RL) based techniques com...

Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning

Through the method of Learning Feedback Linearization, we seek to learn ...

Please sign up or login with your details

Forgot password? Click here to reset