Preference-based reinforcement learning (RL) provides a framework to tra...
The idea of using a separately trained target model (or teacher) to impr...
Preference-based reinforcement learning (RL) has shown potential for tea...
Behavioral cloning has proven to be effective for learning sequential
de...
Deep neural networks with millions of parameters may suffer from poor
ge...