In the past decade, model-free reinforcement learning (RL) has provided
...
Recent policy optimization approaches (Schulman et al., 2015a, 2017) hav...
To train a statistical spoken dialogue system (SDS), it is essential that...
The natural language generation (NLG) component of a spoken dialogue sys...