While large language models (LMs) have shown remarkable capabilities acr...
Data-to-text generation is challenging due to the great variety of the i...
Probing is a popular way to analyze whether linguistic information can be capt...
Current practices in metric evaluation focus on a single dataset, e.g....
An important aspect of developing dialogue systems is how to evaluate an...
Vision-and-Language Navigation (VLN) is a natural language grounding tas...