Human vs Automatic Metrics: on the Importance of Correlation Design

05/29/2018
by   Anastasia Shimorina, et al.
0

This paper discusses two existing approaches to the correlation analysis between automatic evaluation metrics and human scores in the area of natural language generation. Our experiments show that depending on the usage of a system- or sentence-level correlation analysis, correlation results between automatic scores and human judgments are inconsistent.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset