DFKI NLP
DFKI NLP
Home
News
People
Publications
Projects
Datasets
Contact
Technical-Report
Order in the Evaluation Court: A Critical Analysis of NLG Evaluation Trends
Despite advances in Natural Language Generation (NLG), evaluation remains challenging. Although various new metrics and LLM-as-a-judge …
Jing Yang
,
Nils Feldhus
,
Salar Mohtaj
,
Leonhard Hennig
,
Qianli Wang
,
Eleni Metheniti
,
Sherzod Hakimov
,
Charlott Jakob
,
Veronika Solopova
,
Konrad Rieck
,
David Schlangen
,
Sebastian Möller
,
Vera Schmitt
PDF
Cite
URL
Cite
×