SOTAVerified

nlg evaluation

Evaluate the generated text by NLG (Natural Language Generation) systems, like large language models

Papers

Showing 4150 of 71 papers

TitleStatusHype
NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation0
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation MetricsCode0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating0
The Authenticity Gap in Human Evaluation0
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications0
Near-Negative Distinction: Giving a Second Life to Human Evaluation DatasetsCode0
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and UnderstandingCode0
Active Evaluation: Efficient NLG Evaluation with Few Pairwise ComparisonsCode1
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text0
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.