SOTAVerified

nlg evaluation

Evaluate the generated text by NLG (Natural Language Generation) systems, like large language models

Papers

Showing 5160 of 71 papers

TitleStatusHype
Analyzing and Evaluating Correlation Measures in NLG Meta-EvaluationCode0
Are LLM-based Evaluators Confusing NLG Quality Criteria?Code0
A Study of Automatic Metrics for the Evaluation of Natural Language ExplanationsCode0
Better than Random: Reliable NLG Human Evaluation with Constrained Active SamplingCode0
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and UnderstandingCode0
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation MetricsCode0
CLSE: Corpus of Linguistically Significant EntitiesCode0
DEBATE: Devil's Advocate-Based Assessment and Text EvaluationCode0
DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question AnsweringCode0
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG EvaluationCode0
Show:102550
← PrevPage 6 of 8Next →

No leaderboard results yet.