SOTAVerified

NLG Evaluation

Evaluation of text generated by NLG (Natural Language Generation) systems, such as large language models.
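Many of the metrics studied in the papers below are reference-based: a candidate text is scored against one or more human-written references. As a minimal illustration (a toy sketch, not the method of any specific paper listed here), a unigram-overlap F1 metric can be written in pure Python:

```python
from collections import Counter

def unigram_f1(candidate: str, reference: str) -> float:
    """Token-overlap F1 between a candidate and a reference text.

    A toy reference-based NLG metric: tokenization is whitespace-based
    and case-insensitive, and matches are clipped per token type
    (a token in the candidate can only match as many times as it
    appears in the reference).
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped token matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```

For example, `unigram_f1("the cat", "the cat sat")` gives precision 1.0 and recall 2/3, hence F1 = 0.8. Real metrics in this literature go well beyond this, from n-gram methods like BLEU to learned and LLM-based evaluators.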

Papers

Showing 21–30 of 71 papers

Title | Status | Hype
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics | Code | 0
DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering | Code | 0
Are LLM-based Evaluators Confusing NLG Quality Criteria? | Code | 0
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and Understanding | Code | 0
Analyzing and Evaluating Correlation Measures in NLG Meta-Evaluation | Code | 0
Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Code | 0
Long-Form Information Alignment Evaluation Beyond Atomic Facts | Code | 0
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs | Code | 0
Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling | Code | 0
Towards Multiple References Era -- Addressing Data Leakage and Limited Reference Diversity in NLG Evaluation | Code | 0
Page 3 of 8

No leaderboard results yet.