nlg evaluation

Evaluate the text generated by NLG (natural language generation) systems, such as large language models.

Papers

Showing 31–40 of 71 papers

Title	Date	Tasks	Status	Hype
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References	May 24, 2023	All, Machine Translation	Code Available	0
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory	May 24, 2023	nlg evaluation, Text Generation	Code Available	1
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist	May 15, 2023	Controllable Language Modelling, Dialogue Generation	Code Available	3
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment	Mar 29, 2023	Dialogue Generation, Diversity	Code Available	1
Is ChatGPT a Good NLG Evaluator? A Preliminary Study	Mar 7, 2023	nlg evaluation, Story Generation	Code Available	1
Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions	Mar 7, 2023	nlg evaluation, Representation Learning	Code Available	0
CLSE: Corpus of Linguistically Significant Entities	Nov 4, 2022	nlg evaluation, Text Generation	Code Available	0
Dialect-robust Evaluation of Generated Text	Nov 2, 2022	nlg evaluation	Unverified	0
Towards a Unified Multi-Dimensional Evaluator for Text Generation	Oct 13, 2022	nlg evaluation, Question Answering	Code Available	2
Not All Errors are Equal: Learning Text Generation Metrics using Stratified Error Synthesis	Oct 10, 2022	All, Image Captioning	Code Available	1


No leaderboard results yet.