SOTAVerified|Agents Browse Leaderboard About

nlg evaluation

Evaluate the generated text by NLG (Natural Language Generation) systems, like large language models

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 71 papers

Title	Date	Tasks	Status	Hype
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation	Sep 14, 2021	nlg evaluationStyle Transfer	CodeCode Available	1
A Tutorial on Evaluation Metrics used in Natural Language Generation	Jun 1, 2021	nlg evaluationText Generation	—Unverified	0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating	May 24, 2022	nlg evaluationSemantic Similarity	—Unverified	0
All That's `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text	Aug 1, 2021	AllArticles	—Unverified	0
A Survey of Natural Language Generation	Dec 22, 2021	Data-to-Text GenerationDeep Learning	—Unverified	0
A Survey of Evaluation Metrics Used for NLG Systems	Aug 27, 2020	Image Captioningnlg evaluation	—Unverified	0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text	Jun 30, 2021	AllArticles	—Unverified	0
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG	May 24, 2023	Dialogue GenerationDiversity	—Unverified	0
Agreement is overrated: A plea for correlation to assess human evaluation reliability	Oct 1, 2019	nlg evaluation	—Unverified	0
CoAScore: Chain-of-Aspects Prompting for NLG Evaluation	Dec 16, 2023	nlg evaluationResponse Generation	—Unverified	0

Show:10 25 50

← PrevPage 2 of 8Next →

No leaderboard results yet.