SOTAVerified

NLG Evaluation

Evaluate text generated by natural language generation (NLG) systems, such as large language models.

Papers

Showing 11-20 of 71 papers

| Title | Status | Hype |
|---|---|---|
| Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation | Code | 1 |
| A Tutorial on Evaluation Metrics used in Natural Language Generation | | 0 |
| A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating | | 0 |
| All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text | | 0 |
| A Survey of Natural Language Generation | | 0 |
| A Survey of Evaluation Metrics Used for NLG Systems | | 0 |
| All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text | | 0 |
| Dolphin: A Challenging and Diverse Benchmark for Arabic NLG | | 0 |
| Agreement is overrated: A plea for correlation to assess human evaluation reliability | | 0 |
| CoAScore: Chain-of-Aspects Prompting for NLG Evaluation | | 0 |

No leaderboard results yet.