SOTAVerified

nlg evaluation

Evaluate the generated text by NLG (Natural Language Generation) systems, like large language models

Papers

Showing 4150 of 71 papers

TitleStatusHype
Language Model Augmented Relevance Score0
Large Language Models Are Active Critics in NLG Evaluation0
A Snapshot of NLG Evaluation Practices 2005 - 20140
LLM-based NLG Evaluation: Current Status and Challenges0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating0
All That's `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text0
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation0
The Pitfalls of Defining Hallucination0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text0
NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation0
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.