SOTAVerified

NLG Evaluation

Evaluate the text generated by NLG (Natural Language Generation) systems, such as large language models.

Papers

Showing 26–50 of 71 papers

Title | Status | Hype
NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation |  | 0
Evaluation of Text Generation: A Survey |  | 0
Evaluation rules! On the use of grammars and rule-based systems for NLG evaluation |  | 0
Exploring the Multilingual NLG Evaluation Abilities of LLM-Based Evaluators |  | 0
The Authenticity Gap in Human Evaluation |  | 0
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation |  | 0
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation |  | 0
Language Model Augmented Relevance Score |  | 0
Large Language Models Are Active Critics in NLG Evaluation |  | 0
LLM-based NLG Evaluation: Current Status and Challenges |  | 0
A Survey of Natural Language Generation |  | 0
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation |  | 0
A Tutorial on Evaluation Metrics used in Natural Language Generation |  | 0
Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking |  | 0
Agreement is overrated: A plea for correlation to assess human evaluation reliability |  | 0
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts |  | 0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text |  | 0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text |  | 0
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text |  | 0
Rethinking Model Evaluation as Narrowing the Socio-Technical Gap |  | 0
SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text |  | 0
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation |  | 0
The Pitfalls of Defining Hallucination |  | 0
The use of rating and Likert scales in Natural Language Generation human evaluation tasks: A review and some recommendations |  | 0
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models | Code | 0
Page 2 of 3

No leaderboard results yet.