SOTAVerified

nlg evaluation

Evaluate the generated text by NLG (Natural Language Generation) systems, like large language models

Papers

Showing 2650 of 71 papers

TitleStatusHype
One Prompt To Rule Them All: LLMs for Opinion Summary EvaluationCode0
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMsCode0
Perturbation CheckLists for Evaluating NLG Evaluation MetricsCode0
ReFeR: Improving Evaluation and Reasoning through Hierarchy of ModelsCode0
Towards Multiple References Era -- Addressing Data Leakage and Limited Reference Diversity in NLG EvaluationCode0
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language ModelsCode0
Why We Need New Evaluation Metrics for NLGCode0
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language ModelsCode0
Evaluation rules! On the use of grammars and rule-based systems for NLG evaluation0
Exploring the Multilingual NLG Evaluation Abilities of LLM-Based Evaluators0
A Survey of Natural Language Generation0
The Authenticity Gap in Human Evaluation0
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation0
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation0
A Survey of Evaluation Metrics Used for NLG Systems0
Language Model Augmented Relevance Score0
Large Language Models Are Active Critics in NLG Evaluation0
A Snapshot of NLG Evaluation Practices 2005 - 20140
LLM-based NLG Evaluation: Current Status and Challenges0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating0
All That's `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text0
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation0
The Pitfalls of Defining Hallucination0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text0
NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.