
NLG Evaluation

Evaluate text generated by natural language generation (NLG) systems, such as large language models.

Papers


Showing 41–50 of 71 papers

Title	Date	Tasks	Status	Hype	Score
Language Model Augmented Relevance Score	Aug 19, 2021	Language Modeling, Language Modelling	Unverified	0	0
Large Language Models Are Active Critics in NLG Evaluation	Oct 14, 2024	nlg evaluation, Prompt Engineering	Unverified	0	0
A Snapshot of NLG Evaluation Practices 2005 - 2014	Sep 1, 2015	nlg evaluation, Text Generation	Unverified	0	0
LLM-based NLG Evaluation: Current Status and Challenges	Feb 2, 2024	nlg evaluation, Text Generation	Unverified	0	0
A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation – through the Lens of Semantic Similarity Rating	Jul 1, 2022	nlg evaluation, Semantic Similarity	Unverified	0	0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text	Aug 1, 2021	All, Articles	Unverified	0	0
MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation	Jul 24, 2021	Diversity, nlg evaluation	Unverified	0	0
The Pitfalls of Defining Hallucination	Jan 15, 2024	Hallucination, nlg evaluation	Unverified	0	0
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text	Jun 30, 2021	All, Articles	Unverified	0	0
NLG-Metricverse: An End-to-End Library for Evaluating Natural Language Generation	Oct 1, 2022	Management, nlg evaluation	Unverified	0	0


No leaderboard results yet.