SOTAVerified

Hallucination

Papers

Showing 501-510 of 1816 papers

Title | Status | Hype
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
On the Universal Truthfulness Hyperplane Inside LLMs | Code | 0
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Code | 0
Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling | Code | 0
MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Code | 0
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback | Code | 0
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews | Code | 0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Code | 0
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models | Code | 0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Code | 0

No leaderboard results yet.