SOTAVerified

Hallucination

Papers

Showing 426450 of 1816 papers

TitleStatusHype
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and BeyondCode1
Extract Free Dense Misalignment from CLIPCode1
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language GenerationCode1
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference OptimizationCode1
MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-ExpertsCode1
Face Hallucination via Split-Attention in Split-Attention NetworkCode1
Evaluation and Analysis of Hallucination in Large Vision-Language ModelsCode1
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?Code1
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language ModelsCode1
Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language ModelsCode1
Theory of Mind for Multi-Agent Collaboration via Large Language ModelsCode1
EventHallusion: Diagnosing Event Hallucinations in Video LLMsCode1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion ModelsCode1
Doc2Query--: When Less is MoreCode1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
Evaluating Image Hallucination in Text-to-Image Generation with Question-AnsweringCode1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented GenerationCode1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and MitigationCode1
Distinguishing Ignorance from Error in LLM HallucinationsCode1
Federated Recommendation via Hybrid Retrieval Augmented GenerationCode1
Hallucinated Neural Radiance Fields in the WildCode1
Label Hallucination for Few-Shot ClassificationCode1
PREFER: Prompt Ensemble Learning via Feedback-Reflect-RefineCode1
Trustworthiness in Retrieval-Augmented Generation Systems: A SurveyCode1
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation0
Show:102550
← PrevPage 18 of 73Next →

No leaderboard results yet.