SOTAVerified

Hallucination

Papers

Showing 101-125 of 1816 papers

Title | Status | Hype
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models | Code | 2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Code | 2
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Code | 2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Code | 2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Code | 2
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation | Code | 2
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation | Code | 2
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs | Code | 2
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework | Code | 2
Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Code | 2
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models | Code | 2
mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Code | 2
Understanding Hallucinations in Diffusion Models through Mode Interpolation | Code | 2
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models | Code | 2
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Code | 2
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination | Code | 2
ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement | Code | 2
Calibrated Self-Rewarding Vision Language Models | Code | 2
Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering | Code | 2
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Code | 2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Code | 2
A Diffusion-Based Generative Equalizer for Music Restoration | Code | 2
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models | Code | 2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Code | 2
Page 5 of 73

No leaderboard results yet.