SOTAVerified

Hallucination

Papers

Showing 101–150 of 1816 papers

| Title | Status | Hype |
| --- | --- | --- |
| Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models | Code | 2 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Code | 2 |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Code | 2 |
| Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Code | 2 |
| ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Code | 2 |
| MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation | Code | 2 |
| Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation | Code | 2 |
| Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs | Code | 2 |
| Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework | Code | 2 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Code | 2 |
| Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models | Code | 2 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Code | 2 |
| Understanding Hallucinations in Diffusion Models through Mode Interpolation | Code | 2 |
| Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models | Code | 2 |
| Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Code | 2 |
| 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination | Code | 2 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2 |
| Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement | Code | 2 |
| Calibrated Self-Rewarding Vision Language Models | Code | 2 |
| Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering | Code | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Code | 2 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Code | 2 |
| A Diffusion-Based Generative Equalizer for Music Restoration | Code | 2 |
| Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models | Code | 2 |
| In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Code | 2 |
| HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Code | 2 |
| TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space | Code | 2 |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Code | 2 |
| Reformatted Alignment | Code | 2 |
| Aligning Modalities in Vision Large Language Models via Preference Fine-tuning | Code | 2 |
| InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment | Code | 2 |
| A Survey on Hallucination in Large Vision-Language Models | Code | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Code | 2 |
| RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models | Code | 2 |
| OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation | Code | 2 |
| Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding | Code | 2 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Code | 2 |
| Woodpecker: Hallucination Correction for Multimodal Large Language Models | Code | 2 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Code | 2 |
| From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models | Code | 2 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Code | 2 |
| MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation | Code | 2 |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Code | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Code | 2 |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Code | 2 |
| TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Code | 2 |
| Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies | Code | 2 |
| Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph | Code | 2 |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Code | 2 |
| ToolQA: A Dataset for LLM Question Answering with External Tools | Code | 2 |

No leaderboard results yet.