| Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs | Mar 11, 2025 | Hallucination | Unverified | 0 |
| Seeing What's Not There: Spurious Correlation in Multimodal LLMs | Mar 11, 2025 | Hallucination, Object | Unverified | 0 |
| Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation | Mar 11, 2025 | Computational Efficiency, Hallucination | Unverified | 0 |
| EAZY: Eliminating Hallucinations in LVLMs by Zeroing out Hallucinatory Image Tokens | Mar 10, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies | Mar 10, 2025 | Benchmarking, Ethics | Unverified | 0 |
| VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models | Mar 10, 2025 | Binary Classification, Hallucination | Code Available | 0 |
| CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation | Mar 10, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | Mar 10, 2025 | Hallucination, object-detection | Unverified | 0 |
| PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training | Mar 9, 2025 | Hallucination, Image Captioning | Unverified | 0 |
| CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model | Mar 9, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| Treble Counterfactual VLMs: A Causal Approach to Hallucination | Mar 8, 2025 | Autonomous Driving, counterfactual | Code Available | 0 |
| SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs | Mar 7, 2025 | Clustering, Hallucination | Unverified | 0 |
| Maximum Hallucination Standards for Domain-Specific Large Language Models | Mar 7, 2025 | Attribute, Hallucination | Unverified | 0 |
| TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction | Mar 6, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Mar 6, 2025 | Benchmarking, Common Sense Reasoning | Code Available | 0 |
| Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation | Mar 5, 2025 | Hallucination | Unverified | 0 |
| DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models | Mar 5, 2025 | Hallucination, Text Generation | Unverified | 0 |
| Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models | Mar 5, 2025 | Hallucination, Instruction Following | Code Available | 11 |
| See What You Are Told: Visual Attention Sink in Large Multimodal Models | Mar 5, 2025 | Hallucination | Unverified | 0 |
| Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias | Mar 5, 2025 | Denoising, Hallucination | Unverified | 0 |
| Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers | Mar 4, 2025 | Hallucination | Code Available | 0 |
| SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs | Mar 4, 2025 | Hallucination | Unverified | 0 |
| MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Mar 4, 2025 | Hallucination, Text Generation | Code Available | 0 |
| WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation | Mar 4, 2025 | Hallucination | Code Available | 2 |
| Adaptively profiling models with task elicitation | Mar 3, 2025 | Hallucination, Language Modeling | Unverified | 0 |
| Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization | Mar 3, 2025 | Hallucination, Hallucination Evaluation | Code Available | 0 |
| LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains | Mar 3, 2025 | Common Sense Reasoning, Hallucination | Unverified | 0 |
| Tackling Hallucination from Conditional Models for Medical Image Reconstruction with DynamicDPS | Mar 3, 2025 | Hallucination, Image Reconstruction | Unverified | 0 |
| Explainable Depression Detection in Clinical Interviews with Personalized Retrieval-Augmented Generation | Mar 3, 2025 | Depression Detection, Hallucination | Unverified | 0 |
| NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPT | Mar 2, 2025 | Hallucination | Code Available | 0 |
| Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | Mar 2, 2025 | Fact Checking, Federated Learning | Unverified | 0 |
| Steer LLM Latents for Hallucination Detection | Mar 1, 2025 | Hallucination | Unverified | 0 |
| UniFa: A unified feature hallucination framework for any-shot object detection | Mar 1, 2025 | Generalized Zero-Shot Object Detection, Hallucination | Unverified | 0 |
| U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack | Mar 1, 2025 | Hallucination, RAG | Code Available | 0 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision Making, Hallucination | Code Available | 0 |
| Towards General Visual-Linguistic Face Forgery Detection(V2) | Feb 28, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs | Feb 28, 2025 | Hallucination | Unverified | 0 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Feb 28, 2025 | Hallucination, Object | Code Available | 1 |
| One-for-More: Continual Diffusion Model for Anomaly Detection | Feb 27, 2025 | Anomaly Detection, continual anomaly detection | Code Available | 2 |
| ProAPO: Progressively Automatic Prompt Optimization for Visual Classification | Feb 27, 2025 | Classification, Hallucination | Code Available | 1 |
| Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Feb 27, 2025 | Hallucination, Object | Code Available | 0 |
| Exploring the Generalizability of Factual Hallucination Mitigation via Enhancing Precise Knowledge Utilization | Feb 26, 2025 | Hallucination | Unverified | 0 |
| Medical Hallucinations in Foundation Models and Their Impact on Healthcare | Feb 26, 2025 | Benchmarking, Hallucination | Code Available | 2 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal Retrieval, Hallucination | Unverified | 0 |
| Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents | Feb 26, 2025 | Hallucination, Knowledge Distillation | Unverified | 0 |
| BRIDO: Bringing Democratic Order to Abstractive Summarization | Feb 25, 2025 | Abstractive Text Summarization, Contrastive Learning | Unverified | 0 |
| Verdict: A Library for Scaling Judge-Time Compute | Feb 25, 2025 | Fact Checking, Hallucination | Code Available | 3 |
| Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models | Feb 25, 2025 | Backdoor Attack, Hallucination | Unverified | 0 |
| Hallucination Detection in LLMs Using Spectral Features of Attention Maps | Feb 24, 2025 | Hallucination | Code Available | 1 |
| Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models | Feb 24, 2025 | Hallucination, Image Captioning | Unverified | 0 |