| Hallucination Detection on a Budget: Efficient Bayesian Estimation of Semantic Entropy | Apr 4, 2025 | Hallucination | —Unverified | 0 |
| Bridging LMS and Generative AI: Dynamic Course Content Integration (DCCI) for Connecting LLMs to Course Content -- The Ask ME Assistant | Apr 4, 2025 | HallucinationPrompt Engineering | —Unverified | 0 |
| Noise Augmented Fine Tuning for Mitigating Hallucinations in Large Language Models | Apr 4, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Practical Poisoning Attacks against Retrieval-Augmented Generation | Apr 4, 2025 | HallucinationRAG | —Unverified | 0 |
| A Memory-Augmented LLM-Driven Method for Autonomous Merging of 3D Printing Work Orders | Apr 3, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances | Apr 1, 2025 | Hallucination | —Unverified | 0 |
| A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System | Apr 1, 2025 | Dialogue GenerationEnsemble Learning | —Unverified | 0 |
| GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments | Apr 1, 2025 | HallucinationText Generation | —Unverified | 0 |
| Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations | Apr 1, 2025 | Hallucination | CodeCode Available | 0 |
| HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation | Mar 31, 2025 | HallucinationHuman-Object Interaction Detection | —Unverified | 0 |
| An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering | Mar 30, 2025 | HallucinationMulti-hop Question Answering | —Unverified | 0 |
| Learning to Instruct for Visual Instruction Tuning | Mar 28, 2025 | HallucinationInstruction Following | —Unverified | 0 |
| Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best? | Mar 27, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search | Mar 27, 2025 | HallucinationKnowledge Distillation | —Unverified | 0 |
| Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack | Mar 27, 2025 | HallucinationRAG | —Unverified | 0 |
| Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy | Mar 26, 2025 | HallucinationImage Captioning | —Unverified | 0 |
| Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering | Mar 26, 2025 | DiagnosticHallucination | —Unverified | 0 |
| TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes | Mar 26, 2025 | Hallucination | —Unverified | 0 |
| Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs | Mar 26, 2025 | HallucinationHallucination Evaluation | —Unverified | 0 |
| GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization | Mar 26, 2025 | HallucinationPrompt Learning | CodeCode Available | 0 |
| KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models | Mar 25, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection | Mar 25, 2025 | HallucinationNatural Language Inference | —Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | HallucinationTriviaQA | —Unverified | 0 |
| good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval | Mar 22, 2025 | DiversityHallucination | —Unverified | 0 |
| Judge Anything: MLLM as a Judge Across Any Modality | Mar 21, 2025 | Hallucination | —Unverified | 0 |
| FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs | Mar 21, 2025 | HallucinationKnowledge Graphs | —Unverified | 0 |
| ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| Towards Lighter and Robust Evaluation for Retrieval Augmented Generation | Mar 20, 2025 | HallucinationRAG | CodeCode Available | 0 |
| MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations | Mar 20, 2025 | HallucinationVideo Understanding | —Unverified | 0 |
| MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Mar 19, 2025 | Adversarial RobustnessAutonomous Driving | —Unverified | 0 |
| R^2: A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs | Mar 19, 2025 | graph constructionHallucination | —Unverified | 0 |
| Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | Mar 19, 2025 | Fact CheckingFact Verification | —Unverified | 0 |
| Enhancing LLM Generation with Knowledge Hypergraph for Evidence-Based Medicine | Mar 18, 2025 | HallucinationRAG | —Unverified | 0 |
| Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Mar 18, 2025 | Hallucination | CodeCode Available | 0 |
| RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | Mar 18, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| From "Hallucination" to "Suture": Insights from Language Philosophy to Enhance Large Language Models | Mar 18, 2025 | HallucinationPhilosophy | —Unverified | 0 |
| HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models | Mar 17, 2025 | HallucinationQuestion Answering | CodeCode Available | 0 |
| LLMSeR: Enhancing Sequential Recommendation via LLM-based Data Augmentation | Mar 16, 2025 | Data AugmentationHallucination | —Unverified | 0 |
| Applications of Large Language Model Reasoning in Feature Generation | Mar 15, 2025 | Computational EfficiencyDomain Adaptation | —Unverified | 0 |
| Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Mar 14, 2025 | Hallucination | CodeCode Available | 0 |
| AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Mar 14, 2025 | Abstractive Text SummarizationChunking | CodeCode Available | 0 |
| RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration | Mar 14, 2025 | Graph LearningHallucination | —Unverified | 0 |
| LLM Agents for Education: Advances and Applications | Mar 14, 2025 | FairnessHallucination | —Unverified | 0 |
| Learning to Inference Adaptively for Multimodal Large Language Models | Mar 13, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding | Mar 13, 2025 | HallucinationText Generation | CodeCode Available | 0 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer GenerationConversational Search | CodeCode Available | 0 |
| NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Mar 12, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection | Mar 12, 2025 | Decision MakingFake News Detection | —Unverified | 0 |
| OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting | Mar 11, 2025 | HallucinationObject | —Unverified | 0 |