| Enhanced document retrieval with topic embeddings | Aug 19, 2024 | HallucinationRAG | —Unverified | 0 |
| CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs | Aug 19, 2024 | Hallucinationzero-shot-classification | —Unverified | 0 |
| Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making | Aug 17, 2024 | Decision MakingHallucination | —Unverified | 0 |
| Lower Layer Matters: Alleviating Hallucination via Multi-Layer Fusion Contrastive Decoding with Truthfulness Refocused | Aug 16, 2024 | HallucinationTruthfulQA | —Unverified | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Aug 16, 2024 | DescriptiveHallucination | —Unverified | 0 |
| Plan with Code: Comparing approaches for robust NL to DSL generation | Aug 15, 2024 | Code GenerationHallucination | —Unverified | 0 |
| CodeMirage: Hallucinations in Code Generated by Large Language Models | Aug 14, 2024 | Code GenerationHallucination | —Unverified | 0 |
| Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability | Aug 14, 2024 | Hallucination | —Unverified | 0 |
| Audit-LLM: Multi-Agent Collaboration for Log-based Insider Threat Detection | Aug 12, 2024 | Common Sense ReasoningHallucination | —Unverified | 0 |
| Reference-free Hallucination Detection for Large Vision-Language Models | Aug 11, 2024 | HallucinationQuestion Answering | —Unverified | 0 |
| Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text | Aug 10, 2024 | Automatic Speech RecognitionHallucination | —Unverified | 0 |
| FiSTECH: Financial Style Transfer to Enhance Creativity without Hallucinations in LLMs | Aug 9, 2024 | ChatbotHallucination | —Unverified | 0 |
| Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models | Aug 9, 2024 | Hallucination | CodeCode Available | 0 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| KnowPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models | Aug 6, 2024 | HallucinationRAG | —Unverified | 0 |
| MAO: A Framework for Process Model Generation with Multi-Agent Orchestration | Aug 4, 2024 | Hallucinationsoftware testing | —Unverified | 0 |
| Improving Zero-Shot ObjectNav with Generative Communication | Aug 3, 2024 | HallucinationNavigate | —Unverified | 0 |
| Misinforming LLMs: vulnerabilities, challenges and opportunities | Aug 2, 2024 | HallucinationMisinformation | —Unverified | 0 |
| Piculet: Specialized Models-Guided Hallucination Decrease for MultiModal Large Language Models | Aug 2, 2024 | Hallucination | —Unverified | 0 |
| Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation | Aug 1, 2024 | HallucinationImage Comprehension | —Unverified | 0 |
| Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering | Jul 31, 2024 | DiagnosticHallucination | —Unverified | 0 |
| Cost-Effective Hallucination Detection for LLMs | Jul 31, 2024 | Decision MakingFact Checking | —Unverified | 0 |
| Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate | Jul 30, 2024 | Hallucination | —Unverified | 0 |
| VILA^2: VILA Augmented VILA | Jul 24, 2024 | HallucinationOptical Character Recognition (OCR) | —Unverified | 0 |
| WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Jul 24, 2024 | ChatbotForm | —Unverified | 0 |
| LawLuo: A Multi-Agent Collaborative Framework for Multi-Round Chinese Legal Consultation | Jul 23, 2024 | HallucinationRAG | —Unverified | 0 |
| Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Jul 23, 2024 | HallucinationParaphrase Generation | CodeCode Available | 0 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | HallucinationMachine Translation | CodeCode Available | 0 |
| Generation Constraint Scaling Can Mitigate Hallucination | Jul 23, 2024 | DecoderHallucination | —Unverified | 0 |
| Shared Imagination: LLMs Hallucinate Alike | Jul 23, 2024 | HallucinationQuestion Answering | —Unverified | 0 |
| Multilingual Fine-Grained News Headline Hallucination Detection | Jul 22, 2024 | HallucinationHeadline Generation | —Unverified | 0 |
| Text2Place: Affordance-aware Text Guided Human Placement | Jul 22, 2024 | AttributeHallucination | —Unverified | 0 |
| Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service | Jul 22, 2024 | Hallucinationnamed-entity-recognition | —Unverified | 0 |
| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | CodeCode Available | 0 |
| Data-Centric Human Preference Optimization with Rationales | Jul 19, 2024 | Hallucination | CodeCode Available | 0 |
| Retrieval-Augmented Generation for Natural Language Processing: A Survey | Jul 18, 2024 | HallucinationRAG | —Unverified | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | HallucinationLanguage Modelling | —Unverified | 0 |
| Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Jul 18, 2024 | Decision MakingHallucination | —Unverified | 0 |
| ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Jul 18, 2024 | Cross-Lingual TransferHallucination | CodeCode Available | 0 |
| Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Jul 18, 2024 | Hallucinationobject-detection | —Unverified | 0 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | FormHallucination | CodeCode Available | 0 |
| What's Wrong? Refining Meeting Summaries with LLM Feedback | Jul 16, 2024 | HallucinationInformativeness | CodeCode Available | 0 |
| Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Jul 15, 2024 | Common Sense ReasoningHallucination | —Unverified | 0 |
| GraphEval: A Knowledge-Graph Based LLM Hallucination Evaluation Framework | Jul 15, 2024 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Look Within, Why LLMs Hallucinate: A Causal Perspective | Jul 14, 2024 | HallucinationReading Comprehension | —Unverified | 0 |
| On Mitigating Code LLM Hallucinations with API Documentation | Jul 13, 2024 | Hallucinationvalid | —Unverified | 0 |
| Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues | Jul 13, 2024 | DiversityHallucination | —Unverified | 0 |
| The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs | Jul 12, 2024 | Hallucination | —Unverified | 0 |
| Mitigating Entity-Level Hallucination in Large Language Models | Jul 12, 2024 | HallucinationInformation Retrieval | CodeCode Available | 0 |
| DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection | Jul 12, 2024 | fr-enHallucination | —Unverified | 0 |