| Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations | Sep 24, 2021 | Hallucination | Code Available | 0 |
| NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly | Oct 16, 2022 | Cultural Vocal Bursts Intensity Prediction, Hallucination | Code Available | 0 |
| DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine | Nov 14, 2024 | Form, Hallucination | Code Available | 0 |
| NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model | Mar 12, 2025 | Hallucination, Language Modeling | Code Available | 0 |
| Learning with privileged information via adversarial discriminative modality distillation | Oct 19, 2018 | Action Recognition, Hallucination | Code Available | 0 |
| Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks | Feb 7, 2025 | Abstractive Text Summarization, Explanation Generation | Code Available | 0 |
| Object Hallucination in Image Captioning | Sep 6, 2018 | Hallucination, Image Captioning | Code Available | 0 |
| DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models | May 31, 2024 | Hallucination, Model Editing | Code Available | 0 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | Form, Hallucination | Code Available | 0 |
| Self-training Large Language Models through Knowledge Detection | Jun 17, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| Fine-grained Contract NER using instruction based model | Jan 24, 2024 | Few-Shot Learning, Hallucination | Code Available | 0 |
| Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Mar 18, 2025 | Hallucination | Code Available | 0 |
| Semantic Noise Matters for Neural Natural Language Generation | Nov 10, 2019 | Data-to-Text Generation, Hallucination | Code Available | 0 |
| Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination | Apr 10, 2025 | Domain Generalization, Hallucination | Code Available | 0 |
| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Feb 8, 2025 | Conformal Prediction, Decision Making | Code Available | 0 |
| Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts | Aug 21, 2023 | Articles, Hallucination | Code Available | 0 |
| Cross-modal Learning by Hallucinating Missing Modalities in RGB-D Vision | Jan 1, 2019 | Action Recognition, Hallucination | Code Available | 0 |
| SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes | Apr 16, 2025 | Hallucination | Code Available | 0 |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Jul 1, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment | Oct 31, 2023 | Denoising, Hallucination | Code Available | 0 |
| BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented Generation | Oct 2, 2024 | Hallucination, RAG | Code Available | 0 |
| On hallucinations in tomographic image reconstruction | Dec 1, 2020 | Hallucination, Image Reconstruction | Code Available | 0 |
| OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models | Jan 22, 2025 | Hallucination | Code Available | 0 |
| On Large Language Models' Hallucination with Regard to Known Facts | Mar 29, 2024 | Hallucination, Triplet | Code Available | 0 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Oct 25, 2023 | Data-to-Text Generation, Hallucination | Code Available | 0 |
| Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks | Dec 19, 2022 | Extractive Question-Answering, Hallucination | Code Available | 0 |
| Adversarial Semantic Hallucination for Domain Generalized Semantic Segmentation | Jun 8, 2021 | Domain Adaptation, Domain Generalization | Code Available | 0 |
| On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation | Jun 18, 2024 | Hallucination, Response Generation | Code Available | 0 |
| Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore | Feb 27, 2025 | Hallucination, Object | Code Available | 0 |
| On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization | Mar 9, 2024 | Hallucination, Text Summarization | Code Available | 0 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | Hallucination, In-Context Learning | Code Available | 0 |
| SH2: Self-Highlighted Hesitation Helps You Decode More Truthfully | Jan 11, 2024 | Hallucination, Text Generation | Code Available | 0 |
| Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs | Jun 17, 2024 | counterfactual, Hallucination | Code Available | 0 |
| On the Hallucination in Simultaneous Machine Translation | Jun 11, 2024 | Hallucination, Machine Translation | Code Available | 0 |
| Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers | Mar 4, 2025 | Hallucination | Code Available | 0 |
| BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science | Jun 29, 2024 | AI Agent, Claim Verification | Code Available | 0 |
| Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot | Oct 30, 2024 | Chatbot, Dialogue State Tracking | Code Available | 0 |
| SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection | Apr 4, 2024 | Hallucination, In-Context Learning | Code Available | 0 |
| Language Models Hallucinate, but May Excel at Fact Verification | Oct 23, 2023 | Fact Verification, Hallucination | Code Available | 0 |
| On the Universal Truthfulness Hyperplane Inside LLMs | Jul 11, 2024 | Diversity, Domain Generalization | Code Available | 0 |
| Ontology-Constrained Generation of Domain-Specific Clinical Summaries | Nov 23, 2024 | Hallucination, Text Summarization | Code Available | 0 |
| SiGAN: Siamese Generative Adversarial Network for Identity-Preserving Face Hallucination | Jul 22, 2018 | Face Hallucination, Face Reconstruction | Code Available | 0 |
| Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Oct 23, 2023 | Abstractive Text Summarization, Dialogue Generation | Code Available | 0 |
| SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection | Jun 20, 2020 | Hallucination, Morphological Inflection | Code Available | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | May 20, 2025 | Hallucination, scientific discovery | Code Available | 0 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | Mar 6, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions | Jul 8, 2024 | Hallucination, Knowledge Graphs | Code Available | 0 |
| keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection | May 23, 2025 | Hallucination, Language Modeling | Code Available | 0 |