| Distilling Reasoning Ability from Large Language Models with Adaptive Thinking | Apr 14, 2024 | Hallucination | —Unverified | 0 |
| Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models | Apr 14, 2024 | Hallucination | —Unverified | 0 |
| Reducing hallucination in structured outputs via Retrieval-Augmented Generation | Apr 12, 2024 | HallucinationRAG | —Unverified | 0 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Apr 11, 2024 | HallucinationSurvey | —Unverified | 0 |
| BRAVE: Broadening the visual encoding of vision-language models | Apr 10, 2024 | HallucinationLanguage Modelling | —Unverified | 0 |
| MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models | Apr 10, 2024 | Hallucination | —Unverified | 0 |
| Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports | Apr 9, 2024 | FormHallucination | CodeCode Available | 0 |
| SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection | Apr 9, 2024 | Hallucination | CodeCode Available | 0 |
| Automating Research Synthesis with Domain-Specific Large Language Model Fine-Tuning | Apr 8, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Hyperbolic Learning with Synthetic Captions for Open-World Detection | Apr 7, 2024 | HallucinationNovel Concepts | —Unverified | 0 |
| HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models | Apr 7, 2024 | HallucinationRepresentation Learning | —Unverified | 0 |
| FGAIF: Aligning Large Vision-Language Models with Fine-grained AI Feedback | Apr 7, 2024 | AttributeHallucination | —Unverified | 0 |
| SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination | Apr 7, 2024 | HallucinationMachine Translation | CodeCode Available | 0 |
| PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition Dynamics | Apr 6, 2024 | BenchmarkingHallucination | CodeCode Available | 0 |
| On the Limitations of Large Language Models (LLMs): False Attribution | Apr 6, 2024 | Author AttributionHallucination | —Unverified | 0 |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | Apr 5, 2024 | AttributeHallucination | —Unverified | 0 |
| Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations | Apr 4, 2024 | HallucinationHuman Detection | CodeCode Available | 0 |
| A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation | Apr 4, 2024 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection | Apr 4, 2024 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal PredictionGenerative Question Answering | —Unverified | 0 |
| Scalable Model Editing via Customized Expert Networks | Apr 3, 2024 | Hallucinationmodel | CodeCode Available | 0 |
| ALOHa: A New Measure for Hallucination in Captioning Models | Apr 3, 2024 | HallucinationObject | —Unverified | 0 |
| Hallucination Diversity-Aware Active Learning for Text Summarization | Apr 2, 2024 | Active LearningDiversity | —Unverified | 0 |
| Extracting Norms from Contracts Via ChatGPT: Opportunities and Challenges | Apr 2, 2024 | Hallucination | —Unverified | 0 |
| Comparative Study of Domain Driven Terms Extraction Using Large Language Models | Apr 2, 2024 | Document SummarizationHallucination | —Unverified | 0 |
| Exploring and Evaluating Hallucinations in LLM-Powered Code Generation | Apr 1, 2024 | Code GenerationHallucination | —Unverified | 0 |
| AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis | Apr 1, 2024 | Binary ClassificationHallucination | CodeCode Available | 0 |
| On Large Language Models' Hallucination with Regard to Known Facts | Mar 29, 2024 | HallucinationTriplet | CodeCode Available | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | HallucinationTask Planning | CodeCode Available | 0 |
| Are Large Language Models Good at Utility Judgments? | Mar 28, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 |
| FACTOID: FACtual enTailment fOr hallucInation Detection | Mar 28, 2024 | AvgHallucination | —Unverified | 0 |
| Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback | Mar 27, 2024 | Hallucination | —Unverified | 0 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 |
| "Sorry, Come Again?" Prompting -- Enhancing Comprehension and Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasing | Mar 27, 2024 | Hallucination | —Unverified | 0 |
| Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models | Mar 26, 2024 | HallucinationInformation Retrieval | CodeCode Available | 0 |
| DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation | Mar 26, 2024 | Abstract generationHallucination | CodeCode Available | 0 |
| Visual Hallucination: Definition, Quantification, and Prescriptive Remediations | Mar 26, 2024 | HallucinationImage Captioning | —Unverified | 0 |
| Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination | Mar 25, 2024 | HallucinationImitation Learning | —Unverified | 0 |
| Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art | Mar 25, 2024 | Common Sense ReasoningDecision Making | —Unverified | 0 |
| ESREAL: Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models | Mar 24, 2024 | HallucinationSemantic Similarity | —Unverified | 0 |
| Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information | Mar 22, 2024 | 3D ReconstructionHallucination | —Unverified | 0 |
| Sphere Neural-Networks for Rational Reasoning | Mar 22, 2024 | HallucinationLogical Reasoning | —Unverified | 0 |
| Multi-Modal Hallucination Control by Visual Information Grounding | Mar 20, 2024 | HallucinationVisual Question Answering (VQA) | —Unverified | 0 |
| DEE: Dual-stage Explainable Evaluation Method for Text Generation | Mar 18, 2024 | DiagnosticHallucination | —Unverified | 0 |
| Zero-Shot Multi-task Hallucination Detection | Mar 18, 2024 | Computational EfficiencyHallucination | —Unverified | 0 |
| SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | Mar 18, 2024 | HallucinationMotion Planning | —Unverified | 0 |
| Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs | Mar 17, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 |
| Mitigating Dialogue Hallucination for Large Vision Language Models via Adversarial Instruction Tuning | Mar 15, 2024 | HallucinationInstruction Following | —Unverified | 0 |
| Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection | Mar 15, 2024 | HallucinationLanguage Modelling | —Unverified | 0 |