| ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation | May 14, 2024 | Hallucinationscientific discovery | —Unverified | 0 |
| Control Token with Dense Passage Retrieval | May 13, 2024 | HallucinationPassage Retrieval | —Unverified | 0 |
| Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness | May 13, 2024 | Benchmarkingcounterfactual | —Unverified | 0 |
| Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval | May 10, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought | May 9, 2024 | HallucinationMath | —Unverified | 0 |
| THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models | May 8, 2024 | AttributeData Augmentation | CodeCode Available | 1 |
| Is the House Ready For Sleeptime? Generating and Evaluating Situational Queries for Embodied Question Answering | May 8, 2024 | 2kEmbodied Question Answering | —Unverified | 0 |
| SUTRA: Scalable Multilingual Language Model Architecture | May 7, 2024 | Computational EfficiencyHallucination | —Unverified | 0 |
| Sora Detector: A Unified Hallucination Detection for Large Text-to-Video Models | May 7, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 |
| Deception in Reinforced Autonomous Agents | May 7, 2024 | Deception DetectionHallucination | —Unverified | 0 |