| Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning | Jun 16, 2025 | HallucinationSpatial Interpolation | CodeCode Available | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | HallucinationTask Planning | CodeCode Available | 0 |
| ProveRAG: Provenance-Driven Vulnerability Analysis with Automated Retrieval-Augmented LLMs | Oct 22, 2024 | ChunkingHallucination | CodeCode Available | 0 |
| A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Sep 24, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| HaRiM^+: Evaluating Summary Quality with Hallucination Risk | Nov 22, 2022 | Automated Writing EvaluationDecoder | CodeCode Available | 0 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 |
| Are Large Language Models Good at Utility Judgments? | Mar 28, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 |
| Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses | Jul 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Pushing the Limits of Low-Resource Morphological Inflection | Aug 16, 2019 | Cross-Lingual TransferDecoder | CodeCode Available | 0 |
| Embedding Hallucination for Few-Shot Language Fine-tuning | May 3, 2022 | Data AugmentationHallucination | CodeCode Available | 0 |