| Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Jul 13, 2020 | HallucinationVocal Bursts Intensity Prediction | CodeCode Available | 0 | 5 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |
| DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation | Jun 13, 2024 | BenchmarkingHallucination | CodeCode Available | 0 | 5 |
| Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology | Apr 24, 2023 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 | 5 |
| LLM Internal States Reveal Hallucination Risk Faced With a Query | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 0 | 5 |
| Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models | Dec 3, 2023 | HallucinationVisual Grounding | CodeCode Available | 0 | 5 |
| Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs | Mar 17, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Oct 19, 2023 | HallucinationMathematical Reasoning | CodeCode Available | 0 | 5 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | HallucinationRetrieval | CodeCode Available | 0 | 5 |