| A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection | Dec 16, 2024 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | Dec 16, 2024 | HallucinationMultiple-choice | —Unverified | 0 |
| RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | Dec 15, 2024 | Autonomous DrivingContrastive Learning | —Unverified | 0 |
| Task-Oriented Dialog Systems for the Senegalese Wolof Language | Dec 15, 2024 | ChatbotHallucination | —Unverified | 0 |
| Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning | Dec 15, 2024 | Hallucination | —Unverified | 0 |
| NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries | Dec 14, 2024 | BenchmarkingEmbodied Question Answering | —Unverified | 0 |
| Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data | Dec 14, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| Accelerating Retrieval-Augmented Generation | Dec 14, 2024 | CPUHallucination | —Unverified | 0 |
| Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts | Dec 13, 2024 | Hallucination | —Unverified | 0 |
| Benchmarking large language models for materials synthesis: the case of atomic layer deposition | Dec 13, 2024 | BenchmarkingHallucination | —Unverified | 0 |