| Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation | Dec 19, 2024 | Hallucination | —Unverified | 0 |
| Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling | Dec 19, 2024 | HallucinationText Generation | —Unverified | 0 |
| A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation | Dec 19, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Query pipeline optimization for cancer patient question answering systems | Dec 19, 2024 | HallucinationPassage Retrieval | —Unverified | 0 |
| Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation | Dec 19, 2024 | HallucinationRAG | —Unverified | 0 |
| Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence | Dec 18, 2024 | HallucinationMultimodal Reasoning | —Unverified | 0 |
| Are LLMs Good Literature Review Writers? Evaluating the Literature Review Writing Ability of Large Language Models | Dec 18, 2024 | Hallucination | —Unverified | 0 |
| ReXTrust: A Model for Fine-Grained Hallucination Detection in AI-Generated Radiology Reports | Dec 17, 2024 | Hallucination | —Unverified | 0 |
| A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models | Dec 17, 2024 | HallucinationRAG | —Unverified | 0 |
| When to Speak, When to Abstain: Contrastive Decoding with Abstention | Dec 17, 2024 | HallucinationQuestion Answering | —Unverified | 0 |
| What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context | Dec 17, 2024 | HallucinationMisinformation | —Unverified | 0 |
| A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection | Dec 16, 2024 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | Dec 16, 2024 | HallucinationRobot Manipulation | CodeCode Available | 2 |
| CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | Dec 16, 2024 | HallucinationMultiple-choice | —Unverified | 0 |
| Task-Oriented Dialog Systems for the Senegalese Wolof Language | Dec 15, 2024 | ChatbotHallucination | —Unverified | 0 |
| RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | Dec 15, 2024 | Autonomous DrivingContrastive Learning | —Unverified | 0 |
| Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning | Dec 15, 2024 | Hallucination | —Unverified | 0 |
| Accelerating Retrieval-Augmented Generation | Dec 14, 2024 | CPUHallucination | —Unverified | 0 |
| NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries | Dec 14, 2024 | BenchmarkingEmbodied Question Answering | —Unverified | 0 |
| Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data | Dec 14, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering | Dec 13, 2024 | ArticlesHallucination | —Unverified | 0 |
| Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts | Dec 13, 2024 | Hallucination | —Unverified | 0 |
| Benchmarking large language models for materials synthesis: the case of atomic layer deposition | Dec 13, 2024 | BenchmarkingHallucination | —Unverified | 0 |
| Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning | Dec 12, 2024 | Aspect-Based Sentiment Analysis (ABSA)Hallucination | —Unverified | 0 |
| Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Dec 12, 2024 | HallucinationKnowledge Graph Completion | CodeCode Available | 1 |