| A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection | Dec 16, 2024 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | Dec 16, 2024 | HallucinationMultiple-choice | —Unverified | 0 |
| RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models | Dec 15, 2024 | Autonomous DrivingContrastive Learning | —Unverified | 0 |
| Task-Oriented Dialog Systems for the Senegalese Wolof Language | Dec 15, 2024 | ChatbotHallucination | —Unverified | 0 |
| Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning | Dec 15, 2024 | Hallucination | —Unverified | 0 |
| Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data | Dec 14, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries | Dec 14, 2024 | BenchmarkingEmbodied Question Answering | —Unverified | 0 |
| Accelerating Retrieval-Augmented Generation | Dec 14, 2024 | CPUHallucination | —Unverified | 0 |
| Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts | Dec 13, 2024 | Hallucination | —Unverified | 0 |
| Benchmarking large language models for materials synthesis: the case of atomic layer deposition | Dec 13, 2024 | BenchmarkingHallucination | —Unverified | 0 |
| TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering | Dec 13, 2024 | ArticlesHallucination | —Unverified | 0 |
| Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning | Dec 12, 2024 | Aspect-Based Sentiment Analysis (ABSA)Hallucination | —Unverified | 0 |
| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Dec 10, 2024 | Autonomous DrivingDescriptive | CodeCode Available | 0 |
| HalluCana: Fixing LLM Hallucination with A Canary Lookahead | Dec 10, 2024 | Hallucination | —Unverified | 0 |
| Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Dec 9, 2024 | Hallucination | CodeCode Available | 0 |
| Methods for Legal Citation Prediction in the Age of LLMs: An Australian Law Case Study | Dec 9, 2024 | Citation PredictionHallucination | —Unverified | 0 |
| Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent | Dec 7, 2024 | HallucinationQuestion Answering | —Unverified | 0 |
| LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs | Dec 6, 2024 | Entity AlignmentEntity Embeddings | —Unverified | 0 |
| Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models | Dec 6, 2024 | HallucinationOptical Character Recognition (OCR) | —Unverified | 0 |
| Steps are all you need: Rethinking STEM Education with Prompt Engineering | Dec 6, 2024 | AllHallucination | —Unverified | 0 |
| Multi-Objective Alignment of Large Language Models Through Hypervolume Maximization | Dec 6, 2024 | Hallucination | —Unverified | 0 |
| 100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5 Turbo | Dec 6, 2024 | HallucinationRAG | —Unverified | 0 |
| Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Dec 6, 2024 | document understandingHallucination | —Unverified | 0 |
| TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG | Dec 6, 2024 | ChunkingHallucination | —Unverified | 0 |
| Deep priors for satellite image restoration with accurate uncertainties | Dec 5, 2024 | DeblurringDenoising | —Unverified | 0 |