| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Sep 22, 2024 | Hallucination, Hallucination Evaluation | Code Available | 0 |
| Contrastive Learning for Knowledge-Based Question Generation in Large Language Models | Sep 21, 2024 | Contrastive Learning, Hallucination | Unverified | 0 |
| FIHA: Autonomous Hallucination Evaluation in Vision-Language Models with Davidson Scene Graphs | Sep 20, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource Hallucination Detection in Large Language Models | Sep 20, 2024 | Hallucination, Sentence | Unverified | 0 |
| JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images | Sep 19, 2024 | Hallucination, Image Captioning | Code Available | 0 |
| LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks | Sep 19, 2024 | Autonomous Driving, Hallucination | Unverified | 0 |
| Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation | Sep 19, 2024 | Hallucination | Unverified | 0 |
| THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Sep 17, 2024 | Benchmarking, Binary Classification | Code Available | 0 |
| Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling | Sep 17, 2024 | Hallucination, Text Generation | Unverified | 0 |
| Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB | Sep 17, 2024 | 3D Human Pose Estimation, Hallucination | Unverified | 0 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | Hallucination, Instruction Following | Code Available | 0 |
| Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection | Sep 16, 2024 | Hallucination | Unverified | 0 |
| HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making | Sep 16, 2024 | Answer Generation, Decision Making | Code Available | 0 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactual, Hallucination | Unverified | 0 |
| Confidence Estimation for LLM-Based Dialogue State Tracking | Sep 15, 2024 | Dialogue State Tracking, Hallucination | Code Available | 0 |
| Explore the Hallucination on Low-level Perception for MLLMs | Sep 15, 2024 | Hallucination, Question Answering | Unverified | 0 |
| ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models | Sep 14, 2024 | Attribute, Hallucination | Unverified | 0 |
| Winning Solution For Meta KDD Cup' 24 | Sep 13, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Sep 11, 2024 | Ethics, Hallucination | Unverified | 0 |
| Safety challenges of AI in medicine in the era of large language models | Sep 11, 2024 | Hallucination | Unverified | 0 |
| Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding | Sep 10, 2024 | Hallucination, Image Captioning | Unverified | 0 |
| LLMs Will Always Hallucinate, and We Need to Live With This | Sep 9, 2024 | Fact Checking, Hallucination | Unverified | 0 |
| Generating Faithful and Salient Text from Multimodal Data | Sep 6, 2024 | Hallucination, Knowledge Graphs | Code Available | 0 |
| Detecting Buggy Contracts via Smart Testing | Sep 6, 2024 | Hallucination | Unverified | 0 |
| Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering | Sep 6, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| Vietnamese Legal Information Retrieval in Question-Answering System | Sep 5, 2024 | Hallucination, Information Retrieval | Unverified | 0 |
| CLUE: Concept-Level Uncertainty Estimation for Large Language Models | Sep 4, 2024 | Hallucination, Sentence | Unverified | 0 |
| Improved Single Camera BEV Perception Using Multi-Camera Training | Sep 4, 2024 | Autonomous Driving, Hallucination | Unverified | 0 |
| Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models | Sep 4, 2024 | GPU, Hallucination | Code Available | 0 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| Understanding Multimodal Hallucination with Parameter-Free Representation Alignment | Sep 2, 2024 | Hallucination, Object | Code Available | 0 |
| What does it take to get state of the art in simultaneous speech-to-speech translation? | Sep 2, 2024 | Hallucination, Management | Unverified | 0 |
| LLMs Prompted for Graphs: Hallucinations and Generative Capabilities | Aug 30, 2024 | Diversity, Hallucination | Unverified | 0 |
| Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data | Aug 30, 2024 | Hallucination, Phrase Grounding | Unverified | 0 |
| UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches | Aug 30, 2024 | Hallucination, Recommendation Systems | Unverified | 0 |
| VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images | Aug 28, 2024 | Hallucination | Code Available | 0 |
| Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation | Aug 27, 2024 | Hallucination, Retrieval-augmented Generation | Unverified | 0 |
| Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering | Aug 27, 2024 | Generative Question Answering, Hallucination | Unverified | 0 |
| Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation | Aug 27, 2024 | Hallucination, Image Generation | Unverified | 0 |
| Genetic Approach to Mitigate Hallucination in Generative IR | Aug 25, 2024 | Answer Generation, Hallucination | Code Available | 0 |
| Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Aug 25, 2024 | Decision Making, Hallucination | Unverified | 0 |
| Internal and External Knowledge Interactive Refinement Framework for Knowledge-Intensive Question Answering | Aug 23, 2024 | Hallucination, Question Answering | Unverified | 0 |
| Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Aug 23, 2024 | Hallucination, Prompt Engineering | Unverified | 0 |
| Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Aug 22, 2024 | Hallucination, Mixture-of-Experts | Code Available | 0 |
| RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data | Aug 22, 2024 | Hallucination | Code Available | 0 |
| GRATR: Zero-Shot Evidence Graph Retrieval-Augmented Trustworthiness Reasoning | Aug 22, 2024 | Decision Making, Hallucination | Code Available | 0 |
| MedDiT: A Knowledge-Controlled Diffusion Transformer Framework for Dynamic Medical Image Generation in Virtual Simulated Patient | Aug 22, 2024 | Diagnostic, Hallucination | Unverified | 0 |
| Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models | Aug 21, 2024 | Hallucination, Prompt Engineering | Unverified | 0 |
| RAG-Optimized Tibetan Tourism LLMs: Enhancing Accuracy and Personalization | Aug 21, 2024 | Hallucination, RAG | Unverified | 0 |
| MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation | Aug 19, 2024 | Diversity, Explainable Recommendation | Unverified | 0 |