| Safety challenges of AI in medicine in the era of large language models | Sep 11, 2024 | Hallucination | —Unverified | 0 |
| MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Sep 11, 2024 | EthicsHallucination | —Unverified | 0 |
| Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding | Sep 10, 2024 | HallucinationImage Captioning | —Unverified | 0 |
| LLMs Will Always Hallucinate, and We Need to Live With This | Sep 9, 2024 | Fact CheckingHallucination | —Unverified | 0 |
| Detecting Buggy Contracts via Smart Testing | Sep 6, 2024 | Hallucination | —Unverified | 0 |
| Generating Faithful and Salient Text from Multimodal Data | Sep 6, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 |
| Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering | Sep 6, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| Vietnamese Legal Information Retrieval in Question-Answering System | Sep 5, 2024 | HallucinationInformation Retrieval | —Unverified | 0 |
| Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models | Sep 4, 2024 | GPUHallucination | CodeCode Available | 0 |
| CLUE: Concept-Level Uncertainty Estimation for Large Language Models | Sep 4, 2024 | HallucinationSentence | —Unverified | 0 |
| Improved Single Camera BEV Perception Using Multi-Camera Training | Sep 4, 2024 | Autonomous DrivingHallucination | —Unverified | 0 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | BenchmarkingHallucination | CodeCode Available | 0 |
| What does it take to get state of the art in simultaneous speech-to-speech translation? | Sep 2, 2024 | HallucinationManagement | —Unverified | 0 |
| Understanding Multimodal Hallucination with Parameter-Free Representation Alignment | Sep 2, 2024 | HallucinationObject | CodeCode Available | 0 |
| Towards Empathetic Conversational Recommender Systems | Aug 30, 2024 | HallucinationRecommendation Systems | CodeCode Available | 1 |
| Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data | Aug 30, 2024 | HallucinationPhrase Grounding | —Unverified | 0 |
| LLMs Prompted for Graphs: Hallucinations and Generative Capabilities | Aug 30, 2024 | DiversityHallucination | —Unverified | 0 |
| Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning | Aug 30, 2024 | Hallucination | CodeCode Available | 1 |
| UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches | Aug 30, 2024 | HallucinationRecommendation Systems | —Unverified | 0 |
| VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images | Aug 28, 2024 | Hallucination | CodeCode Available | 0 |
| LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Aug 28, 2024 | Computational EfficiencyHallucination | CodeCode Available | 3 |
| Negation Blindness in Large Language Models: Unveiling the NO Syndrome in Image Generation | Aug 27, 2024 | HallucinationImage Generation | —Unverified | 0 |
| Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation | Aug 27, 2024 | HallucinationRetrieval-augmented Generation | —Unverified | 0 |
| Evidence-Enhanced Triplet Generation Framework for Hallucination Alleviation in Generative Question Answering | Aug 27, 2024 | Generative Question AnsweringHallucination | —Unverified | 0 |
| Genetic Approach to Mitigate Hallucination in Generative IR | Aug 25, 2024 | Answer GenerationHallucination | CodeCode Available | 0 |