| Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models | Aug 4, 2024 | Hallucination | Code Available | 2 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | Articles, Hallucination | Code Available | 2 |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Jul 9, 2024 | Articles, Hallucination | Code Available | 2 |
| Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Jul 8, 2024 | Hallucination, Navigate | Code Available | 2 |
| ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Jul 5, 2024 | Hallucination, Long Form Question Answering | Code Available | 2 |
| MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation | Jul 2, 2024 | Hallucination, RAG | Code Available | 2 |
| Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation | Jun 26, 2024 | Hallucination, Knowledge Base Question Answering | Code Available | 2 |
| Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs | Jun 22, 2024 | Hallucination, Uncertainty Quantification | Code Available | 2 |
| Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework | Jun 20, 2024 | Hallucination, Question Answering | Code Available | 2 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Jun 19, 2024 | 8k, Hallucination | Code Available | 2 |