| Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection | Sep 24, 2024 | Hallucination, Semantic Parsing | Unverified | 0 |
| Parse Trees Guided LLM Prompt Compression | Sep 23, 2024 | Hallucination | Code Available | 0 |
| A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor? | Sep 23, 2024 | Hallucination, MedQA | Unverified | 0 |
| Enhancing Scientific Reproducibility Through Automated BioCompute Object Creation Using Retrieval-Augmented Generation from Publications | Sep 23, 2024 | Hallucination, Long-Context Understanding | Unverified | 0 |
| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Sep 22, 2024 | Hallucination, Hallucination Evaluation | Code Available | 0 |
| Contrastive Learning for Knowledge-Based Question Generation in Large Language Models | Sep 21, 2024 | Contrastive Learning, Hallucination | Unverified | 0 |
| FAIR GPT: A virtual consultant for research data management in ChatGPT | Sep 20, 2024 | Fairness, Hallucination | Code Available | 1 |
| A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource Hallucination Detection in Large Language Models | Sep 20, 2024 | Hallucination, Sentence | Unverified | 0 |
| FIHA: Autonomous Hallucination Evaluation in Vision-Language Models with Davidson Scene Graphs | Sep 20, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images | Sep 19, 2024 | Hallucination, Image Captioning | Code Available | 0 |
| Textualized Agent-Style Reasoning for Complex Tasks by Multiple Round LLM Generation | Sep 19, 2024 | Hallucination | Unverified | 0 |
| LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks | Sep 19, 2024 | Autonomous Driving, Hallucination | Unverified | 0 |
| Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Sep 19, 2024 | Hallucination, Hallucination Evaluation | Code Available | 1 |
| Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB | Sep 17, 2024 | 3D Human Pose Estimation, Hallucination | Unverified | 0 |
| Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling | Sep 17, 2024 | Hallucination, Text Generation | Unverified | 0 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | Hallucination, Instruction Following | Code Available | 0 |
| THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Sep 17, 2024 | Benchmarking, Binary Classification | Code Available | 0 |
| Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection | Sep 16, 2024 | Hallucination | Unverified | 0 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactual, Hallucination | Unverified | 0 |
| Trustworthiness in Retrieval-Augmented Generation Systems: A Survey | Sep 16, 2024 | Fairness, Hallucination | Code Available | 1 |
| HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making | Sep 16, 2024 | Answer Generation, Decision Making | Code Available | 0 |
| Confidence Estimation for LLM-Based Dialogue State Tracking | Sep 15, 2024 | Dialogue State Tracking, Hallucination | Code Available | 0 |
| Explore the Hallucination on Low-level Perception for MLLMs | Sep 15, 2024 | Hallucination, Question Answering | Unverified | 0 |
| ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models | Sep 14, 2024 | Attribute, Hallucination | Unverified | 0 |
| Winning Solution For Meta KDD Cup' 24 | Sep 13, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |