| Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text | Aug 10, 2024 | Automatic Speech Recognition, Hallucination | Unverified | 0 |
| SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning | Aug 10, 2024 | Hallucination, Optical Character Recognition | Code Available | 11 |
| Order Matters in Hallucination: Reasoning Order as Benchmark and Reflexive Prompting for Large-Language-Models | Aug 9, 2024 | Hallucination | Code Available | 0 |
| FiSTECH: Financial Style Transfer to Enhance Creativity without Hallucinations in LLMs | Aug 9, 2024 | Chatbot, Hallucination | Unverified | 0 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| KnowPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models | Aug 6, 2024 | Hallucination, RAG | Unverified | 0 |
| Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models | Aug 4, 2024 | Hallucination | Code Available | 2 |
| MAO: A Framework for Process Model Generation with Multi-Agent Orchestration | Aug 4, 2024 | Hallucination, software testing | Unverified | 0 |
| Improving Zero-Shot ObjectNav with Generative Communication | Aug 3, 2024 | Hallucination, Navigate | Unverified | 0 |
| MiniCPM-V: A GPT-4V Level MLLM on Your Phone | Aug 3, 2024 | Hallucination, Multiple-choice | Code Available | 12 |
| Misinforming LLMs: vulnerabilities, challenges and opportunities | Aug 2, 2024 | Hallucination, Misinformation | Unverified | 0 |
| Piculet: Specialized Models-Guided Hallucination Decrease for MultiModal Large Language Models | Aug 2, 2024 | Hallucination | Unverified | 0 |
| Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Aug 2, 2024 | Attribute, Hallucination | Code Available | 1 |
| RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Aug 2, 2024 | Benchmarking, Dataset Generation | Code Available | 3 |
| Alleviating Hallucination in Large Vision-Language Models with Active Retrieval Augmentation | Aug 1, 2024 | Hallucination, Image Comprehension | Unverified | 0 |
| Mitigating Multilingual Hallucination in Large Vision-Language Models | Aug 1, 2024 | Hallucination | Code Available | 1 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | Articles, Hallucination | Code Available | 2 |
| Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs | Jul 31, 2024 | Hallucination, Image Comprehension | Code Available | 1 |
| Cost-Effective Hallucination Detection for LLMs | Jul 31, 2024 | Decision Making, Fact Checking | Unverified | 0 |
| Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering | Jul 31, 2024 | Diagnostic, Hallucination | Unverified | 0 |
| Automated Review Generation Method Based on Large Language Models | Jul 30, 2024 | Articles, Hallucination | Code Available | 1 |
| Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate | Jul 30, 2024 | Hallucination | Unverified | 0 |
| VILA^2: VILA Augmented VILA | Jul 24, 2024 | Hallucination, Optical Character Recognition (OCR) | Unverified | 0 |
| WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries | Jul 24, 2024 | Chatbot, Form | Unverified | 0 |
| Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | Hallucination, Hallucination Evaluation | Code Available | 1 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | Hallucination, Machine Translation | Code Available | 0 |
| Generation Constraint Scaling Can Mitigate Hallucination | Jul 23, 2024 | Decoder, Hallucination | Unverified | 0 |
| Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Jul 23, 2024 | Hallucination, Paraphrase Generation | Code Available | 0 |
| LawLuo: A Multi-Agent Collaborative Framework for Multi-Round Chinese Legal Consultation | Jul 23, 2024 | Hallucination, RAG | Unverified | 0 |
| Shared Imagination: LLMs Hallucinate Alike | Jul 23, 2024 | Hallucination, Question Answering | Unverified | 0 |
| Multilingual Fine-Grained News Headline Hallucination Detection | Jul 22, 2024 | Hallucination, Headline Generation | Unverified | 0 |
| Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service | Jul 22, 2024 | Hallucination, named-entity-recognition | Unverified | 0 |
| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | Code Available | 0 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | Benchmarking, Hallucination | Code Available | 1 |
| Text2Place: Affordance-aware Text Guided Human Placement | Jul 22, 2024 | Attribute, Hallucination | Unverified | 0 |
| Data-Centric Human Preference Optimization with Rationales | Jul 19, 2024 | Hallucination | Code Available | 0 |
| Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Jul 18, 2024 | Hallucination, object-detection | Unverified | 0 |
| ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Jul 18, 2024 | Cross-Lingual Transfer, Hallucination | Code Available | 0 |
| Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Jul 18, 2024 | Decision Making, Hallucination | Unverified | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | Hallucination, Language Modelling | Unverified | 0 |
| Retrieval-Augmented Generation for Natural Language Processing: A Survey | Jul 18, 2024 | Hallucination, RAG | Unverified | 0 |
| Halu-J: Critique-Based Hallucination Judge | Jul 17, 2024 | Evidence Selection, Hallucination | Code Available | 4 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | Form, Hallucination | Code Available | 0 |
| What's Wrong? Refining Meeting Summaries with LLM Feedback | Jul 16, 2024 | Hallucination, Informativeness | Code Available | 0 |
| Learning Dynamics of LLM Finetuning | Jul 15, 2024 | Hallucination | Code Available | 3 |
| Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Jul 15, 2024 | Common Sense Reasoning, Hallucination | Unverified | 0 |
| GraphEval: A Knowledge-Graph Based LLM Hallucination Evaluation Framework | Jul 15, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Look Within, Why LLMs Hallucinate: A Causal Perspective | Jul 14, 2024 | Hallucination, Reading Comprehension | Unverified | 0 |
| Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues | Jul 13, 2024 | Diversity, Hallucination | Unverified | 0 |
| Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks | Jul 13, 2024 | Hallucination, Navigate | Code Available | 1 |