| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | Hallucination, Machine Translation | Code Available | 0 |
| Generation Constraint Scaling Can Mitigate Hallucination | Jul 23, 2024 | Decoder, Hallucination | Unverified | 0 |
| Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models | Jul 23, 2024 | Hallucination, Paraphrase Generation | Code Available | 0 |
| LawLuo: A Multi-Agent Collaborative Framework for Multi-Round Chinese Legal Consultation | Jul 23, 2024 | Hallucination, RAG | Unverified | 0 |
| Shared Imagination: LLMs Hallucinate Alike | Jul 23, 2024 | Hallucination, Question Answering | Unverified | 0 |
| Multilingual Fine-Grained News Headline Hallucination Detection | Jul 22, 2024 | Hallucination, Headline Generation | Unverified | 0 |
| Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service | Jul 22, 2024 | Hallucination, named-entity-recognition | Unverified | 0 |
| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | Code Available | 0 |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Jul 22, 2024 | Benchmarking, Hallucination | Code Available | 1 |
| Text2Place: Affordance-aware Text Guided Human Placement | Jul 22, 2024 | Attribute, Hallucination | Unverified | 0 |
| Data-Centric Human Preference Optimization with Rationales | Jul 19, 2024 | Hallucination | Code Available | 0 |
| Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Jul 18, 2024 | Hallucination, object-detection | Unverified | 0 |
| ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Jul 18, 2024 | Cross-Lingual Transfer, Hallucination | Code Available | 0 |
| Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Jul 18, 2024 | Decision Making, Hallucination | Unverified | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | Hallucination, Language Modelling | Unverified | 0 |
| Retrieval-Augmented Generation for Natural Language Processing: A Survey | Jul 18, 2024 | Hallucination, RAG | Unverified | 0 |
| Halu-J: Critique-Based Hallucination Judge | Jul 17, 2024 | Evidence Selection, Hallucination | Code Available | 4 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | Form, Hallucination | Code Available | 0 |
| What's Wrong? Refining Meeting Summaries with LLM Feedback | Jul 16, 2024 | Hallucination, Informativeness | Code Available | 0 |
| Learning Dynamics of LLM Finetuning | Jul 15, 2024 | Hallucination | Code Available | 3 |
| Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Jul 15, 2024 | Common Sense Reasoning, Hallucination | Unverified | 0 |
| GraphEval: A Knowledge-Graph Based LLM Hallucination Evaluation Framework | Jul 15, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Look Within, Why LLMs Hallucinate: A Causal Perspective | Jul 14, 2024 | Hallucination, Reading Comprehension | Unverified | 0 |
| Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues | Jul 13, 2024 | Diversity, Hallucination | Unverified | 0 |
| Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks | Jul 13, 2024 | Hallucination, Navigate | Code Available | 1 |