| Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean | Apr 18, 2024 | Automated Theorem Proving, Hallucination | Code Available | 5 |
| Is There No Such Thing as a Bad Question? H4R: HalluciBot For Ratiocination, Rewriting, Ranking, and Routing | Apr 18, 2024 | Hallucination, Multiple-choice | Unverified | 0 |
| Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation | Apr 18, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Apr 17, 2024 | Deep Learning, Hallucination | Code Available | 0 |
| Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Apr 17, 2024 | Hallucination, Multimodal Reasoning | Code Available | 1 |
| Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales | Apr 17, 2024 | Hallucination | Unverified | 0 |
| Fewer Truncations Improve Language Modeling | Apr 16, 2024 | Combinatorial Optimization, Hallucination | Unverified | 0 |
| A computational account of the development and evolution of psychotic symptoms | Apr 16, 2024 | Hallucination | Unverified | 0 |
| Prescribing the Right Remedy: Mitigating Hallucinations in Large Vision-Language Models via Targeted Instruction Tuning | Apr 16, 2024 | Diagnostic, Hallucination | Unverified | 0 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Anatomy of Industrial Scale Multilingual ASR | Apr 15, 2024 | Anatomy, Automatic Speech Recognition | Unverified | 0 |
| Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs | Apr 15, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | Benchmarking, Bias Detection | Code Available | 1 |
| Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information | Apr 15, 2024 | Abstractive Text Summarization, Hallucination | Code Available | 0 |
| Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Apr 15, 2024 | Hallucination | Code Available | 1 |
| Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models | Apr 14, 2024 | Hallucination | Unverified | 0 |
| Distilling Reasoning Ability from Large Language Models with Adaptive Thinking | Apr 14, 2024 | Hallucination | Unverified | 0 |
| CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting | Apr 13, 2024 | Hallucination, Knowledge Graphs | Code Available | 1 |
| Reducing hallucination in structured outputs via Retrieval-Augmented Generation | Apr 12, 2024 | Hallucination, RAG | Unverified | 0 |
| View Selection for 3D Captioning via Diffusion Ranking | Apr 11, 2024 | 3D Object Captioning, Hallucination | Code Available | 3 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Apr 11, 2024 | Hallucination, Survey | Unverified | 0 |
| MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models | Apr 10, 2024 | Hallucination | Unverified | 0 |
| BRAVE: Broadening the visual encoding of vision-language models | Apr 10, 2024 | Hallucination, Language Modelling | Unverified | 0 |
| Tackling Structural Hallucination in Image Translation with Local Diffusion | Apr 9, 2024 | Hallucination, Image Generation | Code Available | 1 |
| Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports | Apr 9, 2024 | Form, Hallucination | Code Available | 0 |
| SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection | Apr 9, 2024 | Hallucination | Code Available | 0 |
| Automating Research Synthesis with Domain-Specific Large Language Model Fine-Tuning | Apr 8, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Hyperbolic Learning with Synthetic Captions for Open-World Detection | Apr 7, 2024 | Hallucination, Novel Concepts | Unverified | 0 |
| FGAIF: Aligning Large Vision-Language Models with Fine-grained AI Feedback | Apr 7, 2024 | Attribute, Hallucination | Unverified | 0 |
| HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models | Apr 7, 2024 | Hallucination, Representation Learning | Unverified | 0 |
| SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination | Apr 7, 2024 | Hallucination, Machine Translation | Code Available | 0 |
| On the Limitations of Large Language Models (LLMs): False Attribution | Apr 6, 2024 | Author Attribution, Hallucination | Unverified | 0 |
| PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition Dynamics | Apr 6, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | Apr 5, 2024 | Attribute, Hallucination | Unverified | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal Prediction, Generative Question Answering | Unverified | 0 |
| SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection | Apr 4, 2024 | Hallucination, In-Context Learning | Code Available | 0 |
| Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations | Apr 4, 2024 | Hallucination, Human Detection | Code Available | 0 |
| A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation | Apr 4, 2024 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact Checking, Form | Code Available | 2 |
| Scalable Model Editing via Customized Expert Networks | Apr 3, 2024 | Hallucination, model | Code Available | 0 |
| ALOHa: A New Measure for Hallucination in Captioning Models | Apr 3, 2024 | Hallucination, Object | Unverified | 0 |
| Comparative Study of Domain Driven Terms Extraction Using Large Language Models | Apr 2, 2024 | Document Summarization, Hallucination | Unverified | 0 |
| Extracting Norms from Contracts Via ChatGPT: Opportunities and Challenges | Apr 2, 2024 | Hallucination | Unverified | 0 |
| Hallucination Diversity-Aware Active Learning for Text Summarization | Apr 2, 2024 | Active Learning, Diversity | Unverified | 0 |
| AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis | Apr 1, 2024 | Binary Classification, Hallucination | Code Available | 0 |
| Exploring and Evaluating Hallucinations in LLM-Powered Code Generation | Apr 1, 2024 | Code Generation, Hallucination | Unverified | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | Hallucination, Task Planning | Code Available | 0 |
| On Large Language Models' Hallucination with Regard to Known Facts | Mar 29, 2024 | Hallucination, Triplet | Code Available | 0 |