| Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean | Apr 18, 2024 | Automated Theorem Proving, Hallucination | Code Available | 5 |
| Is There No Such Thing as a Bad Question? H4R: HalluciBot For Ratiocination, Rewriting, Ranking, and Routing | Apr 18, 2024 | Hallucination, Multiple-choice | Unverified | 0 |
| Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation | Apr 18, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Apr 17, 2024 | Deep Learning, Hallucination | Code Available | 0 |
| Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Apr 17, 2024 | Hallucination, Multimodal Reasoning | Code Available | 1 |
| Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales | Apr 17, 2024 | Hallucination | Unverified | 0 |
| Fewer Truncations Improve Language Modeling | Apr 16, 2024 | Combinatorial Optimization, Hallucination | Unverified | 0 |
| A computational account of the development and evolution of psychotic symptoms | Apr 16, 2024 | Hallucination | Unverified | 0 |
| Prescribing the Right Remedy: Mitigating Hallucinations in Large Vision-Language Models via Targeted Instruction Tuning | Apr 16, 2024 | Diagnostic, Hallucination | Unverified | 0 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Anatomy of Industrial Scale Multilingual ASR | Apr 15, 2024 | Anatomy, Automatic Speech Recognition | Unverified | 0 |
| Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs | Apr 15, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | Apr 15, 2024 | Benchmarking, Bias Detection | Code Available | 1 |
| Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information | Apr 15, 2024 | Abstractive Text Summarization, Hallucination | Code Available | 0 |
| Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Apr 15, 2024 | Hallucination | Code Available | 1 |
| Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models | Apr 14, 2024 | Hallucination | Unverified | 0 |
| Distilling Reasoning Ability from Large Language Models with Adaptive Thinking | Apr 14, 2024 | Hallucination | Unverified | 0 |
| CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting | Apr 13, 2024 | Hallucination, Knowledge Graphs | Code Available | 1 |
| Reducing hallucination in structured outputs via Retrieval-Augmented Generation | Apr 12, 2024 | Hallucination, RAG | Unverified | 0 |
| View Selection for 3D Captioning via Diffusion Ranking | Apr 11, 2024 | 3D Object Captioning, Hallucination | Code Available | 3 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Apr 11, 2024 | Hallucination, Survey | Unverified | 0 |
| MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models | Apr 10, 2024 | Hallucination | Unverified | 0 |
| BRAVE: Broadening the visual encoding of vision-language models | Apr 10, 2024 | Hallucination, Language Modelling | Unverified | 0 |