| Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models | Oct 20, 2023 | Form, Hallucination | Unverified | 0 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Oct 19, 2023 | Hallucination, Mathematical Reasoning | Code Available | 0 |
| Know Where to Go: Make LLM a Relevant, Responsible, and Trustworthy Searcher | Oct 19, 2023 | Hallucination, Information Retrieval | Unverified | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Oct 19, 2023 | Hallucination, Language Modeling | Code Available | 0 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | Oct 19, 2023 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| Flow Dynamics Correction for Action Recognition | Oct 16, 2023 | Action Recognition, Fine-grained Action Recognition | Unverified | 0 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | Hallucination, Optical Character Recognition | Unverified | 0 |
| Metric Ensembles For Hallucination Detection | Oct 16, 2023 | Abstractive Text Summarization, Hallucination | Unverified | 0 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | Hallucination, Knowledge Probing | Code Available | 0 |
| Configuration Validation with Large Language Models | Oct 15, 2023 | Code Generation, Few-Shot Learning | Unverified | 0 |
| GameGPT: Multi-agent Collaborative Framework for Game Development | Oct 12, 2023 | Code Generation, Hallucination | Unverified | 0 |
| GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Oct 12, 2023 | Answer Generation, Hallucination | Code Available | 0 |
| A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection | Oct 10, 2023 | Hallucination, Sentence | Code Available | 0 |
| Towards Mitigating Hallucination in Large Language Models via Self-Reflection | Oct 10, 2023 | Answer Generation, Hallucination | Unverified | 0 |
| Teaching Language Models to Hallucinate Less with Synthetic Tasks | Oct 10, 2023 | Abstractive Text Summarization, Hallucination | Unverified | 0 |
| Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models | Oct 9, 2023 | Hallucination, Object | Unverified | 0 |
| The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations | Oct 8, 2023 | Hallucination | Unverified | 0 |
| Improving the Reliability of Large Language Models by Leveraging Uncertainty-Aware In-Context Learning | Oct 7, 2023 | Hallucination, In-Context Learning | Unverified | 0 |
| AutoHall: Automated Hallucination Dataset Generation for Large Language Models | Sep 30, 2023 | Dataset Generation, Fact Checking | Unverified | 0 |
| Self-Specialization: Uncovering Latent Expertise within Large Language Models | Sep 29, 2023 | Hallucination, Instruction Following | Unverified | 0 |
| Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving | Sep 28, 2023 | Hallucination, Question Answering | Unverified | 0 |
| Hallucination Reduction in Long Input Text Summarization | Sep 28, 2023 | Decoder, Hallucination | Code Available | 0 |
| Augmenting LLMs with Knowledge: A survey on hallucination prevention | Sep 28, 2023 | Hallucination, Language Modeling | Unverified | 0 |
| Aligning Large Multimodal Models with Factually Augmented RLHF | Sep 25, 2023 | Hallucination, Image Captioning | Unverified | 0 |
| Chain-of-Verification Reduces Hallucination in Large Language Models | Sep 20, 2023 | Hallucination, Text Generation | Code Available | 0 |