| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Nov 15, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 1 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | EthicsFairness | CodeCode Available | 0 |
| Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification | Nov 15, 2023 | HallucinationRetrieval | CodeCode Available | 0 |
| Enhancing Emergency Decision-making with Knowledge Graphs and Large Language Models | Nov 15, 2023 | Decision MakingHallucination | —Unverified | 0 |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | Nov 15, 2023 | HallucinationRetrieval | —Unverified | 0 |
| Insights into Classifying and Mitigating LLMs' Hallucinations | Nov 14, 2023 | HallucinationMachine Translation | —Unverified | 0 |
| Predicting Text Preference Via Structured Comparative Reasoning | Nov 14, 2023 | HallucinationRetrieval | —Unverified | 0 |
| Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision | Nov 13, 2023 | HallucinationMM-Vet | CodeCode Available | 1 |
| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Nov 13, 2023 | AttributeHallucination | CodeCode Available | 1 |
| Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers | Nov 13, 2023 | Hallucinationknowledge editing | CodeCode Available | 1 |
| GPT-4V(ision) as A Social Media Analysis Engine | Nov 13, 2023 | HallucinationHate Speech Detection | —Unverified | 0 |
| Hallucination Augmented Recitations for Language Models | Nov 13, 2023 | counterfactualHallucination | —Unverified | 0 |
| Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models | Nov 13, 2023 | HallucinationMachine Translation | CodeCode Available | 0 |
| Hallucination-minimized Data-to-answer Framework for Financial Decision-makers | Nov 9, 2023 | Decision MakingHallucination | —Unverified | 0 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Nov 9, 2023 | HallucinationInformation Retrieval | CodeCode Available | 2 |
| CBSiMT: Mitigating Hallucination in Simultaneous Machine Translation with Weighted Prefix-to-Prefix Training | Nov 7, 2023 | HallucinationMachine Translation | —Unverified | 0 |
| Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges | Nov 6, 2023 | Hallucination | CodeCode Available | 1 |
| ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models | Nov 5, 2023 | HallucinationIn-Context Learning | —Unverified | 0 |
| SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency | Nov 3, 2023 | HallucinationQuestion Answering | CodeCode Available | 1 |
| CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL | Nov 2, 2023 | HallucinationRetrieval | CodeCode Available | 1 |
| Collaborative Large Language Model for Recommender Systems | Nov 2, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | Nov 2, 2023 | HallucinationMisinformation | —Unverified | 0 |
| Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling | Nov 1, 2023 | HallucinationKnowledge Distillation | CodeCode Available | 4 |
| Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment | Oct 31, 2023 | DenoisingHallucination | CodeCode Available | 0 |
| Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Oct 30, 2023 | Hallucination | CodeCode Available | 0 |
| Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation | Oct 28, 2023 | Dialogue GenerationHallucination | —Unverified | 0 |
| N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics | Oct 28, 2023 | FairnessHallucination | —Unverified | 0 |
| Virtual Accessory Try-On via Keypoint Hallucination | Oct 26, 2023 | HallucinationSemantic Segmentation | —Unverified | 0 |
| LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Oct 26, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Oct 25, 2023 | Data-to-Text GenerationHallucination | CodeCode Available | 0 |
| Correction with Backtracking Reduces Hallucination in Summarization | Oct 24, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 0 |
| Learned, uncertainty-driven adaptive acquisition for photon-efficient scanning microscopy | Oct 24, 2023 | DenoisingHallucination | —Unverified | 0 |
| Woodpecker: Hallucination Correction for Multimodal Large Language Models | Oct 24, 2023 | Hallucination | CodeCode Available | 2 |
| Hallucination Detection for Grounded Instruction Generation | Oct 23, 2023 | HallucinationNavigate | —Unverified | 0 |
| Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Oct 23, 2023 | Abstractive Text SummarizationDialogue Generation | CodeCode Available | 0 |
| Language Models Hallucinate, but May Excel at Fact Verification | Oct 23, 2023 | Fact VerificationHallucination | CodeCode Available | 0 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Oct 23, 2023 | DiagnosticHallucination | CodeCode Available | 2 |
| Unleashing the potential of prompt engineering for large language models | Oct 23, 2023 | HallucinationPrompt Engineering | —Unverified | 0 |
| Chainpoll: A high efficacy method for LLM hallucination detection | Oct 22, 2023 | HallucinationRetrieval-augmented Generation | CodeCode Available | 0 |
| Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models | Oct 20, 2023 | FormHallucination | —Unverified | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Oct 19, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Oct 19, 2023 | HallucinationMathematical Reasoning | CodeCode Available | 0 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | Oct 19, 2023 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Know Where to Go: Make LLM a Relevant, Responsible, and Trustworthy Searcher | Oct 19, 2023 | HallucinationInformation Retrieval | —Unverified | 0 |
| FactCHD: Benchmarking Fact-Conflicting Hallucination Detection | Oct 18, 2023 | BenchmarkingHallucination | CodeCode Available | 1 |
| LiDAR-based 4D Occupancy Completion and Forecasting | Oct 17, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Theory of Mind for Multi-Agent Collaboration via Large Language Models | Oct 16, 2023 | HallucinationMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | HallucinationOptical Character Recognition | —Unverified | 0 |
| Flow Dynamics Correction for Action Recognition | Oct 16, 2023 | Action RecognitionFine-grained Action Recognition | —Unverified | 0 |