| Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM Responses | Feb 27, 2024 | Hallucination | CodeCode Available | 0 |
| Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models | Feb 26, 2024 | Decision MakingHallucination | —Unverified | 0 |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Feb 26, 2024 | Causal Language ModelingGeneralized Referring Expression Segmentation | —Unverified | 0 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | BenchmarkingChatbot | CodeCode Available | 0 |
| Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware | Feb 25, 2024 | Hallucination | —Unverified | 0 |
| AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation | Feb 25, 2024 | Face GenerationHallucination | —Unverified | 0 |
| Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy | Feb 25, 2024 | HallucinationSentence | CodeCode Available | 1 |
| Citation-Enhanced Generation for LLM-based Chatbots | Feb 25, 2024 | ChatbotCitation Prediction | CodeCode Available | 1 |
| Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models | Feb 24, 2024 | HallucinationHallucination Evaluation | —Unverified | 0 |
| A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models | Feb 23, 2024 | Hallucination | CodeCode Available | 1 |
| CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean | Feb 23, 2024 | ClassificationHallucination | —Unverified | 0 |
| Seeing is Believing: Mitigating Hallucination in Large Vision-Language Models via CLIP-Guided Decoding | Feb 23, 2024 | HallucinationObject | CodeCode Available | 1 |
| UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models | Feb 22, 2024 | HallucinationRetrieval | CodeCode Available | 0 |
| DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models | Feb 22, 2024 | Hallucination | CodeCode Available | 0 |
| Visual Hallucinations of Multi-modal Large Language Models | Feb 22, 2024 | DiversityHallucination | CodeCode Available | 1 |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Feb 22, 2024 | HallucinationSentence | CodeCode Available | 2 |
| Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer | Feb 22, 2024 | Generative Question AnsweringHallucination | —Unverified | 0 |
| Science Checker Reloaded: A Bidirectional Paradigm for Transparency and Logical Reasoning | Feb 21, 2024 | HallucinationInformation Retrieval | CodeCode Available | 0 |
| Emergence and dynamics of delusions and hallucinations across stages in early psychosis | Feb 20, 2024 | Hallucination | —Unverified | 0 |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | Feb 20, 2024 | HallucinationMachine Translation | —Unverified | 0 |
| OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data | Feb 20, 2024 | Few-Shot LearningHallucination | —Unverified | 0 |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Feb 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| GOOD: Towards Domain Generalized Orientated Object Detection | Feb 20, 2024 | HallucinationObject | —Unverified | 0 |
| TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization | Feb 20, 2024 | HallucinationNews Summarization | CodeCode Available | 1 |
| Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations | Feb 19, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |