| Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 |
| Data-augmented phrase-level alignment for mitigating object hallucination | May 28, 2024 | Data AugmentationHallucination | —Unverified | 0 |
| RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models | May 28, 2024 | HallucinationMME | —Unverified | 0 |
| Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action | May 28, 2024 | Conversational Question AnsweringHallucination | —Unverified | 0 |
| TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness | May 27, 2024 | HallucinationImage Captioning | CodeCode Available | 11 |
| Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings | May 27, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks | May 27, 2024 | HallucinationObject Hallucination | CodeCode Available | 0 |
| GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases | May 25, 2024 | BenchmarkingHallucination | —Unverified | 0 |
| Large Language Model Pruning | May 24, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement | May 24, 2024 | HallucinationImage Comprehension | CodeCode Available | 2 |
| CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems | May 24, 2024 | DiagnosticHallucination | —Unverified | 0 |
| Scaling Laws for Discriminative Classification in Large Language Models | May 24, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | May 24, 2024 | Hallucination | CodeCode Available | 1 |
| Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization | May 24, 2024 | Hallucination | CodeCode Available | 0 |
| Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | May 24, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| Calibrated Self-Rewarding Vision Language Models | May 23, 2024 | HallucinationLanguage Modelling | CodeCode Available | 2 |
| RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models | May 23, 2024 | HallucinationSentence | CodeCode Available | 3 |
| WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | May 23, 2024 | HallucinationModel Editing | —Unverified | 0 |
| Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation | May 22, 2024 | Caption GenerationHallucination | —Unverified | 0 |
| Gradient Projection For Continual Parameter-Efficient Tuning | May 22, 2024 | Continual LearningHallucination | —Unverified | 0 |
| CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models | May 22, 2024 | BenchmarkingHallucination | —Unverified | 0 |
| GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games | May 22, 2024 | Code GenerationDecision Making | —Unverified | 0 |
| Presentations are not always linear! GNN meets LLM for Document-to-Presentation Transformation with Attribution | May 21, 2024 | Graph Neural NetworkHallucination | —Unverified | 0 |