| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense ReasoningDenoising | CodeCode Available | 1 |
| An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models | Jun 7, 2024 | Hallucinationparameter-efficient fine-tuning | CodeCode Available | 1 |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | May 31, 2024 | HallucinationMulti-Task Learning | CodeCode Available | 1 |
| TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | May 28, 2024 | Hallucination | CodeCode Available | 1 |
| DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | May 24, 2024 | Hallucination | CodeCode Available | 1 |
| Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | May 24, 2024 | HallucinationResponse Generation | CodeCode Available | 1 |
| The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | May 21, 2024 | HallucinationRAG | CodeCode Available | 1 |
| Automated Multi-level Preference for MLLMs | May 18, 2024 | Dataset GenerationHallucination | CodeCode Available | 1 |
| Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | May 16, 2024 | Contrastive LearningHallucination | CodeCode Available | 1 |