| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive LearningIntent Recognition | —Unverified | 0 |
| Improved Alignment of Modalities in Large Vision Language Models | Mar 25, 2025 | GPUImage Captioning | —Unverified | 0 |
| SemEval-2025 Task 9: The Food Hazard Detection Challenge | Mar 25, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model | Mar 25, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Optimizing Language Models for Inference Time Objectives using Reinforcement Learning | Mar 25, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis | Mar 25, 2025 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code CompletionLanguage Modeling | CodeCode Available | 1 |
| CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model | Mar 25, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Exploring Textual Semantics Diversity for Image Transmission in Semantic Communication Systems using Visual Language Model | Mar 25, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success | Mar 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLEAR: Contrasting Textual Feedback with Experts and Amateurs for Reasoning | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Uncertainty Quantification with Attention Chain | Mar 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| A Survey of Large Language Model Agents for Question Answering | Mar 24, 2025 | Answer GenerationInformation Retrieval | —Unverified | 0 |
| LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Mar 24, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ClinText-SP and RigoBERTa Clinical: a new set of open resources for Spanish Clinical NLP | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving Situation Puzzles with Large Language Model and External Reformulation | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MMCR: Advancing Visual Language Model in Multimodal Multi-Turn Contextual Reasoning | Mar 24, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics | Mar 24, 2025 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |