| Deformable Attentive Visual Enhancement for Referring Segmentation Using Vision-Language Model | May 25, 2025 | cross-modal alignmentImage Segmentation | —Unverified | 0 |
| Meta-aware Learning in text-to-SQL Large Language Model | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM) | May 25, 2025 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| Towards Reliable Large Audio Language Model | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM-QFL: Distilling Large Language Model for Quantum Federated Learning | May 24, 2025 | Federated LearningLanguage Modeling | CodeCode Available | 0 |
| Partition Generative Modeling: Masked Modeling Without Masks | May 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 4 |
| REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| metaTextGrad: Automatically optimizing language model optimizers | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster | May 24, 2025 | Heuristic SearchLanguage Modeling | —Unverified | 0 |
| Anchored Diffusion Language Model | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TULUN: Transparent and Adaptable Low-resource Machine Translation | May 24, 2025 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Disentangling Knowledge Representations for Large Language Model Editing | May 24, 2025 | Disentanglementknowledge editing | —Unverified | 0 |
| Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment | May 24, 2025 | Image Super-ResolutionLanguage Modeling | —Unverified | 0 |
| Inference Compute-Optimal Video Vision Language Models | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks | May 24, 2025 | Activity RecognitionDescriptive | —Unverified | 0 |
| EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models | May 24, 2025 | Image-text RetrievalLanguage Modeling | —Unverified | 0 |
| Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | May 23, 2025 | DecoderImage Captioning | CodeCode Available | 4 |
| RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning | May 23, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models | May 23, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| QwenLong-CPRS: Towards -LLMs with Dynamic Context Optimization | May 23, 2025 | 4kLanguage Modeling | —Unverified | 0 |
| Large language model as user daily behavior data generator: balancing population diversity and individual personality | May 23, 2025 | Data AugmentationDiversity | —Unverified | 0 |