| REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing | May 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications | May 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors | May 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster | May 24, 2025 | Heuristic Search, Language Modeling | Unverified | 0 |
| BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks | May 24, 2025 | Activity Recognition, Descriptive | Unverified | 0 |
| EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models | May 24, 2025 | Image-text Retrieval, Language Modeling | Unverified | 0 |
| Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment | May 24, 2025 | Image Super-Resolution, Language Modeling | Unverified | 0 |
| Inference Compute-Optimal Video Vision Language Models | May 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | May 23, 2025 | Decoder, Image Captioning | Code Available | 4 |
| RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning | May 23, 2025 | Image Generation, Language Modeling | Code Available | 1 |