| WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | May 25, 2025 | SentenceZero-shot Generalization | —Unverified | 0 |
| G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning | May 24, 2025 | Link PredictionNode Classification | —Unverified | 0 |
| Anchored Diffusion Language Model | May 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning | May 22, 2025 | Zero-shot Generalization | —Unverified | 0 |
| EasyInsert: A Data-Efficient and Generalizable Insertion Policy | May 22, 2025 | Pose PredictionZero-shot Generalization | —Unverified | 0 |
| Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution Shifts | May 21, 2025 | Few-Shot LearningTask 2 | CodeCode Available | 0 |
| AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation | May 21, 2025 | Zero-shot Generalization | —Unverified | 0 |
| gen2seg: Generative Models Enable Generalizable Instance Segmentation | May 21, 2025 | DecoderInstance Segmentation | —Unverified | 0 |
| EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy | May 21, 2025 | Motion PlanningVision-Language-Action | —Unverified | 0 |
| ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling | May 19, 2025 | Graph GenerationKnowledge Distillation | —Unverified | 0 |