| The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations | Mar 18, 2025 | Language Modeling | Unverified | 0 |
| VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences | Mar 18, 2025 | Continuous Control | Unverified | 0 |
| KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse | Mar 17, 2025 | Diversity, Language Modeling | Unverified | 0 |
| HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model | Mar 17, 2025 | Language Modeling | Unverified | 0 |
| Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Mar 17, 2025 | Continual Learning, Language Modeling | Unverified | 0 |
| PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing | Mar 17, 2025 | Denoising, Language Modeling | Unverified | 0 |
| Agents Play Thousands of 3D Video Games | Mar 17, 2025 | FPS Games, Language Modeling | Unverified | 0 |
| High-entropy Advantage in Neural Networks' Generalizability | Mar 17, 2025 | Language Modeling | Unverified | 0 |
| HybridGen: VLM-Guided Hybrid Planning for Scalable Data Generation of Imitation Learning | Mar 17, 2025 | Imitation Learning, Language Modeling | Unverified | 0 |
| MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Mar 17, 2025 | GPU, Language Modeling | Code Available | 2 |