| HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model | Mar 13, 2025 | Common Sense Reasoning, Denoising | Unverified | 0 |
| CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Mar 12, 2025 | Decision Making, Vision-Language-Action | Code Available | 2 |
| MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models | Mar 11, 2025 | Large Language Model, Mixture-of-Experts | Unverified | 0 |
| PointVLA: Injecting the 3D World into Vision-Language-Action Models | Mar 10, 2025 | Imitation Learning, Spatial Reasoning | Code Available | 4 |
| Refined Policy Distillation: From VLA Generalists to RL Experts | Mar 6, 2025 | Vision-Language-Action | Unverified | 0 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning | Mar 5, 2025 | Safe Reinforcement Learning, Safety Alignment | Unverified | 0 |
| OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction | Mar 5, 2025 | Vision-Language-Action, Zero-shot Generalization | Unverified | 0 |
| Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding | Mar 4, 2025 | Chunking, Vision-Language-Action | Unverified | 0 |
| A Taxonomy for Evaluating Generalist Robot Policies | Mar 3, 2025 | Robot Manipulation, Vision-Language-Action | Unverified | 0 |
| DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping | Feb 28, 2025 | Imitation Learning, Vision-Language-Action | Unverified | 0 |