| OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | Jun 1, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |
| LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | May 31, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction | May 30, 2025 | Action GenerationOptical Flow Estimation | —Unverified | 0 |
| TrackVLA: Embodied Visual Tracking in the Wild | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better | May 29, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation | May 28, 2025 | Contact-rich ManipulationMixture-of-Experts | —Unverified | 0 |
| Hume: Introducing System-2 Thinking in Visual-Language-Action Model | May 27, 2025 | DenoisingVision-Language-Action | —Unverified | 0 |
| Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review | May 26, 2025 | Decision Making Under UncertaintySensor Fusion | —Unverified | 0 |
| What Can RL Bring to VLA Generalization? An Empirical Study | May 26, 2025 | Reinforcement Learning (RL)Vision-Language-Action | —Unverified | 0 |
| BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization | May 22, 2025 | Backdoor AttackVision-Language-Action | —Unverified | 0 |