| CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models | Mar 27, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| GRAPE: Generalizing Robot Policy via Preference Alignment | Nov 28, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| GR00T N1: An Open Foundation Model for Generalist Humanoid Robots | Mar 18, 2025 | Imitation LearningVision-Language-Action | —Unverified | 0 | 0 |
| GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Feb 13, 2025 | Contrastive LearningVideo Generation | —Unverified | 0 | 0 |
| Conditioning Matters: Training Diffusion Policies is Faster Than You Think | May 16, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| Automated Data Curation Using GPS & NLP to Generate Instruction-Action Pairs for Autonomous Vehicle Vision-Language Navigation Datasets | May 6, 2025 | Autonomous VehiclesTAG | —Unverified | 0 | 0 |
| General-purpose foundation models for increased autonomy in robot-assisted surgery | Jan 1, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| Grounding Multimodal LLMs to Embodied Agents that Ask for Help with Reinforcement Learning | Apr 1, 2025 | Reinforcement Learning (RL)Vision-Language-Action | —Unverified | 0 | 0 |
| From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models | Jun 11, 2025 | Imitation LearningVision-Language-Action | —Unverified | 0 | 0 |
| ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation | May 28, 2025 | Contact-rich ManipulationMixture-of-Experts | —Unverified | 0 | 0 |