| ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis | Mar 15, 2025 | Domain GeneralizationRobot Manipulation | —Unverified | 0 | 0 |
| Refined Policy Distillation: From VLA Generalists to RL Experts | Mar 6, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| ReVLA: Reverting Visual Domain Limitation of Robotic Foundation Models | Sep 23, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models | Jun 21, 2025 | Model CompressionQuantization | —Unverified | 0 | 0 |
| RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation | Jun 7, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation | Jun 6, 2024 | Common Sense ReasoningMamba | —Unverified | 0 | 0 |
| RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation | Dec 18, 2024 | DiversityImitation Learning | —Unverified | 0 | 0 |
| RoboMonkey: Scaling Test-Time Sampling and Verification for Vision-Language-Action Models | Jun 21, 2025 | Synthetic Data GenerationVision-Language-Action | —Unverified | 0 | 0 |
| Robotic Control via Embodied Chain-of-Thought Reasoning | Jul 11, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| Robotic Policy Learning via Human-assisted Action Preference Optimization | Jun 8, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| ROSA: Harnessing Robot States for Vision-Language and Action Alignment | Jun 16, 2025 | State EstimationVision-Language-Action | —Unverified | 0 | 0 |
| RT-cache: Efficient Robot Trajectory Retrieval System | May 14, 2025 | RetrievalVision-Language-Action | —Unverified | 0 | 0 |
| Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust | Oct 2, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| SAFE: Multitask Failure Detection for Vision-Language-Action Models | Jun 11, 2025 | Conformal PredictionVision-Language-Action | —Unverified | 0 | 0 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning | Mar 5, 2025 | Safe Reinforcement LearningSafety Alignment | —Unverified | 0 | 0 |
| SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention | Dec 4, 2023 | Vision-Language-Action | —Unverified | 0 | 0 |
| SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters | Jan 1, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| Survey on Vision-Language-Action Models | Feb 7, 2025 | Review GenerationSurvey | —Unverified | 0 | 0 |
| Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction | May 30, 2025 | Action GenerationOptical Flow Estimation | —Unverified | 0 | 0 |
| Towards Natural Language-Driven Assembly Using Foundation Models | Jun 23, 2024 | FrictionVision-Language-Action | —Unverified | 0 | 0 |
| A Taxonomy for Evaluating Generalist Robot Policies | Mar 3, 2025 | Robot ManipulationVision-Language-Action | —Unverified | 0 | 0 |
| TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies | Dec 13, 2024 | Robot ManipulationVision-Language-Action | —Unverified | 0 | 0 |
| TrackVLA: Embodied Visual Tracking in the Wild | May 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Unified Vision-Language-Action Model | Jun 24, 2025 | Autonomous Drivingmodel | —Unverified | 0 | 0 |
| Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks | Dec 9, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |