| LaViPlan: Language-Guided Visual Path Planning with RLVR | Jul 17, 2025 | Autonomous Driving, Vision-Language-Action | Unverified | 0 |
| AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation | Jul 17, 2025 | Vision-Language-Action | Unverified | 0 |
| Vision Language Action Models in Robotic Manipulation: A Systematic Review | Jul 14, 2025 | Dataset Generation, Natural Language Understanding | Code Available | 2 |
| VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Jul 7, 2025 | Depth Estimation, Vision-Language-Action | Code Available | 1 |
| DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Jul 6, 2025 | Image Generation, Multimodal Reasoning | Code Available | 3 |
| A Survey on Vision-Language-Action Models for Autonomous Driving | Jun 30, 2025 | Autonomous Driving, Autonomous Vehicles | Code Available | 4 |
| WorldVLA: Towards Autoregressive Action World Model | Jun 26, 2025 | Action Generation, model | Code Available | 4 |
| Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | Jun 26, 2025 | Action Generation, Vision-Language-Action | Code Available | 2 |
| Unified Vision-Language-Action Model | Jun 24, 2025 | Autonomous Driving, model | Unverified | 0 |
| CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation | Jun 24, 2025 | Chunking, Vision-Language-Action | Unverified | 0 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Jun 21, 2025 | Action Generation, Continual Learning | Unverified | 0 |
| RLRC: Reinforcement Learning-based Recovery for Compressed Vision-Language-Action Models | Jun 21, 2025 | Model Compression, Quantization | Unverified | 0 |
| RoboMonkey: Scaling Test-Time Sampling and Verification for Vision-Language-Action Models | Jun 21, 2025 | Synthetic Data Generation, Vision-Language-Action | Unverified | 0 |
| CapsDT: Diffusion-Transformer for Capsule Robot Manipulation | Jun 19, 2025 | Diagnostic, Robot Manipulation | Unverified | 0 |
| A Comprehensive Survey on Continual Learning in Generative Models | Jun 16, 2025 | Continual Learning, Survey | Code Available | 2 |
| LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction | Jun 16, 2025 | Instruction Following, Vision-Language-Action | Unverified | 0 |
| ROSA: Harnessing Robot States for Vision-Language and Action Alignment | Jun 16, 2025 | State Estimation, Vision-Language-Action | Unverified | 0 |
| Block-wise Adaptive Caching for Accelerating Diffusion Policy | Jun 16, 2025 | Action Generation, Denoising | Unverified | 0 |
| AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Jun 16, 2025 | Action Generation, Autonomous Driving | Code Available | 3 |
| EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models | Jun 11, 2025 | Vision-Language-Action | Unverified | 0 |
| From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models | Jun 11, 2025 | Imitation Learning, Vision-Language-Action | Unverified | 0 |
| SAFE: Multitask Failure Detection for Vision-Language-Action Models | Jun 11, 2025 | Conformal Prediction, Vision-Language-Action | Unverified | 0 |
| An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models | Jun 10, 2025 | Action Generation, Image Captioning | Unverified | 0 |
| TGRPO: Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization | Jun 10, 2025 | reinforcement-learning, Reinforcement Learning | Code Available | 0 |
| FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency | Jun 10, 2025 | Action Generation, Image Generation | Unverified | 0 |