| BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models | Jun 9, 2025 | Robot Manipulation, Vision-Language-Action | —Unverified | 0 | 0 |
| CapsDT: Diffusion-Transformer for Capsule Robot Manipulation | Jun 19, 2025 | Diagnostic, Robot Manipulation | —Unverified | 0 | 0 |
| CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Nov 29, 2024 | Quantization, Vision-Language-Action | —Unverified | 0 | 0 |
| Conditioning Matters: Training Diffusion Policies is Faster Than You Think | May 16, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models | Mar 27, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | Aug 19, 2024 | Autonomous Driving, Caption Generation | —Unverified | 0 | 0 |
| CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation | Jun 24, 2025 | Chunking, Vision-Language-Action | —Unverified | 0 | 0 |
| DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data | Mar 25, 2025 | Robot Manipulation, Spatial Reasoning | —Unverified | 0 | 0 |
| DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping | Feb 28, 2025 | Imitation Learning, Vision-Language-Action | —Unverified | 0 | 0 |
| DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models | Jun 6, 2025 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |