| UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent | Jan 31, 2025 | Robot ManipulationVision-Language-Action | —Unverified | 0 |
| Vision-Language-Action Model and Diffusion Policy Switching Enables Dexterous Control of an Anthropomorphic Hand | Oct 17, 2024 | Vision-Language-Action | —Unverified | 0 |
| Vision-Language-Action Models: Concepts, Progress, Applications and Challenges | May 7, 2025 | Autonomous VehiclesNatural Language Understanding | —Unverified | 0 |
| Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation | Oct 10, 2024 | Robot ManipulationVision-Language-Action | —Unverified | 0 |
| Hybrid Reasoning for Perception, Explanation, and Autonomous Action in Manufacturing | Jun 10, 2025 | Retrieval-augmented GenerationVision-Language-Action | —Unverified | 0 |
| FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency | Jun 10, 2025 | Action GenerationImage Generation | —Unverified | 0 |
| 3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks | May 9, 2025 | Vision-Language-Action | —Unverified | 0 |
| 3D-VLA: A 3D Vision-Language-Action Generative World Model | Mar 14, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding | Mar 4, 2025 | ChunkingVision-Language-Action | —Unverified | 0 |
| A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM | Oct 21, 2024 | Decision MakingVision-Language-Action | —Unverified | 0 |
| An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models | Jun 10, 2025 | Action GenerationImage Captioning | —Unverified | 0 |
| AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation | Jul 17, 2025 | Vision-Language-Action | —Unverified | 0 |
| Automated Data Curation Using GPS & NLP to Generate Instruction-Action Pairs for Autonomous Vehicle Vision-Language Navigation Datasets | May 6, 2025 | Autonomous VehiclesTAG | —Unverified | 0 |
| BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization | May 22, 2025 | Backdoor AttackVision-Language-Action | —Unverified | 0 |
| Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding | Jan 8, 2025 | Robot ManipulationText Generation | —Unverified | 0 |
| Block-wise Adaptive Caching for Accelerating Diffusion Policy | Jun 16, 2025 | Action GenerationDenoising | —Unverified | 0 |
| BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models | Jun 9, 2025 | Robot ManipulationVision-Language-Action | —Unverified | 0 |
| CapsDT: Diffusion-Transformer for Capsule Robot Manipulation | Jun 19, 2025 | DiagnosticRobot Manipulation | —Unverified | 0 |
| CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Nov 29, 2024 | QuantizationVision-Language-Action | —Unverified | 0 |
| Conditioning Matters: Training Diffusion Policies is Faster Than You Think | May 16, 2025 | Vision-Language-Action | —Unverified | 0 |
| CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models | Mar 27, 2025 | Vision-Language-Action | —Unverified | 0 |
| CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | Aug 19, 2024 | Autonomous DrivingCaption Generation | —Unverified | 0 |
| CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation | Jun 24, 2025 | ChunkingVision-Language-Action | —Unverified | 0 |
| DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data | Mar 25, 2025 | Robot ManipulationSpatial Reasoning | —Unverified | 0 |
| DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping | Feb 28, 2025 | Imitation LearningVision-Language-Action | —Unverified | 0 |