| UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent | Jan 31, 2025 | Robot Manipulation, Vision-Language-Action | —Unverified | 0 | 0 |
| Vision-Language-Action Model and Diffusion Policy Switching Enables Dexterous Control of an Anthropomorphic Hand | Oct 17, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| Vision-Language-Action Models: Concepts, Progress, Applications and Challenges | May 7, 2025 | Autonomous Vehicles, Natural Language Understanding | —Unverified | 0 | 0 |
| VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Dec 24, 2024 | Common Sense Reasoning, Transfer Learning | —Unverified | 0 | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Feb 4, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Jun 21, 2025 | Action Generation, Continual Learning | —Unverified | 0 | 0 |
| What Can RL Bring to VLA Generalization? An Empirical Study | May 26, 2025 | Reinforcement Learning (RL), Vision-Language-Action | —Unverified | 0 | 0 |
| Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation | Oct 10, 2024 | Robot Manipulation, Vision-Language-Action | —Unverified | 0 | 0 |
| Hybrid Reasoning for Perception, Explanation, and Autonomous Action in Manufacturing | Jun 10, 2025 | Retrieval-augmented Generation, Vision-Language-Action | —Unverified | 0 | 0 |
| FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency | Jun 10, 2025 | Action Generation, Image Generation | —Unverified | 0 | 0 |
| 3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks | May 9, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| 3D-VLA: A 3D Vision-Language-Action Generative World Model | Mar 14, 2024 | Language Modelling, Large Language Model | —Unverified | 0 | 0 |
| Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding | Mar 4, 2025 | Chunking, Vision-Language-Action | —Unverified | 0 | 0 |
| A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM | Oct 21, 2024 | Decision Making, Vision-Language-Action | —Unverified | 0 | 0 |
| An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models | Jun 10, 2025 | Action Generation, Image Captioning | —Unverified | 0 | 0 |
| AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation | Jul 17, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| Automated Data Curation Using GPS & NLP to Generate Instruction-Action Pairs for Autonomous Vehicle Vision-Language Navigation Datasets | May 6, 2025 | Autonomous Vehicles, TAG | —Unverified | 0 | 0 |
| BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization | May 22, 2025 | Backdoor Attack, Vision-Language-Action | —Unverified | 0 | 0 |
| Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding | Jan 8, 2025 | Robot Manipulation, Text Generation | —Unverified | 0 | 0 |
| Block-wise Adaptive Caching for Accelerating Diffusion Policy | Jun 16, 2025 | Action Generation, Denoising | —Unverified | 0 | 0 |
| BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models | Jun 9, 2025 | Robot Manipulation, Vision-Language-Action | —Unverified | 0 | 0 |
| CapsDT: Diffusion-Transformer for Capsule Robot Manipulation | Jun 19, 2025 | Diagnostic, Robot Manipulation | —Unverified | 0 | 0 |
| CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation | Nov 29, 2024 | Quantization, Vision-Language-Action | —Unverified | 0 | 0 |
| Conditioning Matters: Training Diffusion Policies is Faster Than You Think | May 16, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |
| CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models | Mar 27, 2025 | Vision-Language-Action | —Unverified | 0 | 0 |