| UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission Generation | Jan 9, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 | 5 |
| Vision Language Action Models in Robotic Manipulation: A Systematic Review | Jul 14, 2025 | Dataset GenerationNatural Language Understanding | CodeCode Available | 2 | 5 |
| RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control | Jul 28, 2023 | ObjectQuestion Answering | CodeCode Available | 2 | 5 |
| RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World | Nov 29, 2024 | Robot Task PlanningScheduling | CodeCode Available | 2 | 5 |
| BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation | Jun 9, 2025 | QuantizationVision-Language-Action | CodeCode Available | 2 | 5 |
| An Embodied Generalist Agent in 3D World | Nov 18, 2023 | 3D dense captioning3D Question Answering (3D-QA) | CodeCode Available | 2 | 5 |
| Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends | Jun 26, 2025 | Action GenerationVision-Language-Action | CodeCode Available | 2 | 5 |
| Diffusion Transformer Policy | Oct 21, 2024 | DenoisingVision-Language-Action | CodeCode Available | 2 | 5 |
| TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation | Sep 19, 2024 | Vision-Language-Action | CodeCode Available | 2 | 5 |
| Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics | Nov 18, 2024 | Vision-Language-Action | CodeCode Available | 2 | 5 |
| DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Nov 4, 2024 | GPURobot Manipulation | CodeCode Available | 2 | 5 |
| ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge | May 28, 2025 | Imitation LearningMath | CodeCode Available | 1 | 5 |
| VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting | Jul 7, 2025 | Depth EstimationVision-Language-Action | CodeCode Available | 1 | 5 |
| ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model | Feb 20, 2025 | Mixture-of-ExpertsQuestion Answering | CodeCode Available | 1 | 5 |
| Bridging Language, Vision and Action: Multimodal VAEs in Robotic Manipulation Tasks | Apr 2, 2024 | Vision-Language-Action | CodeCode Available | 1 | 5 |
| RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction | May 18, 2025 | Vision-Language-Action | CodeCode Available | 1 | 5 |
| Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks | Nov 4, 2024 | Action GenerationBenchmarking | CodeCode Available | 1 | 5 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments | May 8, 2025 | BenchmarkingPrompt Engineering | CodeCode Available | 1 | 5 |
| Adversarial Attacks on Robotic Vision Language Action Models | Jun 3, 2025 | Vision-Language-Action | CodeCode Available | 1 | 5 |
| From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation | May 13, 2025 | Robot ManipulationSpatial Reasoning | CodeCode Available | 1 | 5 |
| Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | May 31, 2024 | Action RecognitionContrastive Learning | CodeCode Available | 0 | 5 |
| Perceptual Quality Assessment for Embodied AI | May 22, 2025 | Image Quality AssessmentVision-Language-Action | CodeCode Available | 0 | 5 |
| Surgeon Style Fingerprinting and Privacy Risk Quantification via Discrete Diffusion Models in a Vision-Language-Action Framework | Jun 9, 2025 | DenoisingVision-Language-Action | CodeCode Available | 0 | 5 |
| TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization | Jun 10, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 | 5 |