| Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success | Feb 27, 2025 | Action GenerationChunking | CodeCode Available | 5 |
| ObjectVLA: End-to-End Open-World Object Manipulation Without Demonstration | Feb 26, 2025 | Imitation LearningObject | —Unverified | 0 |
| Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models | Feb 26, 2025 | Instruction FollowingVision-Language-Action | —Unverified | 0 |
| Evolution 6.0: Evolving Robotic Capabilities Through Generative Design | Feb 24, 2025 | Action GenerationText to 3D | —Unverified | 0 |
| ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model | Feb 20, 2025 | Mixture-of-ExpertsQuestion Answering | CodeCode Available | 1 |
| GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Feb 13, 2025 | Contrastive LearningVideo Generation | —Unverified | 0 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy | Feb 8, 2025 | Q-LearningSafe Exploration | CodeCode Available | 3 |
| HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation | Feb 8, 2025 | Robot ManipulationVision-Language-Action | —Unverified | 0 |
| Survey on Vision-Language-Action Models | Feb 7, 2025 | Review GenerationSurvey | —Unverified | 0 |