| RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation | Jun 6, 2024 | Common Sense ReasoningMamba | —Unverified | 0 |
| Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning | May 31, 2024 | Action RecognitionContrastive Learning | CodeCode Available | 0 |
| LEGENT: Open Platform for Embodied Agents | Apr 28, 2024 | Vision-Language-Action | —Unverified | 0 |
| 3D-VLA: A 3D Vision-Language-Action Generative World Model | Mar 14, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| General-purpose foundation models for increased autonomy in robot-assisted surgery | Jan 1, 2024 | Vision-Language-Action | —Unverified | 0 |
| QUAR-VLA: Vision-Language-Action Model for Quadruped Robots | Dec 22, 2023 | Decision MakingVision-Language-Action | —Unverified | 0 |
| SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention | Dec 4, 2023 | Vision-Language-Action | —Unverified | 0 |