| GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Feb 13, 2025 | Contrastive LearningVideo Generation | —Unverified | 0 | 0 |
| GR00T N1: An Open Foundation Model for Generalist Humanoid Robots | Mar 18, 2025 | Imitation LearningVision-Language-Action | —Unverified | 0 | 0 |
| GRAPE: Generalizing Robot Policy via Preference Alignment | Nov 28, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| Grounding Multimodal LLMs to Embodied Agents that Ask for Help with Reinforcement Learning | Apr 1, 2025 | Reinforcement Learning (RL)Vision-Language-Action | —Unverified | 0 | 0 |
| HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation | Feb 8, 2025 | Robot ManipulationVision-Language-Action | —Unverified | 0 | 0 |
| Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models | Feb 26, 2025 | Instruction FollowingVision-Language-Action | —Unverified | 0 | 0 |
| HiRT: Enhancing Robotic Control with Hierarchical Robot Transformers | Sep 12, 2024 | Vision-Language-Action | —Unverified | 0 | 0 |
| Hume: Introducing System-2 Thinking in Visual-Language-Action Model | May 27, 2025 | DenoisingVision-Language-Action | —Unverified | 0 | 0 |
| HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model | Mar 13, 2025 | Common Sense ReasoningDenoising | —Unverified | 0 | 0 |
| Improving Vision-Language-Action Model with Online Reinforcement Learning | Jan 28, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |