SOTAVerified

Vision-Language-Action

Papers

Showing 101125 of 157 papers

TitleStatusHype
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation0
ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis0
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model0
MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models0
Refined Policy Distillation: From VLA Generalists to RL Experts0
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction0
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning0
Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding0
A Taxonomy for Evaluating Generalist Robot Policies0
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping0
ObjectVLA: End-to-End Open-World Object Manipulation Without Demonstration0
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models0
Evolution 6.0: Evolving Robotic Capabilities Through Generative Design0
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation0
HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation0
Survey on Vision-Language-Action Models0
Probing a Vision-Language-Action Model for Symbolic States and Integration into a Cognitive Architecture0
VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation0
UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent0
Improving Vision-Language-Action Model with Online Reinforcement Learning0
FAST: Efficient Action Tokenization for Vision-Language-Action Models0
Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding0
Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches0
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation0
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters0
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.