SOTAVerified

Robot Manipulation

Papers

Showing 150 of 430 papers

TitleStatusHype
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
Geometry-aware 4D Video Generation for Robot Manipulation0
CapsDT: Diffusion-Transformer for Capsule Robot Manipulation0
Robust Instant Policy: Leveraging Student's t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation0
SENIOR: Efficient Query Selection and Preference-Guided Exploration in Preference-based Reinforcement Learning0
What Matters in Learning from Large-Scale Datasets for Robot Manipulation0
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success0
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models0
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World ModelCode1
OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation0
Bi-Manual Joint Camera Calibration and Scene Representation0
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation0
WorldEval: World Model as Real-World Robot Policies Evaluator0
Is Single-View Mesh Reconstruction Ready for Robotics?0
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation0
Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation0
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets0
Vid2World: Crafting Video Diffusion Models to Interactive World Models0
RLVR-World: Training World Models with Reinforcement LearningCode3
Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation0
Object-Centric Representations Improve Policy Generalization in Robot Manipulation0
Zero-Shot Visual Generalization in Robot Manipulation0
Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views0
LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution0
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation0
NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning0
EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation0
IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-TuningCode0
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation0
Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batchesCode1
From Seeing to Doing: Bridging Reasoning and Decision for Robotic ManipulationCode1
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real0
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsCode5
Efficient Sensorimotor Learning for Open-world Robot Manipulation0
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic ManipulationCode3
The Unreasonable Effectiveness of Discrete-Time Gaussian Process Mixtures for Robot Policy Learning0
Sim2Real Transfer for Vision-Based Grasp VerificationCode0
RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation0
SPECI: Skill Prompts based Hierarchical Continual Imitation Learning for Robot Manipulation0
Trajectory Adaptation using Large Language Models0
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation0
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionCode1
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation0
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning0
AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real WorldCode2
REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation0
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation0
DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data0
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action PolicyCode2
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from VideosCode3
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DreamVLAavg. sequence length (D to D)4.44Unverified
2VPPavg. sequence length (D to D)4.29Unverified
3RoboVLMsavg. sequence length (D to D)4.25Unverified
4Openhelixavg. sequence length (D to D)4.08Unverified
5UP-VLAavg. sequence length (D to D)4.08Unverified
6GR-MGavg. sequence length (D to D)4.04Unverified
7MoDEavg. sequence length (D to D)4.01Unverified
8RoboUniViewavg. sequence length (D to D)3.86Unverified
9UniVLAavg. sequence length (D to D)3.8Unverified
10RoboDualavg. sequence length (D to D)3.66Unverified
#ModelMetricClaimedVerifiedStatus
1EquActSucc. Rate (18 tasks, 100 demo/task)89.4Unverified
2SAM2ActSucc. Rate (18 tasks, 100 demo/task)86.8Unverified
3ARP+Succ. Rate (18 tasks, 100 demo/task)84.9Unverified
43D-LOTUSSucc. Rate (18 tasks, 100 demo/task)83.1Unverified
5RVT-2Succ. Rate (18 tasks, 100 demo/task)81.4Unverified
63D Diffuser ActorSucc. Rate (18 tasks, 100 demo/task)81.3Unverified
7Mini DiffuserSucc. Rate (18 tasks, 100 demo/task)77.6Unverified
8SAM-ESucc. Rate (18 tasks, 100 demo/task)70.6Unverified
9Auto-λSucc. Rate (10 tasks, 100 demos/task)69.3Unverified
10Act3DSucc. Rate (18 tasks, 100 demo/task)65Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarVisual Matching0.75Unverified
2SpatialVLAVisual Matching0.72Unverified
3Dita-300MVisual Matching0.69Unverified
4RT-2-XVisual Matching0.61Unverified
5RoboVLMVisual Matching0.56Unverified
6RT-1-XVisual Matching0.53Unverified
7TraceVLAVisual Matching0.46Unverified
8OpenVLAVisual Matching0.28Unverified
9Octo-BaseVisual Matching0.17Unverified
#ModelMetricClaimedVerifiedStatus
1SDPSucc. Rate (12 tasks, 100 demo/task)76Unverified
2EquiDiff (Voxel)Succ. Rate (12 tasks, 100 demo/task)63.9Unverified
3EquiDiff (Image)Succ. Rate (12 tasks, 100 demo/task)53.7Unverified
4DP (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)42Unverified
5DP3 (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)23.9Unverified
6BC RNN (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)22.9Unverified
7ACT (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)21.3Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarAverage0.58Unverified
2SpatialVLAAverage0.34Unverified
3Octo-SmallAverage0.3Unverified
4Octo-BaseAverage0.16Unverified
5RoboVLMAverage0.14Unverified
6RT-1-XAverage0.01Unverified
7OpenVLAAverage0.01Unverified