SOTAVerified

Robot Manipulation

Papers

Showing 76100 of 430 papers

TitleStatusHype
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding0
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual RepresentationsCode3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action ModelsCode3
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation0
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask LearningCode2
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial ReasoningCode2
ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?0
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies0
Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR30
P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies0
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from VideosCode2
Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning0
From Mystery to Mastery: Failure Diagnosis for Improving Manipulation Policies0
Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control0
Prediction with Action: Visual Policy Learning via Joint Denoising Process0
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics0
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation0
Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile RepresentationCode0
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation0
ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy0
RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation0
Improving Trust Estimation in Human-Robot Collaboration Using Beta Reputation at Fine-grained TimescalesCode0
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot ExecutionCode2
Non-rigid Relative Placement through 3D Dense Diffusion0
Learning to Look: Seeking Information for Decision Making via Policy Factorization0
Show:102550
← PrevPage 4 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DreamVLAavg. sequence length (D to D)4.44Unverified
2VPPavg. sequence length (D to D)4.29Unverified
3RoboVLMsavg. sequence length (D to D)4.25Unverified
4Openhelixavg. sequence length (D to D)4.08Unverified
5UP-VLAavg. sequence length (D to D)4.08Unverified
6GR-MGavg. sequence length (D to D)4.04Unverified
7MoDEavg. sequence length (D to D)4.01Unverified
8RoboUniViewavg. sequence length (D to D)3.86Unverified
9UniVLAavg. sequence length (D to D)3.8Unverified
10RoboDualavg. sequence length (D to D)3.66Unverified
#ModelMetricClaimedVerifiedStatus
1EquActSucc. Rate (18 tasks, 100 demo/task)89.4Unverified
2SAM2ActSucc. Rate (18 tasks, 100 demo/task)86.8Unverified
3ARP+Succ. Rate (18 tasks, 100 demo/task)84.9Unverified
43D-LOTUSSucc. Rate (18 tasks, 100 demo/task)83.1Unverified
5RVT-2Succ. Rate (18 tasks, 100 demo/task)81.4Unverified
63D Diffuser ActorSucc. Rate (18 tasks, 100 demo/task)81.3Unverified
7Mini DiffuserSucc. Rate (18 tasks, 100 demo/task)77.6Unverified
8SAM-ESucc. Rate (18 tasks, 100 demo/task)70.6Unverified
9Auto-λSucc. Rate (10 tasks, 100 demos/task)69.3Unverified
10Act3DSucc. Rate (18 tasks, 100 demo/task)65Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarVisual Matching0.75Unverified
2SpatialVLAVisual Matching0.72Unverified
3Dita-300MVisual Matching0.69Unverified
4RT-2-XVisual Matching0.61Unverified
5RoboVLMVisual Matching0.56Unverified
6RT-1-XVisual Matching0.53Unverified
7TraceVLAVisual Matching0.46Unverified
8OpenVLAVisual Matching0.28Unverified
9Octo-BaseVisual Matching0.17Unverified
#ModelMetricClaimedVerifiedStatus
1SDPSucc. Rate (12 tasks, 100 demo/task)76Unverified
2EquiDiff (Voxel)Succ. Rate (12 tasks, 100 demo/task)63.9Unverified
3EquiDiff (Image)Succ. Rate (12 tasks, 100 demo/task)53.7Unverified
4DP (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)42Unverified
5DP3 (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)23.9Unverified
6BC RNN (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)22.9Unverified
7ACT (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)21.3Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarAverage0.58Unverified
2SpatialVLAAverage0.34Unverified
3Octo-SmallAverage0.3Unverified
4Octo-BaseAverage0.16Unverified
5RoboVLMAverage0.14Unverified
6RT-1-XAverage0.01Unverified
7OpenVLAAverage0.01Unverified