SOTAVerified

Robot Manipulation

Papers

Showing 51100 of 430 papers

TitleStatusHype
ARC-Calib: Autonomous Markerless Camera-to-Robot Calibration via Exploratory Robot Motions0
ReBot: Scaling Robot Learning with Real-to-Sim-to-Real Robotic Video Synthesis0
Is Your Imitation Learning Policy Better than Mine? Policy Comparison with Near-Optimal Stopping0
Deformable Linear Object Surface Placement Using Elastica Planning and Local Shape Control0
Combined Physics and Event Camera Simulator for Slip DetectionCode0
A Taxonomy for Evaluating Generalist Robot Policies0
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation LearningCode1
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids0
Enhancing Reusability of Learned Skills for Robot Manipulation via Gaze and Bottleneck0
ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action ModelCode1
Magma: A Foundation Model for Multimodal AI AgentsCode5
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object ManipulationCode3
Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning0
S^2-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation0
COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping0
Select before Act: Spatially Decoupled Action Repetition for Continuous Control0
HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation0
AnyPlace: Learning Generalized Object Placement for Robot Manipulation0
Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation0
UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent0
Strong and Controllable 3D Motion Generation0
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model0
Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding0
Learning to Transfer Human Hand Skills for Robot Manipulations0
3D-MVP: 3D Multiview Pretraining for Manipulation0
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding0
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual RepresentationsCode3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action ModelsCode3
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation0
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask LearningCode2
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial ReasoningCode2
ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?0
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies0
Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR30
P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies0
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from VideosCode2
Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning0
From Mystery to Mastery: Failure Diagnosis for Improving Manipulation Policies0
Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control0
Prediction with Action: Visual Policy Learning via Joint Denoising Process0
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics0
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation0
Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile RepresentationCode0
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation0
ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy0
RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation0
Improving Trust Estimation in Human-Robot Collaboration Using Beta Reputation at Fine-grained TimescalesCode0
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot ExecutionCode2
Non-rigid Relative Placement through 3D Dense Diffusion0
Learning to Look: Seeking Information for Decision Making via Policy Factorization0
Show:102550
← PrevPage 2 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DreamVLAavg. sequence length (D to D)4.44Unverified
2VPPavg. sequence length (D to D)4.29Unverified
3RoboVLMsavg. sequence length (D to D)4.25Unverified
4Openhelixavg. sequence length (D to D)4.08Unverified
5UP-VLAavg. sequence length (D to D)4.08Unverified
6GR-MGavg. sequence length (D to D)4.04Unverified
7MoDEavg. sequence length (D to D)4.01Unverified
8RoboUniViewavg. sequence length (D to D)3.86Unverified
9UniVLAavg. sequence length (D to D)3.8Unverified
10RoboDualavg. sequence length (D to D)3.66Unverified
#ModelMetricClaimedVerifiedStatus
1EquActSucc. Rate (18 tasks, 100 demo/task)89.4Unverified
2SAM2ActSucc. Rate (18 tasks, 100 demo/task)86.8Unverified
3ARP+Succ. Rate (18 tasks, 100 demo/task)84.9Unverified
43D-LOTUSSucc. Rate (18 tasks, 100 demo/task)83.1Unverified
5RVT-2Succ. Rate (18 tasks, 100 demo/task)81.4Unverified
63D Diffuser ActorSucc. Rate (18 tasks, 100 demo/task)81.3Unverified
7Mini DiffuserSucc. Rate (18 tasks, 100 demo/task)77.6Unverified
8SAM-ESucc. Rate (18 tasks, 100 demo/task)70.6Unverified
9Auto-λSucc. Rate (10 tasks, 100 demos/task)69.3Unverified
10Act3DSucc. Rate (18 tasks, 100 demo/task)65Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarVisual Matching0.75Unverified
2SpatialVLAVisual Matching0.72Unverified
3Dita-300MVisual Matching0.69Unverified
4RT-2-XVisual Matching0.61Unverified
5RoboVLMVisual Matching0.56Unverified
6RT-1-XVisual Matching0.53Unverified
7TraceVLAVisual Matching0.46Unverified
8OpenVLAVisual Matching0.28Unverified
9Octo-BaseVisual Matching0.17Unverified
#ModelMetricClaimedVerifiedStatus
1SDPSucc. Rate (12 tasks, 100 demo/task)76Unverified
2EquiDiff (Voxel)Succ. Rate (12 tasks, 100 demo/task)63.9Unverified
3EquiDiff (Image)Succ. Rate (12 tasks, 100 demo/task)53.7Unverified
4DP (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)42Unverified
5DP3 (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)23.9Unverified
6BC RNN (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)22.9Unverified
7ACT (Evaluated in EquiDiff)Succ. Rate (12 tasks, 100 demo/task)21.3Unverified
#ModelMetricClaimedVerifiedStatus
1SoFarAverage0.58Unverified
2SpatialVLAAverage0.34Unverified
3Octo-SmallAverage0.3Unverified
4Octo-BaseAverage0.16Unverified
5RoboVLMAverage0.14Unverified
6RT-1-XAverage0.01Unverified
7OpenVLAAverage0.01Unverified