| Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding | Dec 31, 2024 | Robot Manipulation, Scene Understanding | Unverified | 0 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive Learning, Image Reconstruction | Code Available | 3 |
| Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models | Dec 18, 2024 | Representation Learning, Robot Manipulation | Code Available | 3 |
| RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation | Dec 18, 2024 | Diversity, Imitation Learning | Unverified | 0 |
| Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning | Dec 17, 2024 | Denoising | Code Available | 2 |
| Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | Dec 16, 2024 | Hallucination, Robot Manipulation | Code Available | 2 |
| ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation? | Dec 13, 2024 | Robot Manipulation | Unverified | 0 |
| TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies | Dec 13, 2024 | Robot Manipulation, Vision-Language-Action | Unverified | 0 |
| Grasp Diffusion Network: Learning Grasp Generators from Partial Point Clouds with Diffusion Models in SO(3)xR3 | Dec 11, 2024 | Collision Avoidance, Robot Manipulation | Unverified | 0 |
| P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies | Dec 9, 2024 | Out-of-Distribution Generalization, Robot Manipulation | Unverified | 0 |
| Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos | Dec 5, 2024 | Robot Manipulation | Code Available | 2 |
| Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning | Dec 4, 2024 | Data Augmentation, Imitation Learning | Unverified | 0 |
| From Mystery to Mastery: Failure Diagnosis for Improving Manipulation Policies | Dec 3, 2024 | Deep Reinforcement Learning, Robot Manipulation | Unverified | 0 |
| Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | Dec 2, 2024 | Autonomous Driving, Decision Making | Unverified | 0 |
| Prediction with Action: Visual Policy Learning via Joint Denoising Process | Nov 27, 2024 | Denoising, Image Generation | Unverified | 0 |
| RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Nov 25, 2024 | Robot Manipulation, Scene Understanding | Unverified | 0 |
| Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation | Nov 21, 2024 | Robot Manipulation | Unverified | 0 |
| Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Nov 18, 2024 | Knowledge Graphs, Robot Manipulation | Code Available | 0 |
| VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Nov 14, 2024 | Denoising, Robot Manipulation | Unverified | 0 |
| ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy | Nov 6, 2024 | Imitation Learning, Robot Manipulation | Unverified | 0 |
| RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation | Nov 5, 2024 | Robot Manipulation | Unverified | 0 |
| Improving Trust Estimation in Human-Robot Collaboration Using Beta Reputation at Fine-grained Timescales | Nov 4, 2024 | Bayesian Inference, Behavioural cloning | Code Available | 0 |
| DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Nov 4, 2024 | GPU, Robot Manipulation | Code Available | 2 |
| Non-rigid Relative Placement through 3D Dense Diffusion | Oct 25, 2024 | Object, Robot Manipulation | Unverified | 0 |
| Learning to Look: Seeking Information for Decision Making via Policy Factorization | Oct 24, 2024 | Decision Making, Robot Manipulation | Unverified | 0 |