| OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation | Jun 1, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |
| Bi-Manual Joint Camera Calibration and Scene Representation | May 30, 2025 | Camera CalibrationRobot Manipulation | —Unverified | 0 |
| PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation | May 27, 2025 | Instruction FollowingObject | —Unverified | 0 |
| WorldEval: World Model as Real-World Robot Policies Evaluator | May 25, 2025 | Robot ManipulationVideo Generation | —Unverified | 0 |
| Is Single-View Mesh Reconstruction Ready for Robotics? | May 23, 2025 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | May 22, 2025 | 3D geometryRobot Manipulation | —Unverified | 0 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 |
| Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation | May 21, 2025 | ObjectPose Estimation | —Unverified | 0 |
| Vid2World: Crafting Video Diffusion Models to Interactive World Models | May 20, 2025 | Robot ManipulationSequential Decision Making | —Unverified | 0 |
| Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation | May 19, 2025 | Multimodal ReasoningRobot Manipulation | —Unverified | 0 |
| Object-Centric Representations Improve Policy Generalization in Robot Manipulation | May 16, 2025 | Optical Character Recognition (OCR)Robot Manipulation | —Unverified | 0 |
| Exploiting Radiance Fields for Grasp Generation on Novel Synthetic Views | May 16, 2025 | Grasp GenerationNovel View Synthesis | —Unverified | 0 |
| Zero-Shot Visual Generalization in Robot Manipulation | May 16, 2025 | Imitation LearningRepresentation Learning | —Unverified | 0 |
| NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning | May 15, 2025 | Novel View SynthesisRobot Manipulation | —Unverified | 0 |
| EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation | May 15, 2025 | Robot Manipulation | —Unverified | 0 |
| LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution | May 15, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning | May 15, 2025 | Efficient ExplorationImitation Learning | CodeCode Available | 0 |
| FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation | May 15, 2025 | Robot ManipulationSemantic Similarity | —Unverified | 0 |
| ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation | May 14, 2025 | BenchmarkingDeformable Object Manipulation | —Unverified | 0 |
| X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real | May 11, 2025 | Domain AdaptationImitation Learning | —Unverified | 0 |
| Efficient Sensorimotor Learning for Open-world Robot Manipulation | May 7, 2025 | Robot Manipulation | —Unverified | 0 |
| The Unreasonable Effectiveness of Discrete-Time Gaussian Process Mixtures for Robot Policy Learning | May 6, 2025 | CPUGaussian Processes | —Unverified | 0 |
| Sim2Real Transfer for Vision-Based Grasp Verification | May 5, 2025 | Objectobject-detection | CodeCode Available | 0 |
| RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation | May 3, 2025 | Robot Manipulation | —Unverified | 0 |
| SPECI: Skill Prompts based Hierarchical Continual Imitation Learning for Robot Manipulation | Apr 22, 2025 | Action GenerationImitation Learning | —Unverified | 0 |