| SAM 2: Segment Anything in Images and Videos | Aug 1, 2024 | Image SegmentationRobot Manipulation Generalization | CodeCode Available | 11 | 5 |
| Segment Anything | Apr 5, 2023 | Event-based Object SegmentationImage Segmentation | CodeCode Available | 5 | 5 |
| Segment Anything for Videos: A Systematic Survey | Jul 31, 2024 | Image SegmentationRobot Manipulation Generalization | CodeCode Available | 5 | 5 |
| Evaluating Real-World Robot Manipulation Policies in Simulation | May 9, 2024 | Robotic GraspingRobot Manipulation | CodeCode Available | 5 | 5 |
| SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Aug 8, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 5 | 5 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 18, 2024 | DenoisingRobot Manipulation | CodeCode Available | 3 | 5 |
| RVT-2: Learning Precise Manipulation from Few Demonstrations | Jun 12, 2024 | Robot ManipulationRobot Manipulation Generalization | CodeCode Available | 3 | 5 |
| Masked Visual Pre-training for Motor Control | Mar 11, 2022 | Robot Manipulation GeneralizationState Estimation | CodeCode Available | 2 | 5 |
| Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation | Sep 12, 2022 | Robot ManipulationRobot Manipulation Generalization | CodeCode Available | 2 | 5 |
| THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation | Feb 13, 2024 | Robot Manipulation Generalization | CodeCode Available | 2 | 5 |
| R3M: A Universal Visual Representation for Robot Manipulation | Mar 23, 2022 | Contrastive LearningRobot Manipulation | CodeCode Available | 2 | 5 |
| Generative Image as Action Models | Jul 10, 2024 | Image GenerationRobot Manipulation | CodeCode Available | 2 | 5 |
| RVT: Robotic View Transformer for 3D Object Manipulation | Jun 26, 2023 | ObjectRobot Manipulation | CodeCode Available | 2 | 5 |
| Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Oct 2, 2024 | Motion PlanningRobot Manipulation | CodeCode Available | 2 | 5 |
| PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation | Sep 27, 2023 | Multi-Task LearningRobot Manipulation | CodeCode Available | 1 | 5 |
| Instruction-driven history-aware policies for robotic manipulations | Sep 11, 2022 | Robot ManipulationRobot Manipulation Generalization | CodeCode Available | 1 | 5 |
| Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts | Sep 10, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 | 5 |
| 0/1 Deep Neural Networks via Block Coordinate Descent | Jun 19, 2022 | 10-shot image generation | —Unverified | 0 | 0 |
| Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware | Apr 23, 2023 | ChunkingImitation Learning | —Unverified | 0 | 0 |