| SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction | Jul 21, 2025 | ObjectSegmentation | —Unverified | 0 |
| AutoPartGen: Autogressive 3D Part Generation and Discovery | Jul 17, 2025 | 3D Generation3D Reconstruction | —Unverified | 0 |
| A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Jul 17, 2025 | Action RecognitionHand-Object Interaction Detection | —Unverified | 0 |
| A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning | Jul 16, 2025 | ObjectPoint Cloud Registration | —Unverified | 0 |
| RoHOI: Robustness Benchmark for Human-Object Interaction Detection | Jul 12, 2025 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework | Jul 11, 2025 | ClusteringCrowd Counting | CodeCode Available | 0 |
| MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | Jul 10, 2025 | NeRFObject | —Unverified | 0 |
| ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge | Jul 8, 2025 | Edge-computingObject | —Unverified | 0 |
| DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation | Jul 8, 2025 | 3D geometry3D Reconstruction | —Unverified | 0 |
| EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow | Jul 8, 2025 | Deformable Object ManipulationImitation Learning | —Unverified | 0 |
| Dyn-O: Building Structured World Models with Object-Centric Representations | Jul 4, 2025 | Object | —Unverified | 0 |
| LMPNet for Weakly-supervised Keypoint Discovery | Jul 3, 2025 | ObjectPose Estimation | —Unverified | 0 |
| NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation | Jul 2, 2025 | Instance SegmentationObject | CodeCode Available | 0 |
| DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World | Jun 30, 2025 | Caption GenerationObject | CodeCode Available | 2 |
| Refine Any Object in Any Scene | Jun 30, 2025 | Novel View SynthesisObject | CodeCode Available | 1 |
| Deterministic Object Pose Confidence Region Estimation | Jun 28, 2025 | Conformal PredictionObject | —Unverified | 0 |
| Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | Jun 28, 2025 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| Controllable 3D Placement of Objects with Scene-Aware Diffusion Models | Jun 26, 2025 | Object | —Unverified | 0 |
| DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic | Jun 26, 2025 | Autonomous DrivingAvg | —Unverified | 0 |
| SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification | Jun 26, 2025 | 3D Object RetrievalObject | —Unverified | 0 |
| Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Jun 26, 2025 | Objectobject-detection | CodeCode Available | 0 |
| PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling | Jun 26, 2025 | ObjectObject Reconstruction | —Unverified | 0 |
| DreamAnywhere: Object-Centric Panoramic 3D Scene Generation | Jun 25, 2025 | Novel View SynthesisObject | —Unverified | 0 |
| Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception | Jun 24, 2025 | ObjectPose Estimation | CodeCode Available | 0 |
| USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways | Jun 23, 2025 | Autonomous DrivingObject | —Unverified | 0 |
| Class Agnostic Instance-level Descriptor for Visual Instance Search | Jun 20, 2025 | Content-Based Image RetrievalImage Retrieval | —Unverified | 0 |
| Learning Dexterous Object Handover | Jun 20, 2025 | ObjectReinforcement Learning (RL) | —Unverified | 0 |
| RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Jun 20, 2025 | 6D Pose EstimationObject | CodeCode Available | 2 |
| Retrospective Memory for Camouflaged Object Detection | Jun 18, 2025 | Objectobject-detection | —Unverified | 0 |
| MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Jun 18, 2025 | ObjectObject SLAM | —Unverified | 0 |
| Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos | Jun 18, 2025 | Object | —Unverified | 0 |
| Object-Centric Neuro-Argumentative Learning | Jun 17, 2025 | Deep LearningObject | CodeCode Available | 0 |
| Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection | Jun 16, 2025 | Computational EfficiencyObject | —Unverified | 0 |
| FOAM: A General Frequency-Optimized Anti-Overlapping Framework for Overlapping Object Perception | Jun 16, 2025 | ObjectPneumonia Detection | —Unverified | 0 |
| JENGA: Object selection and pose estimation for robotic grasping from a stack | Jun 16, 2025 | BenchmarkingObject | —Unverified | 0 |
| Generative 4D Scene Gaussian Splatting with Object View-Synthesis Priors | Jun 15, 2025 | Novel View SynthesisObject | —Unverified | 0 |
| iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer | Jun 15, 2025 | ObjectVideo Generation | —Unverified | 0 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Jun 15, 2025 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| Vision-based Lifting of 2D Object Detections for Automated Driving | Jun 13, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| SPLATART: Articulated Gaussian Splatting with Estimated Object Structure | Jun 13, 2025 | Object | —Unverified | 0 |
| ViTaSCOPE: Visuo-tactile Implicit Representation for In-hand Pose and Extrinsic Contact Estimation | Jun 13, 2025 | 3D geometryObject | —Unverified | 0 |
| Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders | Jun 12, 2025 | hand-object poseObject | —Unverified | 0 |
| SlotPi: Physics-informed Object-centric Reasoning Models | Jun 12, 2025 | ObjectQuestion Answering | CodeCode Available | 0 |
| Efficient Part-level 3D Object Generation via Dual Volume Packing | Jun 11, 2025 | DiversityObject | CodeCode Available | 4 |
| Scoop-and-Toss: Dynamic Object Collection for Quadrupedal Systems | Jun 11, 2025 | Object | —Unverified | 0 |
| UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation | Jun 10, 2025 | DecoderImitation Learning | —Unverified | 0 |
| ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Jun 10, 2025 | Objectobject-detection | —Unverified | 0 |
| ORIDa: Object-centric Real-world Image Composition Dataset | Jun 10, 2025 | counterfactualObject | —Unverified | 0 |
| Orientation Matters: Making 3D Generative Models Orientation-Aligned | Jun 10, 2025 | Object | —Unverified | 0 |
| Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection | Jun 10, 2025 | Class-Incremental Object DetectionObject | —Unverified | 0 |