| Segment Any 3D Object with Language | Apr 2, 2024 | 3D Instance SegmentationDecoder | —Unverified | 0 |
| Task Integration Distillation for Object Detectors | Apr 2, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| What is Point Supervision Worth in Video Instance Segmentation? | Apr 1, 2024 | Instance SegmentationObject | —Unverified | 0 |
| ContactHandover: Contact-Guided Robot-to-Human Object Handover | Apr 1, 2024 | Object | —Unverified | 0 |
| Object-conditioned Bag of Instances for Few-Shot Personalized Instance Recognition | Apr 1, 2024 | Objectobject-detection | —Unverified | 0 |
| SUGAR: Pre-training 3D Visual Representations for Robotics | Apr 1, 2024 | 3D Instance Segmentation3D Object Recognition | —Unverified | 0 |
| Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs | Apr 1, 2024 | Common Sense ReasoningObject | —Unverified | 0 |
| Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts | Apr 1, 2024 | Objectobject-detection | —Unverified | 0 |
| Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining | Mar 31, 2024 | Forgery Image DetectionObject | —Unverified | 0 |
| Weak-to-Strong 3D Object Detection with X-Ray Distillation | Mar 31, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Constrained Layout Generation with Factor Graphs | Mar 30, 2024 | Graph Neural NetworkLayout Generation | —Unverified | 0 |
| Cognitive Planning for Object Goal Navigation using Generative AI Models | Mar 30, 2024 | Efficient ExplorationIn-Context Learning | —Unverified | 0 |
| HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment | Mar 30, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Mar 29, 2024 | Autonomous DrivingObject | CodeCode Available | 0 |
| GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Mar 28, 2024 | Object | —Unverified | 0 |
| OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion | Mar 28, 2024 | Motion SynthesisObject | —Unverified | 0 |
| RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation | Mar 28, 2024 | Imitation LearningObject | —Unverified | 0 |
| Algorithmic Ways of Seeing: Using Object Detection to Facilitate Art Exploration | Mar 28, 2024 | Objectobject-detection | —Unverified | 0 |
| ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Mar 27, 2024 | counterfactualObject | —Unverified | 0 |
| FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing | Mar 27, 2024 | DenoisingObject | —Unverified | 0 |
| Enhancing Multiple Object Tracking Accuracy via Quantum Annealing | Mar 27, 2024 | ManagementMultiple Object Tracking | —Unverified | 0 |
| Tracking-Assisted Object Detection with Event Cameras | Mar 27, 2024 | AttributeObject | CodeCode Available | 0 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 |
| Online Embedding Multi-Scale CLIP Features into 3D Maps | Mar 27, 2024 | ObjectRetrieval | —Unverified | 0 |
| BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection | Mar 27, 2024 | Objectobject-detection | —Unverified | 0 |
| Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation | Mar 26, 2024 | ObjectRobot Navigation | —Unverified | 0 |
| Exploring Dynamic Transformer for Efficient Object Tracking | Mar 26, 2024 | ObjectObject Tracking | —Unverified | 0 |
| SpectralWaste Dataset: Multimodal Data for Waste Sorting Automation | Mar 26, 2024 | ManagementObject | —Unverified | 0 |
| DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Mar 26, 2024 | Object | —Unverified | 0 |
| Co-Occurring of Object Detection and Identification towards unlabeled object discovery | Mar 25, 2024 | Objectobject-detection | —Unverified | 0 |
| ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation | Mar 25, 2024 | 6D Pose EstimationObject | CodeCode Available | 0 |
| Comp4D: LLM-Guided Compositional 4D Scene Generation | Mar 25, 2024 | ObjectScene Generation | —Unverified | 0 |
| Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Mar 25, 2024 | 3D GenerationObject | CodeCode Available | 0 |
| Data-Efficient 3D Visual Grounding via Order-Aware Referring | Mar 25, 2024 | 3D visual groundingObject | —Unverified | 0 |
| DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding | Mar 25, 2024 | DecoderObject | CodeCode Available | 0 |
| V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster | Mar 25, 2024 | Object | —Unverified | 0 |
| Toward Open-Set Human Object Interaction Detection | Mar 24, 2024 | Contrastive LearningHuman-Object Interaction Detection | CodeCode Available | 0 |
| Cross-domain Multi-modal Few-shot Object Detection via Rich Text | Mar 24, 2024 | Cross-Domain Few-ShotDomain Adaptation | CodeCode Available | 0 |
| Gaze-guided Hand-Object Interaction Synthesis: Dataset and Method | Mar 24, 2024 | DenoisingHuman motion prediction | —Unverified | 0 |
| Fusion of Active and Passive Measurements for Robust and Scalable Positioning | Mar 24, 2024 | Object | —Unverified | 0 |
| Realtime Robust Shape Estimation of Deformable Linear Object | Mar 24, 2024 | ObjectUnity | —Unverified | 0 |
| Towards Two-Stream Foveation-based Active Vision Learning | Mar 24, 2024 | FoveationObject | —Unverified | 0 |
| Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Mar 24, 2024 | Inverse RenderingNeRF | —Unverified | 0 |
| Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation | Mar 23, 2024 | NavigateObject | —Unverified | 0 |
| Inpainting-Driven Mask Optimization for Object Removal | Mar 23, 2024 | Image InpaintingObject | —Unverified | 0 |
| Reasoning-Enhanced Object-Centric Learning for Videos | Mar 22, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization | Mar 22, 2024 | ObjectPose Estimation | CodeCode Available | 0 |
| PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation | Mar 22, 2024 | ObjectObject Recognition | —Unverified | 0 |
| Survey on Modeling of Human-made Articulated Objects | Mar 22, 2024 | ObjectSurvey | —Unverified | 0 |
| VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation | Mar 21, 2024 | ObjectPose Estimation | —Unverified | 0 |