| Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark | May 1, 2024 | Object | CodeCode Available | 1 |
| Retrieval Robust to Object Motion Blur | Apr 27, 2024 | ObjectRetrieval | CodeCode Available | 1 |
| ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion | Apr 26, 2024 | Image InpaintingObject | CodeCode Available | 1 |
| Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection | Apr 24, 2024 | 3D Object DetectionObject | CodeCode Available | 1 |
| Unified Unsupervised Salient Object Detection via Knowledge Transfer | Apr 23, 2024 | Objectobject-detection | CodeCode Available | 1 |
| The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models | Apr 18, 2024 | Instance SegmentationObject | CodeCode Available | 1 |
| Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Apr 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Apr 12, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models | Apr 11, 2024 | AttributeObject | CodeCode Available | 1 |
| DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker | Apr 8, 2024 | Camera Pose EstimationMulti-Object Tracking | CodeCode Available | 1 |
| Retrieval-Augmented Open-Vocabulary Object Detection | Apr 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Detecting Every Object from Events | Apr 8, 2024 | Autonomous DrivingClass-agnostic Object Detection | CodeCode Available | 1 |
| Self-Supervised Multi-Object Tracking with Path Consistency | Apr 8, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Apr 7, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers | Apr 5, 2024 | 2D Object Detection2D Tiny Object Detection | CodeCode Available | 1 |
| FlightScope: An Experimental Comparative Review of Aircraft Detection Algorithms in Satellite Imagery | Apr 3, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Event-assisted Low-Light Video Object Segmentation | Apr 2, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| Disentangled Pre-training for Human-Object Interaction Detection | Apr 2, 2024 | Action RecognitionDecoder | CodeCode Available | 1 |
| VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection | Mar 29, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |
| Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Mar 28, 2024 | HTRObject | CodeCode Available | 1 |
| Benchmarking Object Detectors with COCO: A New Path Forward | Mar 27, 2024 | BenchmarkingObject | CodeCode Available | 1 |
| DODA: Diffusion for Object-detection Domain Adaptation in Agriculture | Mar 27, 2024 | Domain AdaptationHead Detection | CodeCode Available | 1 |
| Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Mar 26, 2024 | ObjectSound Source Localization | CodeCode Available | 1 |
| Object Detectors in the Open Environment: Challenges, Solutions, and Outlook | Mar 24, 2024 | Incremental LearningObject | CodeCode Available | 1 |
| SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction | Mar 23, 2024 | 3D Object Reconstruction3D Reconstruction | CodeCode Available | 1 |
| PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search | Mar 23, 2024 | Autonomous DrivingMultiple Object Tracking | CodeCode Available | 1 |
| SFOD: Spiking Fusion Object Detector | Mar 22, 2024 | Objectobject-detection | CodeCode Available | 1 |
| VRSO: Visual-Centric Reconstruction for Static Object Annotation | Mar 22, 2024 | Objectobject-detection | CodeCode Available | 1 |
| DVMNet++: Rethinking Relative Pose Estimation for Unseen Objects | Mar 20, 2024 | Natural Language UnderstandingObject | CodeCode Available | 1 |
| Few-shot Object Localization | Mar 19, 2024 | Model OptimizationObject | CodeCode Available | 1 |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Mar 19, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Prioritized Semantic Learning for Zero-shot Instance Navigation | Mar 18, 2024 | Language ModellingObject | CodeCode Available | 1 |
| Video Object Segmentation with Dynamic Query Modulation | Mar 18, 2024 | ObjectSegmentation | CodeCode Available | 1 |
| Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object Retrieval | Mar 16, 2024 | Metric LearningObject | CodeCode Available | 1 |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Mar 12, 2024 | Autonomous DrivingConformal Prediction | CodeCode Available | 1 |
| FSC: Few-point Shape Completion | Mar 12, 2024 | DecoderObject | CodeCode Available | 1 |
| Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors | Mar 12, 2024 | ObjectPseudo Label | CodeCode Available | 1 |
| Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer | Mar 11, 2024 | AnatomyDisentanglement | CodeCode Available | 1 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors | Mar 10, 2024 | 3D Object DetectionObject | CodeCode Available | 1 |
| RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features | Mar 8, 2024 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | Mar 8, 2024 | Few-Shot Object DetectionObject | CodeCode Available | 1 |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images | Mar 7, 2024 | 3D Object Detection3D Reconstruction | CodeCode Available | 1 |
| ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes | Mar 7, 2024 | Image to textObject | CodeCode Available | 1 |
| FriendNet: Detection-Friendly Dehazing Network | Mar 7, 2024 | Autonomous DrivingImage Dehazing | CodeCode Available | 1 |
| MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding | Mar 5, 2024 | 3D visual groundingDecision Making | CodeCode Available | 1 |
| A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection | Feb 29, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Aligning Knowledge Graph with Visual Perception for Object-goal Navigation | Feb 29, 2024 | Object | CodeCode Available | 1 |
| EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving | Feb 28, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 1 |
| OSCaR: Object State Captioning and State Change Representation | Feb 27, 2024 | Change DetectionObject | CodeCode Available | 1 |