| Multi-Scale Fusion for Object Representation | Oct 2, 2024 | Object | Code Available | 1 |
| Improving Visual Object Tracking through Visual Prompting | Sep 27, 2024 | Object | Code Available | 1 |
| Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Sep 26, 2024 | 6D Pose Estimation, 6D Pose Estimation using RGB | Code Available | 1 |
| Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model | Sep 25, 2024 | 3D Reconstruction, Object | Code Available | 1 |
| Mind the Prompt: A Novel Benchmark for Prompt-based Class-Agnostic Counting | Sep 24, 2024 | Object, Object Counting | Code Available | 1 |
| LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation | Sep 24, 2024 | Object, Pose Estimation | Code Available | 1 |
| DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual Tasks | Sep 23, 2024 | ARC, Computational Efficiency | Code Available | 1 |
| STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Sep 17, 2024 | Multiple Object Tracking, Object | Code Available | 1 |
| Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown | Sep 14, 2024 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 1 |
| From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors | Sep 12, 2024 | Object | Code Available | 1 |
| LEROjD: Lidar Extended Radar-Only Object Detection | Sep 9, 2024 | 3D Object Detection, Knowledge Distillation | Code Available | 1 |
| Can OOD Object Detectors Learn from Foundation Models? | Sep 8, 2024 | Object, object-detection | Code Available | 1 |
| SSFam: Scribble Supervised Salient Object Detection Family | Sep 7, 2024 | Decoder, Object | Code Available | 1 |
| Frequency-Spatial Entanglement Learning for Camouflaged Object Detection | Sep 3, 2024 | Object, object-detection | Code Available | 1 |
| TrackSSM: A General Motion Predictor by State-Space Model | Aug 31, 2024 | Decoder, Mamba | Code Available | 1 |
| PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View | Aug 29, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Aug 29, 2024 | Object, Pose Estimation | Code Available | 1 |
| A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Aug 28, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Low-Light Object Tracking: A Benchmark | Aug 21, 2024 | Object, Object Tracking | Code Available | 1 |
| OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding | Aug 20, 2024 | Object, Scene Understanding | Code Available | 1 |
| PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Aug 17, 2024 | Adversarial Robustness, Benchmarking | Code Available | 1 |
| Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation | Aug 15, 2024 | 3D Reconstruction, Object | Code Available | 1 |
| DC3DO: Diffusion Classifier for 3D Objects | Aug 13, 2024 | 3D Object Classification, Classification | Code Available | 1 |
| Unified-IoU: For High-Quality Object Detection | Aug 13, 2024 | Object, object-detection | Code Available | 1 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement Learning, Object | Code Available | 1 |
| FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Aug 11, 2024 | Moving Object Detection, Object | Code Available | 1 |
| SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Aug 8, 2024 | Autonomous Vehicles, Object | Code Available | 1 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object Detection, Code Generation | Code Available | 1 |
| Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Aug 7, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera | Aug 6, 2024 | Object, Pose Estimation | Code Available | 1 |
| Visual Grounding for Object-Level Generalization in Reinforcement Learning | Aug 4, 2024 | Language Modelling, Object | Code Available | 1 |
| RoCo: Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Aug 1, 2024 | Autonomous Driving, Object | Code Available | 1 |
| MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Jul 31, 2024 | Language Modelling, Object | Code Available | 1 |
| Spatial Transformer Network YOLO Model for Agricultural Object Detection | Jul 31, 2024 | Object, object-detection | Code Available | 1 |
| StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset | Jul 30, 2024 | Human-Object Interaction Detection, Object | Code Available | 1 |
| 3D-GRES: Generalized 3D Referring Expression Segmentation | Jul 30, 2024 | Object, Referring Expression | Code Available | 1 |
| Monocular Human-Object Reconstruction in the Wild | Jul 30, 2024 | Diversity, Human-Object Interaction Detection | Code Available | 1 |
| Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Jul 30, 2024 | Inverse Rendering, NeRF | Code Available | 1 |
| Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing | Jul 25, 2024 | Object, Position | Code Available | 1 |
| ReCorD: Reasoning and Correcting Diffusion for HOI Generation | Jul 25, 2024 | Human-Object Interaction Generation, Image Generation | Code Available | 1 |
| What Matters in Range View 3D Object Detection | Jul 23, 2024 | 3D Object Detection, Object | Code Available | 1 |
| FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network | Jul 23, 2024 | Object, object-detection | Code Available | 1 |
| The Art of Imitation: Learning Long-Horizon Manipulation Tasks from Few Demonstrations | Jul 18, 2024 | Imitation Learning, Inductive Bias | Code Available | 1 |
| General Geometry-aware Weakly Supervised 3D Object Detection | Jul 18, 2024 | 3D Object Detection, Object | Code Available | 1 |
| CerberusDet: Unified Multi-Dataset Object Detection | Jul 17, 2024 | Object, object-detection | Code Available | 1 |
| NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Jul 16, 2024 | 6D Pose Estimation using RGB, Novel View Synthesis | Code Available | 1 |
| Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Jul 14, 2024 | 3D Instance Segmentation, Clustering | Code Available | 1 |
| Plain-Det: A Plain Multi-Dataset Object Detector | Jul 14, 2024 | Object, object-detection | Code Available | 1 |
| Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Jul 12, 2024 | Domain Generalization, Object | Code Available | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative Inference, Language Modelling | Code Available | 1 |