| RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Nov 20, 2024 | Image Generation, object-detection | Code Available | 2 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Nov 13, 2024 | 3D Object Detection, Denoising | Code Available | 2 |
| Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Nov 4, 2024 | Earth Observation, Object | Code Available | 2 |
| ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Oct 31, 2024 | 3D Object Detection, Depth Estimation | Code Available | 2 |
| MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Oct 25, 2024 | 3D Object Detection, Depth Estimation | Code Available | 2 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | Decoder, Instance Segmentation | Code Available | 2 |
| Multiview Scene Graph | Oct 15, 2024 | Decoder, Object | Code Available | 2 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental Learning, Object | Code Available | 2 |
| PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Oct 10, 2024 | object-detection, Object Detection | Code Available | 2 |
| 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Oct 2, 2024 | 3DGS, 3D Object Detection | Code Available | 2 |
| DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Sep 30, 2024 | 3D Object Detection, 3D Semantic Occupancy Prediction | Code Available | 2 |
| HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes | Sep 30, 2024 | Object, object-detection | Code Available | 2 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Sep 27, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2 |
| Source-Free Domain Adaptation for YOLO Object Detection | Sep 25, 2024 | Domain Adaptation, Model Selection | Code Available | 2 |
| RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Sep 18, 2024 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 |
| One missing piece in Vision and Language: A Survey on Comics Understanding | Sep 14, 2024 | document understanding, image-classification | Code Available | 2 |
| UniDet3D: Multi-dataset Indoor 3D Object Detection | Sep 6, 2024 | 3D Object Detection, Object | Code Available | 2 |
| UTrack: Multi-Object Tracking with Uncertain Detections | Aug 30, 2024 | Autonomous Driving, Multi-Object Tracking | Code Available | 2 |
| RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments | Aug 28, 2024 | Autonomous Driving, Autonomous Navigation | Code Available | 2 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Aug 15, 2024 | Object, object-detection | Code Available | 2 |
| Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Aug 8, 2024 | object-detection, Object Detection | Code Available | 2 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classification, Image Classification | Code Available | 2 |
| L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Aug 7, 2024 | 3D Object Detection, Autonomous Navigation | Code Available | 2 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignment, Multiple Object Tracking | Code Available | 2 |