| Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Oct 24, 2023 | 3D Object Detection, object-detection | Code Available | 3 |
| Salient Object Detection in RGB-D Videos | Oct 24, 2023 | Attribute, Object | Code Available | 1 |
| Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework | Oct 24, 2023 | Domain Adaptation, object-detection | Unverified | 0 |
| Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection | Oct 24, 2023 | Classification, Decoder | Unverified | 0 |
| CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting | Oct 24, 2023 | Image Segmentation, object-detection | Unverified | 0 |
| GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection | Oct 24, 2023 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 1 |
| Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA | Oct 23, 2023 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 1 |
| Pre-Training LiDAR-Based 3D Object Detectors Through Colorization | Oct 23, 2023 | 3D Object Detection, Autonomous Vehicles | Code Available | 0 |
| Rethinking Scale Imbalance in Semi-supervised Object Detection for Aerial Images | Oct 23, 2023 | object-detection, Object Detection | Unverified | 0 |
| MaRU: A Manga Retrieval and Understanding System Connecting Vision and Language | Oct 22, 2023 | Decoder, object-detection | Unverified | 0 |
| The Importance of Anti-Aliasing in Tiny Object Detection | Oct 22, 2023 | Object, object-detection | Code Available | 0 |
| Skipped Feature Pyramid Network with Grid Anchor for Object Detection | Oct 22, 2023 | Object, object-detection | Unverified | 0 |
| OV-VG: A Benchmark for Open-Vocabulary Visual Grounding | Oct 22, 2023 | Novel Concepts, object-detection | Code Available | 1 |
| Deep MDP: A Modular Framework for Multi-Object Tracking | Oct 22, 2023 | Multi-Object Tracking, Object | Code Available | 0 |
| Guidance system for Visually Impaired Persons using Deep Learning and Optical flow | Oct 22, 2023 | Depth Estimation, Object | Unverified | 0 |
| Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images | Oct 21, 2023 | Earth Observation, Object | Code Available | 1 |
| Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS | Oct 21, 2023 | 3D Object Detection, object-detection | Unverified | 0 |
| A review of individual tree crown detection and delineation from optical remote sensing images | Oct 20, 2023 | Deep Learning, Image Segmentation | Unverified | 0 |
| EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View | Oct 20, 2023 | 3D Object Detection, Multi-Object Tracking | Code Available | 1 |
| ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map Construction | Oct 20, 2023 | 3D Lane Detection, object-detection | Code Available | 1 |
| Zone Evaluation: Revealing Spatial Bias in Object Detection | Oct 20, 2023 | Object, object-detection | Code Available | 1 |
| Multi‑camera trajectory matching based on hierarchical clustering and constraints | Oct 19, 2023 | Attribute, Autonomous Driving | Code Available | 1 |
| RTNH+: Enhanced 4D Radar Object Detection Network using Combined CFAR-based Two-level Preprocessing and Vertical Encoding | Oct 19, 2023 | 3D Object Detection, Object | Unverified | 0 |
| DT/MARS-CycleGAN: Improved Object Detection for MARS Phenotyping Robot | Oct 19, 2023 | Image Augmentation, Object | Unverified | 0 |
| Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond | Oct 19, 2023 | Image Captioning, Language Modeling | Unverified | 0 |