| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jun 11, 2024 | Grounded Multimodal Named Entity Recognition, named-entity-recognition | Code Available | 1 |
| Do text-free diffusion models learn discriminative visual representations? | Nov 29, 2023 | image-classification, Image Classification | Code Available | 1 |
| Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images | Mar 25, 2023 | object-detection, Object Detection | Code Available | 1 |
| DPNet: Dual-Path Network for Real-time Object Detection with Lightweight Attention | Sep 28, 2022 | object-detection, Object Detection | Code Available | 1 |
| Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation | Jul 23, 2023 | Instance Segmentation, Object | Code Available | 1 |
| ASOD60K: An Audio-Induced Salient Object Detection Dataset for Panoramic Videos | Jul 24, 2021 | 4k, Object | Code Available | 1 |
| Advancing Referring Expression Segmentation Beyond Single Image | May 21, 2023 | Co-Salient Object Detection, Object | Code Available | 1 |
| Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | Jul 11, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles | Oct 21, 2020 | Object, object-detection | Code Available | 1 |
| DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Oct 23, 2024 | Image Restoration, Object | Code Available | 1 |
| A Structure-Aware Relation Network for Thoracic Diseases Detection and Segmentation | Apr 21, 2021 | Instance Segmentation, Object Detection | Code Available | 1 |
| DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Feb 28, 2022 | GPU, Instance Segmentation | Code Available | 1 |
| Advancing Self-supervised Monocular Depth Learning with Sparse LiDAR | Sep 20, 2021 | 3D Object Detection, Depth Completion | Code Available | 1 |
| DSGN: Deep Stereo Geometry Network for 3D Object Detection | Jan 10, 2020 | 3D Object Detection, 3D Object Detection From Stereo Images | Code Available | 1 |
| Advancing Vision Transformers with Group-Mix Attention | Nov 26, 2023 | image-classification, Image Classification | Code Available | 1 |
| 3D Small Object Detection with Dynamic Spatial Pruning | May 5, 2023 | 3D Object Detection, Decoder | Code Available | 1 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object Detection, Object | Code Available | 1 |
| A Simple Pooling-Based Design for Real-Time Salient Object Detection | Apr 21, 2019 | object-detection, Object Detection | Code Available | 1 |
| Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar | Jul 14, 2023 | 2D Semantic Segmentation, Autonomous Navigation | Code Available | 1 |
| Dual-Level Collaborative Transformer for Image Captioning | Jan 16, 2021 | Descriptive, Image Captioning | Code Available | 1 |
| AQD: Towards Accurate Fully-Quantized Object Detection | Jul 14, 2020 | Image Classification, Object | Code Available | 1 |
| Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving | Oct 11, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Dec 14, 2023 | Autonomous Navigation, Multi-Task Learning | Code Available | 1 |
| AquaVision: Automating the detection of waste in water bodies using deep transfer learning | Jul 18, 2020 | object-detection, Object Detection | Code Available | 1 |
| Data Augmentation for Object Detection via Differentiable Neural Rendering | Mar 4, 2021 | Data Augmentation, Neural Rendering | Code Available | 1 |