| Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion | Mar 22, 2021 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Cross-dataset Training for Class Increasing Object Detection | Jan 14, 2020 | Objectobject-detection | CodeCode Available | 1 | 5 |
| CrossDet: Crossline Representation for Object Detection | Jan 1, 2021 | Objectobject-detection | CodeCode Available | 1 | 5 |
| LUAI Challenge 2021 on Learning to Understand Aerial Images | Aug 30, 2021 | Objectobject-detection | CodeCode Available | 1 | 5 |
| L-Verse: Bidirectional Generation Between Image and Text | Nov 22, 2021 | Image CaptioningImage Generation | CodeCode Available | 1 | 5 |
| 3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers | Nov 27, 2022 | 3D Object DetectionDecoder | CodeCode Available | 1 | 5 |
| AI2-THOR: An Interactive 3D Environment for Visual AI | Dec 14, 2017 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 | 5 |
| Cross-domain Detection via Graph-induced Prototype Alignment | Mar 28, 2020 | Domain Adaptationobject-detection | CodeCode Available | 1 | 5 |
| Cross-Domain Document Object Detection: Benchmark Suite and Method | Mar 30, 2020 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| AI Accelerator Survey and Trends | Sep 18, 2021 | BenchmarkingComputational Efficiency | CodeCode Available | 1 | 5 |
| LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 | 5 |
| LVIS: A Dataset for Large Vocabulary Instance Segmentation | Aug 8, 2019 | Instance SegmentationObject | CodeCode Available | 1 | 5 |
| Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment | Feb 23, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 1 | 5 |
| M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | May 10, 2025 | Autonomous DrivingMotion Forecasting | CodeCode Available | 1 | 5 |
| Cross Domain Object Detection by Target-Perceived Dual Branch Distillation | May 3, 2022 | Objectobject-detection | CodeCode Available | 1 | 5 |
| Meta-Learning of Neural Architectures for Few-Shot Learning | Nov 25, 2019 | Few-Shot LearningMeta-Learning | CodeCode Available | 1 | 5 |
| Cross-Domain Adaptive Teacher for Object Detection | Nov 25, 2021 | Data AugmentationDomain Adaptation | CodeCode Available | 1 | 5 |
| Convolutional Neural Networks with Gated Recurrent Connections | Jun 5, 2021 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 24, 2024 | Ensemble LearningObject | CodeCode Available | 1 | 5 |
| Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers | Mar 23, 2021 | Amodal Instance SegmentationBoundary Detection | CodeCode Available | 1 | 5 |
| Deep object detection for waterbird monitoring using aerial imagery | Oct 10, 2022 | ManagementObject | CodeCode Available | 1 | 5 |
| A Robotic Approach towards Quantifying Epipelagic Bound Plastic Using Deep Visual Models | May 5, 2021 | Object Detection | CodeCode Available | 1 | 5 |
| Low-Light Image Enhancement Framework for Improved Object Detection in Fisheye Lens Datasets | Apr 15, 2024 | Ensemble LearningImage Enhancement | CodeCode Available | 1 | 5 |
| LP-OVOD: Open-Vocabulary Object Detection by Linear Probing | Oct 26, 2023 | Objectobject-detection | CodeCode Available | 1 | 5 |
| 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone | May 2, 2022 | 3D Object DetectionObject | CodeCode Available | 1 | 5 |
| DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection | Jul 18, 2022 | 3D Object DetectionAttribute | CodeCode Available | 1 | 5 |
| ConvNet Architecture Search for Spatiotemporal Feature Learning | Aug 16, 2017 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| CrossKD: Cross-Head Knowledge Distillation for Object Detection | Jun 20, 2023 | Dense Object DetectionKnowledge Distillation | CodeCode Available | 1 | 5 |
| LPYOLO: Low Precision YOLO for Face Detection on FPGA | Jul 21, 2022 | CPUDecision Making | CodeCode Available | 1 | 5 |
| LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Nov 9, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 | 5 |
| Cross-Layer Retrospective Retrieving via Layer Attention | Feb 8, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| 3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object Detection | Jan 1, 2023 | 3D Object DetectionDecoder | CodeCode Available | 1 | 5 |
| Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays | Mar 11, 2023 | DenoisingObject | CodeCode Available | 1 | 5 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 | 5 |
| Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization | Jun 14, 2016 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers | Apr 24, 2021 | 3D Object Detectionobject-detection | CodeCode Available | 1 | 5 |
| ConvMLP: Hierarchical Convolutional MLPs for Vision | Sep 9, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 1 | 5 |
| A General Regret Bound of Preconditioned Gradient Method for DNN Training | Jan 1, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition | Nov 22, 2022 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Cross-Modality Fusion Transformer for Multispectral Object Detection | Oct 30, 2021 | Multispectral Object DetectionObject | CodeCode Available | 1 | 5 |
| DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | May 9, 2025 | Action DetectionDecoder | CodeCode Available | 1 | 5 |
| Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection | Nov 14, 2022 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 | 5 |
| Dilated convolution with learnable spacings | Dec 7, 2021 | Image ClassificationObject Detection | CodeCode Available | 1 | 5 |
| Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds | Dec 16, 2021 | Objectobject-detection | CodeCode Available | 1 | 5 |
| Cross-modal transformers for infrared and visible image fusion | Jun 26, 2023 | Cross-Modal RetrievalDepth Estimation | CodeCode Available | 1 | 5 |
| Look-into-Object: Self-supervised Structure Modeling for Object Recognition | Mar 31, 2020 | Fine-Grained Image ClassificationImage Recognition | CodeCode Available | 1 | 5 |
| Cross-Modal Weighting Network for RGB-D Salient Object Detection | Jul 9, 2020 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Mining the Benefits of Two-stage and One-stage HOI Detection | Aug 11, 2021 | ClassificationHuman-Object Interaction Detection | CodeCode Available | 1 | 5 |
| Deeply supervised salient object detection with short connections | Nov 15, 2016 | Boundary DetectionObject | CodeCode Available | 1 | 5 |
| DeepMIM: Deep Supervision for Masked Image Modeling | Mar 15, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |