| Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 | Feb 15, 2023 | Object, object-detection | Code Available | 2 |
| Universal Guidance for Diffusion Models | Feb 14, 2023 | Face Recognition, object-detection | Code Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsampling, image-classification | Code Available | 2 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Jan 15, 2023 | 3D Object Detection, object-detection | Code Available | 2 |
| Wildfire Smoke Detection with Computer Vision | Jan 12, 2023 | Object Detection | Code Available | 2 |
| FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Jan 1, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnostic, object-detection | Code Available | 2 |
| DETR Does Not Need Multi-Scale or Locality Design | Jan 1, 2023 | Decoder, Object Detection | Code Available | 2 |
| Reversible Column Networks | Dec 22, 2022 | image-classification, Image Classification | Code Available | 2 |
| NMS Strikes Back | Dec 12, 2022 | Attribute, object-detection | Code Available | 2 |
| Recurrent Vision Transformers for Object Detection with Event Cameras | Dec 11, 2022 | Event-based vision, GPU | Code Available | 2 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentation, object-detection | Code Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptation, image-classification | Code Available | 2 |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Dec 1, 2022 | Decoder, Dense Captioning | Code Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detection, Object Detection | Code Available | 2 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object Detection, Image Retrieval | Code Available | 2 |
| Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration | Nov 23, 2022 | object-detection, Object Detection | Code Available | 2 |
| PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning | Nov 21, 2022 | 3D Classification, 3D Object Detection | Code Available | 2 |
| NeRF-RPN: A general framework for object detection in NeRFs | Nov 21, 2022 | NeRF, object-detection | Code Available | 2 |
| MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception | Nov 19, 2022 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 2 |
| Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion | Nov 19, 2022 | 3D Object Detection, object-detection | Code Available | 2 |
| SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking | Nov 16, 2022 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose Estimation, Image Classification | Code Available | 2 |
| Large Scale Radio Frequency Wideband Signal Detection & Recognition | Nov 4, 2022 | object-detection, Object Detection | Code Available | 2 |
| SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection | Nov 4, 2022 | Domain Adaptation, Knowledge Distillation | Code Available | 2 |
| CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion | Oct 19, 2022 | Camera Pose Estimation, Depth Estimation | Code Available | 2 |
| The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition | Oct 11, 2022 | image-classification, Image Classification | Code Available | 2 |
| Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection | Oct 5, 2022 | 3D Object Detection, object-detection | Code Available | 2 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Object, object-detection | Code Available | 2 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | Code Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 |
| SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery | Sep 27, 2022 | Object Detection, Real-Time Object Detection | Code Available | 2 |
| Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps | Sep 26, 2022 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 2 |
| BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo | Sep 21, 2022 | 3D Object Detection, Depth Estimation | Code Available | 2 |
| RDD2022: A multi-national image dataset for automatic Road Damage Detection | Sep 18, 2022 | object-detection, Object Detection | Code Available | 2 |
| Scalable SoftGroup for 3D Instance Segmentation on Point Clouds | Sep 17, 2022 | 3D Instance Segmentation, Instance Segmentation | Code Available | 2 |
| CenterFormer: Center-based Transformer for 3D Object Detection | Sep 12, 2022 | 3D Object Detection, Object | Code Available | 2 |
| Multi-Grained Angle Representation for Remote Sensing Object Detection | Sep 7, 2022 | Object, object-detection | Code Available | 2 |
| Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial Images | Sep 6, 2022 | object-detection, Object Detection | Code Available | 2 |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection | Sep 4, 2022 | object-detection, Object Detection | Code Available | 2 |
| Visual Prompting via Image Inpainting | Sep 1, 2022 | Colorization, Edge Detection | Code Available | 2 |
| VEViD: Vision Enhancement via Virtual diffraction and coherent Detection | Aug 25, 2022 | 4k, Image Enhancement | Code Available | 2 |
| YOLOPv2: Better, Faster, Stronger for Panoptic Driving Perception | Aug 24, 2022 | Autonomous Driving, Drivable Area Detection | Code Available | 2 |
| CitySim: A Drone-Based Vehicle Trajectory Dataset for Safety Oriented Research and Digital Twins | Aug 23, 2022 | object-detection, Object Detection | Code Available | 2 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object Detection, Decoder | Code Available | 2 |
| YOLOV: Making Still Image Object Detectors Great at Video Object Detection | Aug 20, 2022 | GPU, Object | Code Available | 2 |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Aug 18, 2022 | Object, object-detection | Code Available | 2 |
| No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects | Aug 7, 2022 | image-classification, Image Classification | Code Available | 2 |
| Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation | Jul 30, 2022 | Few-Shot Object Detection, Meta-Learning | Code Available | 2 |
| HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Jul 28, 2022 | Image Classification, Object Detection | Code Available | 2 |