| Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 | Feb 15, 2023 | Object, object-detection | Code Available | 2 |
| Universal Guidance for Diffusion Models | Feb 14, 2023 | Face Recognition, object-detection | Code Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsampling, image-classification | Code Available | 2 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Jan 15, 2023 | 3D Object Detection, object-detection | Code Available | 2 |
| Wildfire Smoke Detection with Computer Vision | Jan 12, 2023 | Object Detection | Code Available | 2 |
| FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Jan 1, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnostic, object-detection | Code Available | 2 |
| DETR Does Not Need Multi-Scale or Locality Design | Jan 1, 2023 | Decoder, Object Detection | Code Available | 2 |
| Reversible Column Networks | Dec 22, 2022 | image-classification, Image Classification | Code Available | 2 |
| NMS Strikes Back | Dec 12, 2022 | Attribute, object-detection | Code Available | 2 |
| Recurrent Vision Transformers for Object Detection with Event Cameras | Dec 11, 2022 | Event-based vision, GPU | Code Available | 2 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentation, object-detection | Code Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptation, image-classification | Code Available | 2 |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Dec 1, 2022 | Decoder, Dense Captioning | Code Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detection, Object Detection | Code Available | 2 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object Detection, Image Retrieval | Code Available | 2 |
| Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration | Nov 23, 2022 | object-detection, Object Detection | Code Available | 2 |
| PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning | Nov 21, 2022 | 3D Classification, 3D Object Detection | Code Available | 2 |
| NeRF-RPN: A general framework for object detection in NeRFs | Nov 21, 2022 | NeRF, object-detection | Code Available | 2 |
| MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception | Nov 19, 2022 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 2 |
| Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion | Nov 19, 2022 | 3D Object Detection, object-detection | Code Available | 2 |
| SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking | Nov 16, 2022 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose Estimation, Image Classification | Code Available | 2 |
| Large Scale Radio Frequency Wideband Signal Detection & Recognition | Nov 4, 2022 | object-detection, Object Detection | Code Available | 2 |
| SSDA-YOLO: Semi-supervised Domain Adaptive YOLO for Cross-Domain Object Detection | Nov 4, 2022 | Domain Adaptation, Knowledge Distillation | Code Available | 2 |