| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| StrongSORT: Make DeepSORT Great Again | Feb 28, 2022 | Multi-Object Trackingobject-detection | CodeCode Available | 2 |
| FreeSOLO: Learning to Segment Objects without Annotations | Feb 24, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut | Feb 23, 2022 | Objectobject-detection | CodeCode Available | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object DetectionScene Understanding | CodeCode Available | 2 |
| Tiny Object Tracking: A Large-scale Dataset and A Baseline | Feb 11, 2022 | AttributeKnowledge Distillation | CodeCode Available | 2 |
| Context Autoencoder for Self-Supervised Representation Learning | Feb 7, 2022 | DecoderInstance Segmentation | CodeCode Available | 2 |
| VOS: Learning What You Don't Know by Virtual Outlier Synthesis | Feb 2, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| The KFIoU Loss for Rotated Object Detection | Jan 29, 2022 | Objectobject-detection | CodeCode Available | 2 |
| DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR | Jan 28, 2022 | 2D Object DetectionObject Detection | CodeCode Available | 2 |
| RelTR: Relation Transformer for Scene Graph Generation | Jan 27, 2022 | DecoderGraph Generation | CodeCode Available | 2 |
| When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism | Jan 26, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| UniFormer: Unifying Convolution and Self-attention for Visual Recognition | Jan 24, 2022 | Image Classificationobject-detection | CodeCode Available | 2 |
| TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers | Jan 13, 2022 | GPUObject | CodeCode Available | 2 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | AttributeAutonomous Driving | CodeCode Available | 2 |
| QuadTree Attention for Vision Transformers | Jan 8, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Equalized Focal Loss for Dense Long-Tailed Object Detection | Jan 7, 2022 | Long-tailed Object DetectionObject | CodeCode Available | 2 |
| Vision Transformer with Deformable Attention | Jan 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View | Dec 22, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Grounded Language-Image Pre-training | Dec 7, 2021 | 2D Object DetectionDescribed Object Detection | CodeCode Available | 2 |
| MetaFormer Is Actually What You Need for Vision | Nov 22, 2021 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Deep Neural Networks to Detect Weeds from Crops in Agricultural Environments in Real-Time: A Review | Nov 8, 2021 | Object DetectionTransfer Learning | CodeCode Available | 2 |
| MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer | Oct 5, 2021 | Image Classificationobject-detection | CodeCode Available | 2 |
| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 |