| BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning | Apr 4, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft Dataset | Apr 1, 2022 | 3D Object DetectionBenchmarking | CodeCode Available | 2 |
| Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection | Mar 30, 2022 | 2D Object DetectionBilevel Optimization | CodeCode Available | 2 |
| AdaMixer: A Fast-Converging Query-Based Object Detector | Mar 30, 2022 | ObjectObject Detection | CodeCode Available | 2 |
| Exploring Plain Vision Transformer Backbones for Object Detection | Mar 30, 2022 | Cross-Domain Few-Shot Object DetectionInstance Segmentation | CodeCode Available | 2 |
| Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data | Mar 30, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| LiDAR Snowfall Simulation for Robust 3D Object Detection | Mar 28, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection | Mar 24, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Sparse Instance Activation for Real-Time Instance Segmentation | Mar 24, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training | Mar 24, 2022 | Objectobject-detection | CodeCode Available | 2 |
| Real-time Object Detection for Streaming Perception | Mar 23, 2022 | Autonomous DrivingObject | CodeCode Available | 2 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Open-Vocabulary DETR with Conditional Matching | Mar 22, 2022 | Language Modellingobject-detection | CodeCode Available | 2 |
| TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers | Mar 22, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds | Mar 21, 2022 | AllGPU | CodeCode Available | 2 |
| V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer | Mar 20, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds | Mar 19, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion | Mar 18, 2022 | 3D Object DetectionData Augmentation | CodeCode Available | 2 |
| HybridNets: End-to-End Perception Network | Mar 17, 2022 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| Decoupled Knowledge Distillation | Mar 16, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Mar 14, 2022 | Objectobject-detection | CodeCode Available | 2 |
| QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization | Mar 11, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Mar 9, 2022 | Co-Salient Object Detectionobject-detection | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classificationobject-detection | CodeCode Available | 2 |
| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| StrongSORT: Make DeepSORT Great Again | Feb 28, 2022 | Multi-Object Trackingobject-detection | CodeCode Available | 2 |
| FreeSOLO: Learning to Segment Objects without Annotations | Feb 24, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut | Feb 23, 2022 | Objectobject-detection | CodeCode Available | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object DetectionScene Understanding | CodeCode Available | 2 |
| Tiny Object Tracking: A Large-scale Dataset and A Baseline | Feb 11, 2022 | AttributeKnowledge Distillation | CodeCode Available | 2 |
| Context Autoencoder for Self-Supervised Representation Learning | Feb 7, 2022 | DecoderInstance Segmentation | CodeCode Available | 2 |
| VOS: Learning What You Don't Know by Virtual Outlier Synthesis | Feb 2, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| The KFIoU Loss for Rotated Object Detection | Jan 29, 2022 | Objectobject-detection | CodeCode Available | 2 |
| DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR | Jan 28, 2022 | 2D Object DetectionObject Detection | CodeCode Available | 2 |
| RelTR: Relation Transformer for Scene Graph Generation | Jan 27, 2022 | DecoderGraph Generation | CodeCode Available | 2 |
| When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism | Jan 26, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| UniFormer: Unifying Convolution and Self-attention for Visual Recognition | Jan 24, 2022 | Image Classificationobject-detection | CodeCode Available | 2 |
| TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers | Jan 13, 2022 | GPUObject | CodeCode Available | 2 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | AttributeAutonomous Driving | CodeCode Available | 2 |
| QuadTree Attention for Vision Transformers | Jan 8, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Equalized Focal Loss for Dense Long-Tailed Object Detection | Jan 7, 2022 | Long-tailed Object DetectionObject | CodeCode Available | 2 |
| Vision Transformer with Deformable Attention | Jan 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View | Dec 22, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Grounded Language-Image Pre-training | Dec 7, 2021 | 2D Object DetectionDescribed Object Detection | CodeCode Available | 2 |
| MetaFormer Is Actually What You Need for Vision | Nov 22, 2021 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Deep Neural Networks to Detect Weeds from Crops in Agricultural Environments in Real-Time: A Review | Nov 8, 2021 | Object DetectionTransfer Learning | CodeCode Available | 2 |
| MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer | Oct 5, 2021 | Image Classificationobject-detection | CodeCode Available | 2 |
| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 |