| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention | Dec 2, 2021 | Object DetectionRepresentation Learning | CodeCode Available | 1 |
| Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images | Dec 2, 2021 | Automatic Liver And Tumor SegmentationComputed Tomography (CT) | CodeCode Available | 1 |
| Pooling by Sliced-Wasserstein Embedding | Dec 1, 2021 | Graph Learningimage-classification | CodeCode Available | 1 |
| Confidence Propagation Cluster: Unleash Full Potential of Object Detectors | Dec 1, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning | Dec 1, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Focal Attention for Long-Range Interactions in Vision Transformers | Dec 1, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Object-Aware Cropping for Self-Supervised Learning | Dec 1, 2021 | Data AugmentationObject | CodeCode Available | 1 |
| Container: Context Aggregation Networks | Dec 1, 2021 | Inductive BiasInstance Segmentation | CodeCode Available | 1 |
| Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement | Dec 1, 2021 | Dictionary LearningFew-Shot Object Detection | CodeCode Available | 1 |
| The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization | Dec 1, 2021 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information | Nov 30, 2021 | Active LearningAutonomous Vehicles | CodeCode Available | 1 |
| A Unified Pruning Framework for Vision Transformers | Nov 30, 2021 | Model Compressionobject-detection | CodeCode Available | 1 |
| Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection | Nov 30, 2021 | 3D Object DetectionDomain Adaptation | CodeCode Available | 1 |
| Event-Based Fusion for Motion Deblurring with Cross-modal Attention | Nov 30, 2021 | DeblurringImage Deblurring | CodeCode Available | 1 |
| DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion | Nov 29, 2021 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Searching the Search Space of Vision Transformer | Nov 29, 2021 | Neural Architecture Searchobject-detection | CodeCode Available | 1 |
| Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity | Nov 29, 2021 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition | Nov 25, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| CDNet is all you need: Cascade DCN based underwater object detection RCNN | Nov 25, 2021 | AllObject | CodeCode Available | 1 |
| BoxeR: Box-Attention for 2D and 3D Transformers | Nov 25, 2021 | 3D Object DetectionInstance Segmentation | CodeCode Available | 1 |
| Cross-Domain Adaptive Teacher for Object Detection | Nov 25, 2021 | Data AugmentationDomain Adaptation | CodeCode Available | 1 |
| Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark | Nov 25, 2021 | Matrix CompletionMoving Object Detection | CodeCode Available | 1 |
| PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers | Nov 24, 2021 | Image Classificationobject-detection | CodeCode Available | 1 |
| Focal and Global Knowledge Distillation for Detectors | Nov 23, 2021 | image-classificationImage Classification | CodeCode Available | 1 |