| Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images | Dec 2, 2021 | Automatic Liver And Tumor Segmentation, Computed Tomography (CT) | Code Available | 1 |
| MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention | Dec 2, 2021 | Object Detection, Representation Learning | Code Available | 1 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Pooling by Sliced-Wasserstein Embedding | Dec 1, 2021 | Graph Learning, image-classification | Code Available | 1 |
| Container: Context Aggregation Networks | Dec 1, 2021 | Inductive Bias, Instance Segmentation | Code Available | 1 |
| Object-Aware Cropping for Self-Supervised Learning | Dec 1, 2021 | Data Augmentation, Object | Code Available | 1 |
| Confidence Propagation Cluster: Unleash Full Potential of Object Detectors | Dec 1, 2021 | Object, object-detection | Code Available | 1 |
| The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization | Dec 1, 2021 | Autonomous Driving, Domain Adaptation | Code Available | 1 |
| Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning | Dec 1, 2021 | image-classification, Image Classification | Code Available | 1 |
| Focal Attention for Long-Range Interactions in Vision Transformers | Dec 1, 2021 | image-classification, Image Classification | Code Available | 1 |
| Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement | Dec 1, 2021 | Dictionary Learning, Few-Shot Object Detection | Code Available | 1 |
| Event-Based Fusion for Motion Deblurring with Cross-modal Attention | Nov 30, 2021 | Deblurring, Image Deblurring | Code Available | 1 |
| TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information | Nov 30, 2021 | Active Learning, Autonomous Vehicles | Code Available | 1 |
| A Unified Pruning Framework for Vision Transformers | Nov 30, 2021 | Model Compression, object-detection | Code Available | 1 |
| Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection | Nov 30, 2021 | 3D Object Detection, Domain Adaptation | Code Available | 1 |
| DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion | Nov 29, 2021 | Multi-Object Tracking, Object | Code Available | 1 |
| Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity | Nov 29, 2021 | Computational Efficiency, Decoder | Code Available | 1 |
| Searching the Search Space of Vision Transformer | Nov 29, 2021 | Neural Architecture Search, object-detection | Code Available | 1 |
| NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition | Nov 25, 2021 | object-detection, Object Detection | Code Available | 1 |
| CDNet is all you need: Cascade DCN based underwater object detection RCNN | Nov 25, 2021 | All, Object | Code Available | 1 |
| BoxeR: Box-Attention for 2D and 3D Transformers | Nov 25, 2021 | 3D Object Detection, Instance Segmentation | Code Available | 1 |
| Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark | Nov 25, 2021 | Matrix Completion, Moving Object Detection | Code Available | 1 |
| Cross-Domain Adaptive Teacher for Object Detection | Nov 25, 2021 | Data Augmentation, Domain Adaptation | Code Available | 1 |
| PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers | Nov 24, 2021 | Image Classification, object-detection | Code Available | 1 |
| Focal and Global Knowledge Distillation for Detectors | Nov 23, 2021 | image-classification, Image Classification | Code Available | 1 |
| Few-Shot Object Detection via Association and DIscrimination | Nov 23, 2021 | Few-Shot Object Detection, Object | Code Available | 1 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images | Nov 22, 2021 | Instance Segmentation, Object Detection | Code Available | 1 |
| Class-agnostic Object Detection with Multi-modal Transformer | Nov 22, 2021 | Class-agnostic Object Detection, Object | Code Available | 1 |
| Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras | Nov 22, 2021 | Multi-Object Tracking, object-detection | Code Available | 1 |
| Benchmarking Detection Transfer Learning with Vision Transformers | Nov 22, 2021 | Benchmarking, object-detection | Code Available | 1 |
| Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture | Nov 22, 2021 | Handwritten Text Recognition, object-detection | Code Available | 1 |
| L-Verse: Bidirectional Generation Between Image and Text | Nov 22, 2021 | Image Captioning, Image Generation | Code Available | 1 |
| MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection | Nov 22, 2021 | Data Augmentation, object-detection | Code Available | 1 |
| FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Nov 22, 2021 | Benchmarking, Federated Learning | Code Available | 1 |
| FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection | Nov 21, 2021 | object-detection, Object Detection | Code Available | 1 |
| Grounded Situation Recognition with Transformers | Nov 19, 2021 | Decoder, Grounded Situation Recognition | Code Available | 1 |
| Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level Estimation | Nov 18, 2021 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Nov 18, 2021 | Object, object-detection | Code Available | 1 |
| TransMix: Attend to Mix for Vision Transformers | Nov 18, 2021 | Instance Segmentation, object-detection | Code Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classification, image-classification | Code Available | 1 |
| ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data | Nov 17, 2021 | 3D Object Detection, object-detection | Code Available | 1 |
| Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking Trackers | Nov 17, 2021 | Adversarial Attack, Multi-Object Tracking | Code Available | 1 |
| SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining | Nov 17, 2021 | Contrastive Learning, Image Restoration | Code Available | 1 |
| TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Nov 17, 2021 | Data Augmentation, Image Augmentation | Code Available | 1 |
| Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Nov 16, 2021 | Cross-Modal Retrieval, Image Captioning | Code Available | 1 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classification, Image Classification | Code Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action Classification, Object | Code Available | 1 |
| Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection | Nov 12, 2021 | Few-Shot Object Detection, Meta-Learning | Code Available | 1 |
| Indian Licence Plate Dataset in the wild | Nov 11, 2021 | object-detection, Object Detection | Code Available | 1 |