| Language as Queries for Referring Video Object Segmentation | Jan 3, 2022 | Object, Object Tracking | Code Available | 2 |
| C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Jan 1, 2022 | Contrastive Learning, image-classification | Code Available | 2 |
| Mask2Former for Video Instance Segmentation | Dec 20, 2021 | Image Segmentation, Instance Segmentation | Code Available | 2 |
| Improving Image Restoration by Revisiting Global Information Aggregation | Dec 8, 2021 | Color Image Denoising, Deblurring | Code Available | 2 |
| Masked-attention Mask Transformer for Universal Image Segmentation | Dec 2, 2021 | 2D Semantic Segmentation, Image Segmentation | Code Available | 2 |
| MetaFormer Is Actually What You Need for Vision | Nov 22, 2021 | Image Classification, Object Detection | Code Available | 2 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classification, Image Classification | Code Available | 2 |
| UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Sep 18, 2021 | Change Detection, Decoder | Code Available | 2 |
| Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking | Sep 8, 2021 | Benchmarking, Diversity | Code Available | 2 |
| Open-World Entity Segmentation | Jul 29, 2021 | Image Manipulation, Image Segmentation | Code Available | 2 |
| Per-Pixel Classification is Not All You Need for Semantic Segmentation | Jul 13, 2021 | All, Classification | Code Available | 2 |
| Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling | Jul 6, 2021 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images | Jun 23, 2021 | Autonomous Driving, Decision Making | Code Available | 2 |
| BEiT: BERT Pre-Training of Image Transformers | Jun 15, 2021 | Document Image Classification, Document Layout Analysis | Code Available | 2 |
| Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations | Jun 10, 2021 | Instance Segmentation, object-detection | Code Available | 2 |
| Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks | May 5, 2021 | image-classification, Image Classification | Code Available | 2 |
| A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images | Apr 25, 2021 | Decoder, Segmentation | Code Available | 2 |
| Multi-Modal Fusion Transformer for End-to-End Autonomous Driving | Apr 19, 2021 | Autonomous Driving | Code Available | 2 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classification, Image Classification | Code Available | 2 |
| Full Page Handwriting Recognition via Image to Sequence Extraction | Mar 11, 2021 | Handwriting Recognition, Handwritten Text Recognition | Code Available | 2 |
| Coordinate Attention for Efficient Mobile Network Design | Mar 4, 2021 | object-detection, Object Detection | Code Available | 2 |
| LambdaNetworks: Modeling Long-Range Interactions Without Attention | Feb 17, 2021 | image-classification, Image Classification | Code Available | 2 |
| TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation | Feb 8, 2021 | Cardiac Segmentation, Decoder | Code Available | 2 |
| Simplifying Object Segmentation with PixelLib Library | Jan 20, 2021 | Image Classification, Instance Segmentation | Code Available | 2 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object Segmentation, Decoder | Code Available | 2 |