| CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Dec 16, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentation, object-detection | Code Available | 2 |
| UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation | Dec 8, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation | Dec 7, 2022 | Semantic Segmentation, zero-shot-classification | Code Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptation, image-classification | Code Available | 2 |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Nov 29, 2022 | 3D Open-Vocabulary Instance Segmentation, Contrastive Learning | Code Available | 2 |
| Semi-Supervised Confidence-Level-based Contrastive Discrimination for Class-Imbalanced Semantic Segmentation | Nov 28, 2022 | Contrastive Learning, Road Segmentation | Code Available | 2 |
| OpenScene: 3D Scene Understanding with Open Vocabularies | Nov 28, 2022 | 3D Open-Vocabulary Instance Segmentation, 3D Semantic Segmentation | Code Available | 2 |
| Medical Image Segmentation Review: The success of U-Net | Nov 27, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detection, Object Detection | Code Available | 2 |
| Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration | Nov 23, 2022 | object-detection, Object Detection | Code Available | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose Estimation, Image Classification | Code Available | 2 |
| SimpleClick: Interactive Image Segmentation with Simple Vision Transformers | Oct 20, 2022 | Image Segmentation, Interactive Segmentation | Code Available | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | Object, Semantic Segmentation | Code Available | 2 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometry, Autonomous Driving | Code Available | 2 |
| SegViT: Semantic Segmentation with Plain Vision Transformers | Oct 12, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition | Oct 11, 2022 | image-classification, Image Classification | Code Available | 2 |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Oct 11, 2022 | 3D Point Cloud Classification, 3D Semantic Segmentation | Code Available | 2 |
| What the DAAM: Interpreting Stable Diffusion Using Cross Attention | Oct 10, 2022 | Denoising, Descriptive | Code Available | 2 |
| Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP | Oct 9, 2022 | Image Captioning, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Mask3D: Mask Transformer for 3D Semantic Instance Segmentation | Oct 6, 2022 | 3D Instance Segmentation, 3D Semantic Instance Segmentation | Code Available | 2 |
| GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Oct 5, 2022 | Out-of-Distribution Detection, Segmentation | Code Available | 2 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | Code Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 |
| Generalized Parametric Contrastive Learning | Sep 26, 2022 | Contrastive Learning, Domain Generalization | Code Available | 2 |
| Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future Directions | Sep 21, 2022 | Data Augmentation, Domain Adaptation | Code Available | 2 |
| SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation | Sep 18, 2022 | Real-Time Semantic Segmentation, Segmentation | Code Available | 2 |
| DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Sep 17, 2022 | Motion Segmentation, Semantic Segmentation | Code Available | 2 |
| Scalable SoftGroup for 3D Instance Segmentation on Point Clouds | Sep 17, 2022 | 3D Instance Segmentation, Instance Segmentation | Code Available | 2 |
| MCIBI++: Soft Mining Contextual Information Beyond Image for Semantic Segmentation | Sep 9, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation | Aug 21, 2022 | Medical Image Analysis, Semantic Segmentation | Code Available | 2 |
| PyMIC: A deep learning toolkit for annotation-efficient medical image segmentation | Aug 19, 2022 | Deep Learning, Image Segmentation | Code Available | 2 |
| FEC: Fast Euclidean Clustering for Point Cloud Segmentation | Aug 16, 2022 | Clustering, Instance Segmentation | Code Available | 2 |
| Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model | Aug 8, 2022 | Aerial Scene Classification, Few-Shot Learning | Code Available | 2 |
| Occlusion-Aware Instance Segmentation via BiLayer Network Architectures | Aug 8, 2022 | Human Instance Segmentation, Instance Segmentation | Code Available | 2 |
| MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training | Aug 3, 2022 | Instance Segmentation, Segmentation | Code Available | 2 |
| HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Jul 28, 2022 | Image Classification, Object Detection | Code Available | 2 |
| In Defense of Online Models for Video Instance Segmentation | Jul 21, 2022 | Contrastive Learning, Instance Segmentation | Code Available | 2 |
| SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Jul 17, 2022 | Land Cover Classification, Semantic Segmentation | Code Available | 2 |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Jul 11, 2022 | Image Classification, Instance Segmentation | Code Available | 2 |
| SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow | Jul 10, 2022 | Real-Time Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds | Jul 10, 2022 | 3D Semantic Segmentation, Autonomous Driving | Code Available | 2 |
| More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity | Jul 7, 2022 | Object Detection, Semantic Segmentation | Code Available | 2 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation | Jul 5, 2022 | Autonomous Driving, Collision Avoidance | Code Available | 2 |
| Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters | Jul 4, 2022 | Autonomous Driving, Scene Segmentation | Code Available | 2 |
| Rethinking Unsupervised Domain Adaptation for Semantic Segmentation | Jun 30, 2022 | Domain Adaptation, Semantic Segmentation | Code Available | 2 |
| LaserMix for Semi-Supervised LiDAR Semantic Segmentation | Jun 30, 2022 | LIDAR Semantic Segmentation, Segmentation | Code Available | 2 |
| LViT: Language meets Vision Transformer in Medical Image Segmentation | Jun 29, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |