| CLIP is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Dec 16, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentation, object-detection | Code Available | 2 |
| UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation | Dec 8, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation | Dec 7, 2022 | Semantic Segmentation, zero-shot-classification | Code Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptation, image-classification | Code Available | 2 |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Nov 29, 2022 | 3D Open-Vocabulary Instance Segmentation, Contrastive Learning | Code Available | 2 |
| OpenScene: 3D Scene Understanding with Open Vocabularies | Nov 28, 2022 | 3D Open-Vocabulary Instance Segmentation, 3D Semantic Segmentation | Code Available | 2 |
| Semi-Supervised Confidence-Level-based Contrastive Discrimination for Class-Imbalanced Semantic Segmentation | Nov 28, 2022 | Contrastive Learning, Road Segmentation | Code Available | 2 |
| Medical Image Segmentation Review: The success of U-Net | Nov 27, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detection, Object Detection | Code Available | 2 |
| Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration | Nov 23, 2022 | object-detection, Object Detection | Code Available | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose Estimation, Image Classification | Code Available | 2 |
| SimpleClick: Interactive Image Segmentation with Simple Vision Transformers | Oct 20, 2022 | Image Segmentation, Interactive Segmentation | Code Available | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | Object, Semantic Segmentation | Code Available | 2 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometry, Autonomous Driving | Code Available | 2 |
| SegViT: Semantic Segmentation with Plain Vision Transformers | Oct 12, 2022 | Segmentation, Semantic Segmentation | Code Available | 2 |
| The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition | Oct 11, 2022 | image-classification, Image Classification | Code Available | 2 |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Oct 11, 2022 | 3D Point Cloud Classification, 3D Semantic Segmentation | Code Available | 2 |
| What the DAAM: Interpreting Stable Diffusion Using Cross Attention | Oct 10, 2022 | Denoising, Descriptive | Code Available | 2 |
| Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP | Oct 9, 2022 | Image Captioning, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Mask3D: Mask Transformer for 3D Semantic Instance Segmentation | Oct 6, 2022 | 3D Instance Segmentation, 3D Semantic Instance Segmentation | Code Available | 2 |
| GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Oct 5, 2022 | Out-of-Distribution Detection, Segmentation | Code Available | 2 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | Code Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 |