| Video Object Segmentation in Panoptic Wild Scenes | May 8, 2023 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| OctFormer: Octree-based Transformers for 3D Point Clouds | May 4, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | May 3, 2023 | Instance SegmentationObject | CodeCode Available | 2 |
| TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding | May 1, 2023 | 3D Object DetectionMonocular Depth Estimation | CodeCode Available | 2 |
| Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | May 1, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Apr 26, 2023 | DiversityDomain Generalization | CodeCode Available | 2 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Apr 20, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding | Apr 14, 2023 | 3D Object DetectionScene Understanding | CodeCode Available | 2 |
| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Apr 14, 2023 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 2 |
| Unifying and Personalizing Weakly-supervised Federated Medical Image Segmentation via Adaptive Representation and Aggregation | Apr 12, 2023 | channel selectionFederated Learning | CodeCode Available | 2 |
| SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAM | Apr 12, 2023 | Image SegmentationSegmentation | CodeCode Available | 2 |
| UniverSeg: Universal Medical Image Segmentation | Apr 12, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction | Apr 11, 2023 | 3D Semantic Occupancy Prediction3D Semantic Scene Completion | CodeCode Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | DiagnosticDiversity | CodeCode Available | 2 |
| Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition | Apr 10, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| CherryPicker: Semantic Skeletonization and Topological Reconstruction of Cherry Trees | Apr 10, 2023 | Monocular ReconstructionPlant Phenotyping | CodeCode Available | 2 |
| RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding | Apr 3, 2023 | Contrastive LearningInstance Segmentation | CodeCode Available | 2 |
| Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Apr 3, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| DDP: Diffusion Model for Dense Visual Prediction | Mar 30, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 |
| Mask-Free Video Instance Segmentation | Mar 28, 2023 | Instance SegmentationOptical Flow Estimation | CodeCode Available | 2 |
| Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching | Mar 27, 2023 | DecoderFew-Shot Learning | CodeCode Available | 2 |
| Vision Transformer with Quadrangle Attention | Mar 27, 2023 | object-detectionObject Detection | CodeCode Available | 2 |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Mar 26, 2023 | DecoderPanoptic Segmentation | CodeCode Available | 2 |
| Spherical Transformer for LiDAR-based 3D Recognition | Mar 22, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Generative Semantic Segmentation | Mar 20, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| M^2SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation | Mar 20, 2023 | Computed Tomography (CT)Decoder | CodeCode Available | 2 |
| Towards Diverse Binary Segmentation via A Simple yet General Gated Network | Mar 18, 2023 | DecoderSegmentation | CodeCode Available | 2 |
| MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Mar 17, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Large Selective Kernel Network for Remote Sensing Object Detection | Mar 16, 2023 | Objectobject-detection | CodeCode Available | 2 |
| BiFormer: Vision Transformer with Bi-Level Routing Attention | Mar 15, 2023 | Computational EfficiencyGPU | CodeCode Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Mar 15, 2023 | DecoderInstance Segmentation | CodeCode Available | 2 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Mar 8, 2023 | Semantic Segmentation | CodeCode Available | 2 |
| Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern Analysis | Mar 4, 2023 | BenchmarkingContrastive Learning | CodeCode Available | 2 |
| Unleashing Text-to-Image Diffusion Models for Visual Perception | Mar 3, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Feb 23, 2023 | Language ModellingOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| 1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Feb 6, 2023 | Multi-class ClassificationPanoptic Segmentation | CodeCode Available | 2 |
| MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Feb 3, 2023 | ObjectSegmentation | CodeCode Available | 2 |
| Audio-Visual Segmentation with Semantics | Jan 30, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsamplingimage-classification | CodeCode Available | 2 |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Jan 12, 2023 | Semantic SegmentationTime Series | CodeCode Available | 2 |
| Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Jan 3, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| XNet: Wavelet-Based Low and High Frequency Fusion Networks for Fully- and Semi-Supervised Semantic Segmentation of Biomedical Images | Jan 1, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Reversible Column Networks | Dec 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |