| SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images | Feb 20, 2025 | Image SegmentationSegmentation | CodeCode Available | 1 |
| WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields | Feb 18, 2025 | Instance SegmentationManagement | CodeCode Available | 1 |
| Leveraging Labelled Data Knowledge: A Cooperative Rectification Learning Network for Semi-supervised 3D Medical Image Segmentation | Feb 17, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| QMaxViT-Unet+: A Query-Based MaxViT-Unet with Edge Enhancement for Scribble-Supervised Segmentation of Medical Images | Feb 14, 2025 | DecoderImage Segmentation | CodeCode Available | 1 |
| MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation Tools | Feb 14, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| SQ-GAN: Semantic Image Communications Using Masked Vector Quantization | Feb 13, 2025 | Image CompressionQuantization | CodeCode Available | 1 |
| Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation | Feb 12, 2025 | Computational EfficiencyImage Segmentation | CodeCode Available | 1 |
| HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Feb 12, 2025 | Cell SegmentationImage Generation | CodeCode Available | 1 |
| Conditional diffusion model with spatial attention and latent embedding for medical image segmentation | Feb 10, 2025 | HippocampusImage Segmentation | CodeCode Available | 1 |
| Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Feb 5, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation | Feb 4, 2025 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentation | Feb 4, 2025 | Image SegmentationMamba | CodeCode Available | 1 |
| Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification | Feb 4, 2025 | Cell SegmentationDecoder | CodeCode Available | 1 |
| FSPGD: Rethinking Black-box Attacks on Semantic Segmentation | Feb 3, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic Segmentation | Feb 1, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation Transformer | Jan 29, 2025 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation | Jan 29, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| SeqSeg: Learning Local Segments for Automatic Vascular Model Construction | Jan 27, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| 3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Jan 24, 2025 | Autonomous DrivingDomain Generalization | CodeCode Available | 1 |
| MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Jan 23, 2025 | Referring Expression SegmentationReferring Video Object Segmentation | CodeCode Available | 1 |
| MedicoSAM: Towards foundation models for medical image segmentation | Jan 20, 2025 | Image SegmentationInteractive Segmentation | CodeCode Available | 1 |
| Automatic Labelling & Semantic Segmentation with 4D Radar Tensors | Jan 20, 2025 | Semantic Segmentationvehicle detection | CodeCode Available | 1 |
| Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention | Jan 18, 2025 | Image SegmentationSegmentation | CodeCode Available | 1 |
| Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Jan 17, 2025 | Few-Shot Semantic SegmentationSegmentation | CodeCode Available | 1 |
| HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation | Jan 16, 2025 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Jan 14, 2025 | Objectobject-detection | CodeCode Available | 1 |
| Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers | Jan 14, 2025 | Future predictionPrediction | CodeCode Available | 1 |
| TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Jan 13, 2025 | BenchmarkingDomain Adaptation | CodeCode Available | 1 |
| Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Jan 13, 2025 | 3D Semantic Scene CompletionMamba | CodeCode Available | 1 |
| Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Jan 13, 2025 | Instance SegmentationObject | CodeCode Available | 1 |
| Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints | Jan 12, 2025 | Image SegmentationReferring Expression | CodeCode Available | 1 |
| D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription | Jan 9, 2025 | DenoisingImage Segmentation | CodeCode Available | 1 |
| LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation | Jan 7, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation | Jan 6, 2025 | Domain AdaptationImage Segmentation | CodeCode Available | 1 |
| KM-UNet KAN Mamba UNet for medical image segmentation | Jan 5, 2025 | Computational EfficiencyImage Segmentation | CodeCode Available | 1 |
| Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Jan 2, 2025 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation | Jan 1, 2025 | DecoderImage Segmentation | CodeCode Available | 1 |
| POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation | Jan 1, 2025 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 1 |
| CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation | Jan 1, 2025 | Image SegmentationLesion Segmentation | CodeCode Available | 1 |
| Relation3D : Enhancing Relation Modeling for Point Cloud Instance Segmentation | Jan 1, 2025 | 3D Instance SegmentationContrastive Learning | CodeCode Available | 1 |
| FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Jan 1, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jan 1, 2025 | Action RecognitionAction Segmentation | CodeCode Available | 1 |
| Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Dec 24, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 |
| VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Dec 24, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image Segmentation | Dec 23, 2024 | Breast Cancer DetectionDecoder | CodeCode Available | 1 |
| AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation | Dec 23, 2024 | Few-Shot LearningFew-Shot Semantic Segmentation | CodeCode Available | 1 |
| Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation | Dec 23, 2024 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 1 |
| Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Dec 19, 2024 | Image SegmentationSegmentation | CodeCode Available | 1 |
| PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Dec 19, 2024 | LIDAR Semantic SegmentationScene Understanding | CodeCode Available | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Dec 18, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |