| Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Jan 14, 2025 | Objectobject-detection | CodeCode Available | 1 |
| Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers | Jan 14, 2025 | Future predictionPrediction | CodeCode Available | 1 |
| TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Jan 13, 2025 | BenchmarkingDomain Adaptation | CodeCode Available | 1 |
| Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Jan 13, 2025 | 3D Semantic Scene CompletionMamba | CodeCode Available | 1 |
| Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Jan 13, 2025 | Instance SegmentationObject | CodeCode Available | 1 |
| Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints | Jan 12, 2025 | Image SegmentationReferring Expression | CodeCode Available | 1 |
| D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription | Jan 9, 2025 | DenoisingImage Segmentation | CodeCode Available | 1 |
| LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation | Jan 7, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation | Jan 6, 2025 | Domain AdaptationImage Segmentation | CodeCode Available | 1 |
| KM-UNet KAN Mamba UNet for medical image segmentation | Jan 5, 2025 | Computational EfficiencyImage Segmentation | CodeCode Available | 1 |
| Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Jan 2, 2025 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation | Jan 1, 2025 | DecoderImage Segmentation | CodeCode Available | 1 |
| POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation | Jan 1, 2025 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 1 |
| CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation | Jan 1, 2025 | Image SegmentationLesion Segmentation | CodeCode Available | 1 |
| Relation3D : Enhancing Relation Modeling for Point Cloud Instance Segmentation | Jan 1, 2025 | 3D Instance SegmentationContrastive Learning | CodeCode Available | 1 |
| FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Jan 1, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jan 1, 2025 | Action RecognitionAction Segmentation | CodeCode Available | 1 |
| Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization | Dec 24, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 |
| VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis | Dec 24, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image Segmentation | Dec 23, 2024 | Breast Cancer DetectionDecoder | CodeCode Available | 1 |
| AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation | Dec 23, 2024 | Few-Shot LearningFew-Shot Semantic Segmentation | CodeCode Available | 1 |
| Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation | Dec 23, 2024 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 1 |
| Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Dec 19, 2024 | Image SegmentationSegmentation | CodeCode Available | 1 |
| PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Dec 19, 2024 | LIDAR Semantic SegmentationScene Understanding | CodeCode Available | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Dec 18, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |