| Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images | Mar 21, 2025 | Image SegmentationMamba | CodeCode Available | 2 | 5 |
| DDP: Diffusion Model for Dense Visual Prediction | Mar 30, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 | 5 |
| PixelLM: Pixel Reasoning with Large Multimodal Model | Dec 4, 2023 | Decodermodel | CodeCode Available | 2 | 5 |
| Pixel-Wise Recognition for Holistic Surgical Scene Understanding | Jan 20, 2024 | Scene UnderstandingSegmentation | CodeCode Available | 2 | 5 |
| PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Jan 15, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Apr 3, 2025 | Field Boundary DelineationInstance Segmentation | CodeCode Available | 2 | 5 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 | 5 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | DecoderOpen Vocabulary Panoptic Segmentation | CodeCode Available | 2 | 5 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance SegmentationSalient Object Detection | CodeCode Available | 2 | 5 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 | 5 |