| ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Oct 17, 2024 | 3D Semantic SegmentationImage Generation | CodeCode Available | 2 |
| WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Oct 15, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Oct 14, 2024 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 |
| Locality Alignment Improves Vision-Language Models | Oct 14, 2024 | Semantic SegmentationSpatial Reasoning | CodeCode Available | 2 |
| Text4Seg: Reimagining Image Segmentation as Text Generation | Oct 13, 2024 | Image SegmentationReferring Expression | CodeCode Available | 2 |
| Towards Natural Image Matting in the Wild via Real-Scenario Prior | Oct 9, 2024 | DecoderImage Matting | CodeCode Available | 2 |
| MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model | Oct 8, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| A Simple Image Segmentation Framework via In-Context Examples | Oct 7, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Sep 29, 2024 | AllImage Segmentation | CodeCode Available | 2 |
| MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Sep 28, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Sep 26, 2024 | Image SegmentationNavigate | CodeCode Available | 2 |
| EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Sep 26, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Sep 24, 2024 | DiversityInstance Segmentation | CodeCode Available | 2 |
| PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Sep 20, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Sep 19, 2024 | Scene UnderstandingSemantic Segmentation | CodeCode Available | 2 |
| One missing piece in Vision and Language: A Survey on Comics Understanding | Sep 14, 2024 | document understandingimage-classification | CodeCode Available | 2 |
| RevSAM2: Prompt SAM2 for Medical Image Segmentation via Reverse-Propagation without Fine-tuning | Sep 6, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Sep 6, 2024 | Benchmarkingimage-classification | CodeCode Available | 2 |
| Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | Sep 4, 2024 | Crack SegmentationDecoder | CodeCode Available | 2 |
| MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation | Sep 4, 2024 | Image SegmentationLesion Segmentation | CodeCode Available | 2 |
| AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Sep 3, 2024 | Autonomous DrivingDeep Attention | CodeCode Available | 2 |
| Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Aug 30, 2024 | Deep LearningImage Segmentation | CodeCode Available | 2 |
| Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation | Aug 28, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Aug 25, 2024 | Autonomous DrivingDenoising | CodeCode Available | 2 |
| MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Aug 25, 2024 | Image SegmentationMamba | CodeCode Available | 2 |