SOTAVerified

Semantic Segmentation

Papers

Showing 251–275 of 14,763 papers

| Title | Status | Hype |
| --- | --- | --- |
| ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Code | 2 |
| WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Code | 2 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Code | 2 |
| Locality Alignment Improves Vision-Language Models | Code | 2 |
| Text4Seg: Reimagining Image Segmentation as Text Generation | Code | 2 |
| Towards Natural Image Matting in the Wild via Real-Scenario Prior | Code | 2 |
| MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model | Code | 2 |
| A Simple Image Segmentation Framework via In-Context Examples | Code | 2 |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Code | 2 |
| MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Code | 2 |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Code | 2 |
| EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Code | 2 |
| Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Code | 2 |
| PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Code | 2 |
| Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Code | 2 |
| One missing piece in Vision and Language: A Survey on Comics Understanding | Code | 2 |
| RevSAM2: Prompt SAM2 for Medical Image Segmentation via Reverse-Propagation without Fine-tuning | Code | 2 |
| PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Code | 2 |
| Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | Code | 2 |
| MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation | Code | 2 |
| AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Code | 2 |
| Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Code | 2 |
| Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation | Code | 2 |
| TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Code | 2 |
| MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Code | 2 |
Page 11 of 591

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 5 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |