SOTAVerified

Semantic Segmentation

Papers

Showing 251–300 of 14763 papers

Title | Status | Hype
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Code | 2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Code | 2
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Code | 2
Locality Alignment Improves Vision-Language Models | Code | 2
Text4Seg: Reimagining Image Segmentation as Text Generation | Code | 2
Towards Natural Image Matting in the Wild via Real-Scenario Prior | Code | 2
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model | Code | 2
A Simple Image Segmentation Framework via In-Context Examples | Code | 2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Code | 2
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Code | 2
Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Code | 2
EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation | Code | 2
Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Code | 2
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Code | 2
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Code | 2
One missing piece in Vision and Language: A Survey on Comics Understanding | Code | 2
RevSAM2: Prompt SAM2 for Medical Image Segmentation via Reverse-Propagation without Fine-tuning | Code | 2
PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Code | 2
Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | Code | 2
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation | Code | 2
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Code | 2
Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Code | 2
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation | Code | 2
MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Code | 2
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Code | 2
Image Segmentation in Foundation Model Era: A Survey | Code | 2
HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation | Code | 2
UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Code | 2
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Code | 2
Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration | Code | 2
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Code | 2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Code | 2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Code | 2
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Code | 2
DaCapo: a modular deep learning framework for scalable 3D image segmentation | Code | 2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Code | 2
MSA^2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation | Code | 2
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Code | 2
ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation | Code | 2
GroupMamba: Efficient Group-Based Visual State Space Model | Code | 2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Code | 2
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Code | 2
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Code | 2
IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Code | 2
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift | Code | 2
Exploiting Scale-Variant Attention for Segmenting Small Medical Objects | Code | 2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Code | 2
Training-free CryoET Tomogram Segmentation | Code | 2
Page 6 of 296

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
5 | InternImage-H | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified