SOTAVerified

Semantic Segmentation

Papers

Showing 926950 of 14763 papers

TitleStatusHype
MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating TrainingCode1
Revisiting the Integration of Convolution and Attention for Vision BackboneCode1
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic SegmentationCode1
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic SegmentationCode1
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural EnhancementsCode1
RETR: Multi-View Radar Detection Transformer for Indoor PerceptionCode1
OneNet: A Channel-Wise 1D Convolutional U-NetCode1
Fast and Efficient Transformer-based Method for Bird's Eye View Instance PredictionCode1
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantificationCode1
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark DatasetCode1
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution ShiftsCode1
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression PerspectiveCode1
Automated Classification of Cell Shapes: A Comparative Evaluation of Shape DescriptorsCode1
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image SegmentationCode1
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion ModelCode1
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered ScenesCode1
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic SegmentationCode1
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic SegmentationCode1
IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream TasksCode1
Unlocking Comics: The AI4VA Dataset for Visual UnderstandingCode1
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic SegmentationCode1
Context-Based Visual-Language Place RecognitionCode1
Gaze-Assisted Medical Image SegmentationCode1
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context PromptingCode1
Show:102550
← PrevPage 38 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified