SOTAVerified

Semantic Segmentation

Papers

Showing 301350 of 14763 papers

TitleStatusHype
FastInst: A Simple Query-Based Model for Real-Time Instance SegmentationCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise OptimizationCode2
Exploring Color Invariance through Image-Level Ensemble LearningCode2
Fast Vision Transformers with HiLo AttentionCode2
Embedding Earth: Self-supervised contrastive pre-training for dense land cover classificationCode2
Efficient Video Object Segmentation via Modulated Cross-Attention MemoryCode2
EGE-UNet: an Efficient Group Enhanced UNet for skin lesion segmentationCode2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic SegmentationCode2
MogaNet: Multi-order Gated Aggregation NetworkCode2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-SupervisionCode2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsCode2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural NetworksCode2
Efficient 3D Semantic Segmentation with Superpoint TransformerCode2
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency AdaptationCode2
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object SegmentationCode2
DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic EnvironmentsCode2
EasyPortrait -- Face Parsing and Portrait Segmentation DatasetCode2
DreamColour: Controllable Video Colour Editing without TrainingCode2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image SegmentationCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
Does Image Anonymization Impact Computer Vision Training?Code2
Domain Adaptation with a Single Vision-Language EmbeddingCode2
DSNet: A Novel Way to Use Atrous Convolutions in Semantic SegmentationCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationCode2
E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance SegmentationCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
ARKit LabelMaker: A New Scale for Indoor 3D Scene UnderstandingCode2
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?Code2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt TuningCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
Diversified and Personalized Multi-rater Medical Image SegmentationCode2
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
ASAM: Boosting Segment Anything Model with Adversarial TuningCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale DatasetCode2
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic SegmentationCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
Generalized Parametric Contrastive LearningCode2
Advancing Plain Vision Transformer Towards Remote Sensing Foundation ModelCode2
EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image SegmentationCode2
ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image SegmentationCode2
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic SegmentationCode2
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image SegmentationCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable DiffusionCode2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask DiffusionCode2
Show:102550
← PrevPage 7 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified