SOTAVerified

Semantic Segmentation

Papers

Showing 601650 of 14763 papers

TitleStatusHype
Global Context Vision TransformersCode2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and LocalizationCode2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNsCode2
Deep Snake for Real-Time Instance SegmentationCode2
GroupViT: Semantic Segmentation Emerges from Text SupervisionCode2
Deep Video Prior for Video Consistency and PropagationCode2
Densely Connected Parameter-Efficient Tuning for Referring Image SegmentationCode2
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian SplattingCode2
HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image SegmentationCode2
Deep Hierarchical Semantic SegmentationCode2
BlenderProcCode2
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic SegmentationCode2
Hulk: A Universal Knowledge Translator for Human-Centric TasksCode2
DeepGCNs: Making GCNs Go as Deep as CNNsCode2
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene UnderstandingCode2
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future DirectionsCode2
iFormer: Integrating ConvNet and Transformer for Mobile ApplicationCode2
Decoupling Features in Hierarchical Propagation for Video Object SegmentationCode2
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial ImageryCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable FiltersCode2
Deep Covariance Alignment for Domain Adaptive Remote Sensing Image SegmentationCode2
Deep Incubation: Training Large Models by Divide-and-ConqueringCode2
DaViT: Dual Attention Vision TransformersCode2
DAT++: Spatially Dynamic Vision Transformer with Deformable AttentionCode2
Advancing Plain Vision Transformer Towards Remote Sensing Foundation ModelCode2
DDP: Diffusion Model for Dense Visual PredictionCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene UnderstandingCode2
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual PromptsCode2
Label Efficient Visual Abstractions for Autonomous DrivingCode2
Dataset QuantizationCode2
An Empirical Study of Remote Sensing PretrainingCode2
Language-driven Semantic SegmentationCode2
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion ModelsCode2
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image SegmentationCode2
LaSagnA: Language-based Segmentation Assistant for Complex QueriesCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
MobileOne: An Improved One millisecond Mobile BackboneCode2
An Image is Worth 16x16 Words: Transformers for Image Recognition at ScaleCode2
Learning What Not to Segment: A New Perspective on Few-Shot SegmentationCode2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionCode2
LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image SegmentationCode2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask DiffusionCode2
Adversarial Supervision Makes Layout-to-Image Diffusion Models ThriveCode2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded ScenesCode2
Cross Language Image Matching for Weakly Supervised Semantic SegmentationCode2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous ExplorationCode2
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT ImagesCode2
Show:102550
← PrevPage 13 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified