SOTAVerified

Semantic Segmentation

Papers

Showing 3751-3800 of 14763 papers

| Title | Status | Hype |
|---|---|---|
| Exploring Cross-Image Pixel Contrast for Semantic Segmentation | Code | 1 |
| Exploring Consistency in Cross-Domain Transformer for Domain Adaptive Semantic Segmentation | Code | 1 |
| Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts | Code | 1 |
| Exploring Effective Factors for Improving Visual In-Context Learning | Code | 1 |
| Detection and Segmentation of Custom Objects using High Distraction Photorealistic Synthetic Data | Code | 1 |
| Exploiting Diffusion Prior for Generalizable Dense Prediction | Code | 1 |
| SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation | Code | 1 |
| Explicitly incorporating spatial information to recurrent networks for agriculture | Code | 1 |
| Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection | Code | 1 |
| DensePASS: Dense Panoramic Semantic Segmentation via Unsupervised Domain Adaptation with Attention-Augmented Context Exchange | Code | 1 |
| Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution | Code | 1 |
| Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation | Code | 1 |
| BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation | Code | 1 |
| Synthetic Data for Robust Stroke Segmentation | Code | 1 |
| Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation | Code | 1 |
| Tackling Catastrophic Forgetting and Background Shift in Continual Semantic Segmentation | Code | 1 |
| Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks | Code | 1 |
| TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation | Code | 1 |
| Dense Siamese Network for Dense Unsupervised Learning | Code | 1 |
| TagLab: A human-centric AI system for interactive semantic segmentation | Code | 1 |
| Exploiting Temporal State Space Sharing for Video Semantic Segmentation | Code | 1 |
| Target and Task specific Source-Free Domain Adaptive Image Segmentation | Code | 1 |
| Dense Unsupervised Learning for Video Segmentation | Code | 1 |
| TAROT: Targeted Data Selection via Optimal Transport | Code | 1 |
| BoxVIS: Video Instance Segmentation with Box Annotations | Code | 1 |
| BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation | Code | 1 |
| Teachers in concordance for pseudo-labeling of 3D sequential data | Code | 1 |
| Teach me to segment with mixed supervision: Confident students become masters | Code | 1 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1 |
| Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving | Code | 1 |
| An Energy and GPU-Computation Efficient Backbone Network for Real-Time Object Detection | Code | 1 |
| Temporally Distributed Networks for Fast Video Semantic Segmentation | Code | 1 |
| Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning | Code | 1 |
| Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images | Code | 1 |
| Depth-Assisted ResiDualGAN for Cross-Domain Aerial Images Semantic Segmentation | Code | 1 |
| TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation | Code | 1 |
| Explain Any Concept: Segment Anything Meets Concept-Based Explanation | Code | 1 |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Code | 1 |
| Depth-based 6DoF Object Pose Estimation using Swin Transformer | Code | 1 |
| Depth Based Semantic Scene Completion with Position Importance Aware Loss | Code | 1 |
| Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation | Code | 1 |
| Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion | Code | 1 |
| EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation | Code | 1 |
| Test-Time Generative Augmentation for Medical Image Segmentation | Code | 1 |
| TextBraTS: Text-Guided Volumetric Brain Tumor Segmentation with Innovative Dataset Development and Fusion Module Exploration | Code | 1 |
| Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Code | 1 |
| BoxSnake: Polygonal Instance Segmentation with Box Supervision | Code | 1 |
| Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Code | 1 |
| 4D Unsupervised Object Discovery | Code | 1 |
| EViT-Unet: U-Net Like Efficient Vision Transformer for Medical Image Segmentation on Mobile and Edge Devices | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |