SOTAVerified

Semantic Segmentation

Papers

Showing 1851–1900 of 14,763 papers

Title | Status | Hype
Feature-Proxy Transformer for Few-Shot Segmentation | Code | 1
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition | Code | 1
A Robust Feature Downsampling Module for Remote Sensing Visual Tasks | Code | 1
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Code | 1
Feature Pyramid Network for Multi-Class Land Segmentation | Code | 1
Adapt Everywhere: Unsupervised Adaptation of Point-Clouds and Entropy Minimisation for Multi-modal Cardiac Image Segmentation | Code | 1
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration | Code | 1
Diverse Image Synthesis from Semantic Layouts via Conditional IMLE | Code | 1
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation | Code | 1
ComBiNet: Compact Convolutional Bayesian Neural Network for Image Segmentation | Code | 1
BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds | Code | 1
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Code | 1
DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Code | 1
Document-level Relation Extraction as Semantic Segmentation | Code | 1
Adversarial Continual Learning for Multi-Domain Hippocampal Segmentation | Code | 1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer | Code | 1
Learning Deformable Image Registration from Optimization: Perspective, Modules, Bilevel Training and Beyond | Code | 1
DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation | Code | 1
Ariadne's Thread: Using Text Prompts to Improve Segmentation of Infected Areas from Chest X-ray images | Code | 1
DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets | Code | 1
Collaborating Foundation Models for Domain Generalized Semantic Segmentation | Code | 1
Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation | Code | 1
iSegFormer: Interactive Segmentation via Transformers with Application to 3D Knee MR Images | Code | 1
ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes | Code | 1
Contrastive Masked Autoencoders are Stronger Vision Learners | Code | 1
Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions | Code | 1
CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation | Code | 1
Domain Adaptation of Echocardiography Segmentation Via Reinforcement Learning | Code | 1
A Multi-Task Deep Learning Framework for Building Footprint Segmentation | Code | 1
Is segmentation uncertainty useful? | Code | 1
CompNet: Complementary Segmentation Network for Brain MRI Extraction | Code | 1
Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segmentation | Code | 1
Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing | Code | 1
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision | Code | 1
A Multi-task Framework for Infrared Small Target Detection and Segmentation | Code | 1
Dual Prototype Attention for Unsupervised Video Object Segmentation | Code | 1
3D Spatial Recognition without Spatially Labeled 3D | Code | 1
Domain generalization of 3D semantic segmentation in autonomous driving | Code | 1
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1
Beyond One-to-One: Rethinking the Referring Image Segmentation | Code | 1
DOMINO: Domain-aware Model Calibration in Medical Image Segmentation | Code | 1
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis | Code | 1
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier | Code | 1
Beyond pixel-wise supervision for segmentation: A few global shape descriptors might be surprisingly good! | Code | 1
Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs | Code | 1
Do text-free diffusion models learn discriminative visual representations? | Code | 1
Coherent Reconstruction of Multiple Humans from a Single Image | Code | 1
JetSeg: Efficient Real-Time Semantic Segmentation Model for Low-Power GPU-Embedded Systems | Code | 1
Collaborative Video Object Segmentation by Foreground-Background Integration | Code | 1
Feature Alignment and Uniformity for Test Time Adaptation | Code | 1
Page 38 of 296

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | - | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | - | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | - | Unverified
4 | InternImage-H | Validation mIoU | 62.9 | - | Unverified
5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | - | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | - | Unverified
7 | EVA | Validation mIoU | 62.3 | - | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | - | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | - | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | - | Unverified