SOTAVerified

Semantic Segmentation

Papers

Showing 33513400 of 14763 papers

TitleStatusHype
DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Indistinct-Boundary Object SegmentationCode1
In-N-Out Generative Learning for Dense Unsupervised Video SegmentationCode1
Algorithm-hardware Co-design for Deformable ConvolutionCode1
Local Patch Network with Global Attention for Infrared Small Target DetectionCode1
Active Boundary Loss for Semantic SegmentationCode1
DC-SAM: In-Context Segment Anything in Images and Videos via Dual ConsistencyCode1
DeepVoxNet2: Yet another CNN frameworkCode1
DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian SplattingCode1
Make One-Shot Video Object Segmentation Efficient AgainCode1
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video SegmentationCode1
DC-UNet: Rethinking the U-Net Architecture with Dual Channel Efficient CNN for Medical Images SegmentationCode1
DDANet: Dual Decoder Attention Network for Automatic Polyp SegmentationCode1
Instance Adaptive Self-Training for Unsupervised Domain AdaptationCode1
Instance As Identity: A Generic Online Paradigm for Video Instance SegmentationCode1
Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene ParsingCode1
MagicBathyNet: A Multimodal Remote Sensing Dataset for Bathymetry Prediction and Pixel-based Classification in Shallow WatersCode1
Deep Variational Instance SegmentationCode1
Instance Segmentation and Teeth Classification in Panoramic X-raysCode1
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and RecoveryCode1
InstanceFormer: An Online Video Instance Segmentation FrameworkCode1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing AttentionCode1
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
ScaleFormer: Revisiting the Transformer-based Backbones from a Scale-wise Perspective for Medical Image SegmentationCode1
UniDA3D: Unified Domain Adaptive 3D Semantic Segmentation PipelineCode1
Deformable ConvNets v2: More Deformable, Better ResultsCode1
Instance Segmentation for Autonomous Log Grasping in Forestry OperationsCode1
MALUNet: A Multi-Attention and Light-weight UNet for Skin Lesion SegmentationCode1
Decoder Denoising Pretraining for Semantic SegmentationCode1
BT-Unet: A self-supervised learning framework for biomedical image segmentation using Barlow Twins with U-Net modelsCode1
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree NetworksCode1
Decomposed Knowledge Distillation for Class-Incremental Semantic SegmentationCode1
Decomposing 3D Scenes into Objects via Unsupervised Volume SegmentationCode1
Automatic segmentation of spinal multiple sclerosis lesions: How to generalize across MRI contrasts?Code1
Instance Segmentation of Dense and Overlapping Objects via LayeringCode1
InstantDL-An easy-to-use deep learning pipeline for image segmentation and classificationCode1
Scan2LoD3: Reconstructing semantic 3D building models at LoD3 using ray casting and Bayesian networksCode1
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode1
MACU-Net for Semantic Segmentation of Fine-Resolution Remotely Sensed ImagesCode1
Decoupled Dynamic Filter NetworksCode1
Decoupled Local Aggregation for Point Cloud LearningCode1
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance SegmentationCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
Inter-slice Context Residual Learning for 3D Medical Image SegmentationCode1
Integrative Analysis for COVID-19 Patient Outcome PredictionCode1
DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual TasksCode1
DecoupleNet: Decoupled Network for Domain Adaptive Semantic SegmentationCode1
SCPNet: Semantic Scene Completion on Point CloudCode1
M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of ExpertsCode1
MADAN: Multi-source Adversarial Domain Aggregation Network for Domain AdaptationCode1
Show:102550
← PrevPage 68 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified