SOTAVerified

Semantic Segmentation

Papers

Showing 501–550 of 14763 papers

Title | Status | Hype
Video Object Segmentation in Panoptic Wild Scenes | Code | 2
OctFormer: Octree-based Transformers for 3D Point Clouds | Code | 2
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | Code | 2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding | Code | 2
Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | Code | 2
Customized Segment Anything Model for Medical Image Segmentation | Code | 2
EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Code | 2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Code | 2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Code | 2
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding | Code | 2
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Code | 2
Unifying and Personalizing Weakly-supervised Federated Medical Image Segmentation via Adaptive Representation and Aggregation | Code | 2
SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAM | Code | 2
UniverSeg: Universal Medical Image Segmentation | Code | 2
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction | Code | 2
Ambiguous Medical Image Segmentation using Diffusion Models | Code | 2
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition | Code | 2
CherryPicker: Semantic Skeletonization and Topological Reconstruction of Cherry Trees | Code | 2
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding | Code | 2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Code | 2
DDP: Diffusion Model for Dense Visual Prediction | Code | 2
Mask-Free Video Instance Segmentation | Code | 2
Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching | Code | 2
Vision Transformer with Quadrangle Attention | Code | 2
You Only Segment Once: Towards Real-Time Panoptic Segmentation | Code | 2
Spherical Transformer for LiDAR-based 3D Recognition | Code | 2
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Code | 2
Generative Semantic Segmentation | Code | 2
M^2SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation | Code | 2
Towards Diverse Binary Segmentation via A Simple yet General Gated Network | Code | 2
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Code | 2
Large Selective Kernel Network for Remote Sensing Object Detection | Code | 2
BiFormer: Vision Transformer with Bi-Level Routing Attention | Code | 2
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Code | 2
FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Code | 2
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Code | 2
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Code | 2
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Code | 2
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern Analysis | Code | 2
Unleashing Text-to-Image Diffusion Models for Visual Perception | Code | 2
Delivering Arbitrary-Modal Semantic Segmentation | Code | 2
Side Adapter Network for Open-Vocabulary Semantic Segmentation | Code | 2
1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Code | 2
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Code | 2
Audio-Visual Segmentation with Semantics | Code | 2
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Code | 2
ViTs for SITS: Vision Transformers for Satellite Image Time Series | Code | 2
Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Code | 2
XNet: Wavelet-Based Low and High Frequency Fusion Networks for Fully- and Semi-Supervised Semantic Segmentation of Biomedical Images | Code | 2
Reversible Column Networks | Code | 2
Page 11 of 296

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
5 | InternImage-H | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified