SOTAVerified

Semantic Segmentation

Papers

Showing 651700 of 14763 papers

TitleStatusHype
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene UnderstandingCode2
Learning What Not to Segment: A New Perspective on Few-Shot SegmentationCode2
Embedding Earth: Self-supervised contrastive pre-training for dense land cover classificationCode2
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object DetectionCode2
UNeXt: MLP-based Rapid Medical Image Segmentation NetworkCode2
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with TransformersCode2
E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance SegmentationCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
Cross Language Image Matching for Weakly Supervised Semantic SegmentationCode2
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with TransformersCode2
SoftGroup for 3D Instance Segmentation on Point CloudsCode2
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and BenchmarkCode2
FreeSOLO: Learning to Segment Objects without AnnotationsCode2
GroupViT: Semantic Segmentation Emerges from Text SupervisionCode2
Context Autoencoder for Self-Supervised Representation LearningCode2
TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical ImagesCode2
Deep Video Prior for Video Consistency and PropagationCode2
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention MechanismCode2
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionCode2
AiTLAS: Artificial Intelligence Toolbox for Earth ObservationCode2
Omnivore: A Single Model for Many Visual ModalitiesCode2
Language-driven Semantic SegmentationCode2
QuadTree Attention for Vision TransformersCode2
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI ImagesCode2
Vision Transformer with Deformable AttentionCode2
Language as Queries for Referring Video Object SegmentationCode2
C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
Mask2Former for Video Instance SegmentationCode2
Improving Image Restoration by Revisiting Global Information AggregationCode2
Masked-attention Mask Transformer for Universal Image SegmentationCode2
MetaFormer Is Actually What You Need for VisionCode2
Attention Mechanisms in Computer Vision: A SurveyCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and TrackingCode2
Open-World Entity SegmentationCode2
Per-Pixel Classification is Not All You Need for Semantic SegmentationCode2
Learning Semantic Segmentation of Large-Scale Point Clouds with Random SamplingCode2
Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene ImagesCode2
BEiT: BERT Pre-Training of Image TransformersCode2
Revisiting Contrastive Methods for Unsupervised Learning of Visual RepresentationsCode2
Beyond Self-attention: External Attention using Two Linear Layers for Visual TasksCode2
A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing ImagesCode2
Multi-Modal Fusion Transformer for End-to-End Autonomous DrivingCode2
Swin Transformer: Hierarchical Vision Transformer using Shifted WindowsCode2
Full Page Handwriting Recognition via Image to Sequence ExtractionCode2
Coordinate Attention for Efficient Mobile Network DesignCode2
LambdaNetworks: Modeling Long-Range Interactions Without AttentionCode2
TransUNet: Transformers Make Strong Encoders for Medical Image SegmentationCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
Boundary-Aware Segmentation Network for Mobile and Web ApplicationsCode2
Show:102550
← PrevPage 14 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified