SOTAVerified

Semantic Segmentation

Papers

Showing 151-200 of 14763 papers

Title | Status | Hype
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model | Code | 3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation | Code | 3
Anything-3D: Towards Single-view Anything Reconstruction in the Wild | Code | 3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | Code | 3
Generalized Decoding for Pixel, Image, and Language | Code | 3
Nuclei instance segmentation and classification in histopathology images with StarDist | Code | 3
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Code | 3
FDA: Fourier Domain Adaptation for Semantic Segmentation | Code | 3
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation | Code | 3
Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Code | 3
Transformers in Medical Imaging: A Survey | Code | 3
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies | Code | 3
Point Transformer V3: Simpler, Faster, Stronger | Code | 3
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model | Code | 3
Quantifying the robustness of deep multispectral segmentation models against natural perturbations and data poisoning | Code | 3
RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Code | 3
MogaNet: Multi-order Gated Aggregation Network | Code | 2
Ambiguous Medical Image Segmentation using Diffusion Models | Code | 2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Code | 2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation | Code | 2
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | Code | 2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Code | 2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Code | 2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Code | 2
EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Code | 2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Code | 2
Efficient 3D Semantic Segmentation with Superpoint Transformer | Code | 2
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Code | 2
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Code | 2
E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation | Code | 2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Code | 2
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation | Code | 2
DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Code | 2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Code | 2
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | Code | 2
DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Code | 2
DreamColour: Controllable Video Colour Editing without Training | Code | 2
Adapter is All You Need for Tuning Visual Tasks | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Code | 2
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Code | 2
Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Code | 2
Diversified and Personalized Multi-rater Medical Image Segmentation | Code | 2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Code | 2
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Code | 2
1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Code | 2
Distribution-Free, Risk-Controlling Prediction Sets | Code | 2
Does Image Anonymization Impact Computer Vision Training? | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
5 | InternImage-H | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified