SOTAVerified

Semantic Segmentation

Papers

Showing 351-400 of 14763 papers

Title | Status | Hype
LaSagnA: Language-based Segmentation Assistant for Complex Queries | Code | 2
ViM-UNet: Vision Mamba for Biomedical Segmentation | Code | 2
Multi-view Aggregation Network for Dichotomous Image Segmentation | Code | 2
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Code | 2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Code | 2
LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation | Code | 2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Code | 2
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Code | 2
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Code | 2
T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation | Code | 2
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Code | 2
MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Code | 2
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Code | 2
AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation | Code | 2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Code | 2
Generative Medical Segmentation | Code | 2
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding | Code | 2
Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Code | 2
Is Your LiDAR Placement Optimized for 3D Scene Understanding? | Code | 2
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels | Code | 2
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Code | 2
H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation | Code | 2
Diversified and Personalized Multi-rater Medical Image Segmentation | Code | 2
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation | Code | 2
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Code | 2
BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Code | 2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Code | 2
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Code | 2
VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation | Code | 2
Caltech Aerial RGB-Thermal Dataset in the Wild | Code | 2
LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation | Code | 2
Open-World Semantic Segmentation Including Class Similarity | Code | 2
SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM | Code | 2
Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Code | 2
FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Code | 2
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Code | 2
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Code | 2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Code | 2
Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Code | 2
PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Code | 2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Code | 2
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Code | 2
UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images | Code | 2
SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | Code | 2
BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Code | 2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Code | 2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Code | 2
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | InternImage-H | Validation mIoU | 62.9 | | Unverified
5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified