SOTAVerified

Semantic Segmentation

Papers

Showing 351–400 of 14,763 papers

Title | Status | Hype
Diversified and Personalized Multi-rater Medical Image Segmentation | Code | 2
Does Image Anonymization Impact Computer Vision Training? | Code | 2
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Code | 2
Feature Pyramid Networks for Object Detection | Code | 2
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Code | 2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Code | 2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Code | 2
FocalClick: Towards Practical Interactive Image Segmentation | Code | 2
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds | Code | 2
Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Code | 2
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Code | 2
RevSAM2: Prompt SAM2 for Medical Image Segmentation via Reverse-Propagation without Fine-tuning | Code | 2
Attention Mechanisms in Computer Vision: A Survey | Code | 2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Code | 2
Domain Adaptation with a Single Vision-Language Embedding | Code | 2
Generative Medical Segmentation | Code | 2
GLaMM: Pixel Grounding Large Multimodal Model | Code | 2
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Code | 2
Audio-Visual Segmentation with Semantics | Code | 2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
Golden Cudgel Network for Real-Time Semantic Segmentation | Code | 2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Code | 2
Fast Vision Transformers with HiLo Attention | Code | 2
Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Code | 2
Hierarchical Multi-Scale Attention for Semantic Segmentation | Code | 2
Augmented Object Intelligence with XR-Objects | Code | 2
AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation | Code | 2
A Unified Framework for 3D Scene Understanding | Code | 2
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model | Code | 2
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Code | 2
Diffusion models as plug-and-play priors | Code | 2
Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure | Code | 2
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding | Code | 2
IDRNet: Intervention-Driven Relation Network for Semantic Segmentation | Code | 2
Digital Twin Generation from Visual Data: A Survey | Code | 2
Image Segmentation in Foundation Model Era: A Survey | Code | 2
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Code | 2
AiTLAS: Artificial Intelligence Toolbox for Earth Observation | Code | 2
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Code | 2
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Code | 2
A large annotated medical image dataset for the development and evaluation of segmentation algorithms | Code | 2
Dilated Neighborhood Attention Transformer | Code | 2
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Code | 2
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Code | 2
Interlaced Sparse Self-Attention for Semantic Segmentation | Code | 2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Code | 2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Code | 2
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Code | 2
Page 8 of 296

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
5 | InternImage-H | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified