SOTAVerified

Semantic Segmentation

Papers

Showing 401450 of 14763 papers

TitleStatusHype
Learning Semantic-Aware Knowledge Guidance for Low-Light Image EnhancementCode2
Learning Semantic Segmentation of Large-Scale Point Clouds with Random SamplingCode2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image SegmentationCode2
Learn to Rectify the Bias of CLIP for Unsupervised Semantic SegmentationCode2
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3DCode2
Scalable Video Object Segmentation with Identification MechanismCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
Locality Alignment Improves Vision-Language ModelsCode2
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous ExplorationCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual TasksCode2
Make-A-Scene: Scene-Based Text-to-Image Generation with Human PriorsCode2
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic SegmentationCode2
DreamColour: Controllable Video Colour Editing without TrainingCode2
Diffusion models as plug-and-play priorsCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
An Empirical Study of Remote Sensing PretrainingCode2
Digital Twin Generation from Visual Data: A SurveyCode2
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and BenchmarkCode2
Masked Generative DistillationCode2
Mask-Free Video Instance SegmentationCode2
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image SegmentationCode2
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable DiffusionCode2
MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image SegmentationCode2
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image SegmentationCode2
Dilated Neighborhood Attention TransformerCode2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask InpaintingCode2
3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentationCode2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
1st Place Solution for PSG competition with ECCV'22 SenseHuman WorkshopCode2
BEVCar: Camera-Radar Fusion for BEV Map and Object SegmentationCode2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask DiffusionCode2
Merging Context Clustering with Visual State Space Models for Medical Image SegmentationCode2
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-LearningCode2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
MIC: Masked Image Consistency for Context-Enhanced Domain AdaptationCode2
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based TrainingCode2
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual LossCode2
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input FeaturesCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Model-Based Imitation Learning for Urban DrivingCode2
Beyond Self-attention: External Attention using Two Linear Layers for Visual TasksCode2
MOSE: A New Dataset for Video Object Segmentation in Complex ScenesCode2
Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image SegmentationCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion ModelsCode2
Densely Connected Parameter-Efficient Tuning for Referring Image SegmentationCode2
Show:102550
← PrevPage 9 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified