SOTAVerified

Semantic Segmentation

Papers

Showing 5851-5900 of 14763 papers

| Title | Status | Hype |
| --- | --- | --- |
| Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation | | 0 |
| Radio astronomical images object detection and segmentation: A benchmark on deep learning methods | | 0 |
| SEMv2: Table Separation Line Detection Based on Instance Segmentation | Code | 1 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Code | 2 |
| Rethinking the editing of generative adversarial networks: a method to estimate editing vectors based on dimension reduction | | 0 |
| MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors | | 0 |
| Parsing Line Segments of Floor Plan Images Using Graph Neural Networks | | 0 |
| FFT-based Dynamic Token Mixer for Vision | Code | 1 |
| InsMOS: Instance-Aware Moving Object Segmentation in LiDAR Data | Code | 1 |
| F2BEV: Bird's Eye View Generation from Surround-View Fisheye Camera Images for Automated Driving | Code | 1 |
| Adaptive Texture Filtering for Single-Domain Generalized Segmentation | Code | 0 |
| Traffic Scene Parsing through the TSP6K Dataset | Code | 1 |
| UniHCP: A Unified Model for Human-Centric Perceptions | Code | 1 |
| CANet: Context aware network with dual-stream pyramid for medical image segmentation | Code | 0 |
| IDA: Informed Domain Adaptive Semantic Segmentation | | 0 |
| Exploiting Implicit Rigidity Constraints via Weight-Sharing Aggregation for Scene Flow Estimation from Point Clouds | Code | 0 |
| Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern Analysis | Code | 2 |
| Exploit CAM by itself: Complementary Learning System for Weakly Supervised Semantic Segmentation | | 0 |
| BayeSeg: Bayesian Modeling for Medical Image Segmentation with Interpretable Generalizability | | 0 |
| Generalized Semantic Segmentation by Self-Supervised Source Domain Projection and Multi-Level Contrastive Learning | Code | 0 |
| Collaborative Learning with a Drone Orchestrator | Code | 0 |
| Depth-based 6DoF Object Pose Estimation using Swin Transformer | Code | 1 |
| EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularization | Code | 1 |
| Unleashing Text-to-Image Diffusion Models for Visual Perception | Code | 2 |
| Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving | | 0 |
| X^3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection | | 0 |
| Token Contrast for Weakly-Supervised Semantic Segmentation | Code | 1 |
| Transmission-Guided Bayesian Generative Model for Smoke Segmentation | Code | 1 |
| Multi-Source Soft Pseudo-Label Learning with Domain Similarity-based Weighting for Semantic Segmentation | Code | 0 |
| Spatial Layout Consistency for 3D Semantic Segmentation | | 0 |
| Meta-information-aware Dual-path Transformer for Differential Diagnosis of Multi-type Pancreatic Lesions in Multi-phase CT | | 0 |
| Deep-NFA: a Deep a contrario Framework for Small Object Detection | | 0 |
| Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation | Code | 1 |
| Bayesian Deep Learning for Affordance Segmentation in images | | 0 |
| Delivering Arbitrary-Modal Semantic Segmentation | Code | 2 |
| DAN-NucNet: A dual attention based framework for nuclei segmentation in cancer histology images under wild clinical conditions | Code | 0 |
| ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution | Code | 1 |
| DMSA: Dynamic Multi-scale Unsupervised Semantic Segmentation Based on Adaptive Affinity | Code | 0 |
| Leveraging SO(3)-steerable convolutions for pose-robust semantic segmentation in 3D medical data | Code | 1 |
| BiSVP: Building Footprint Extraction via Bidirectional Serialized Vertex Prediction | | 0 |
| Applying Plain Transformers to Real-World Point Clouds | | 0 |
| Kartezio: Evolutionary Design of Explainable Pipelines for Biomedical Image Analysis | Code | 1 |
| Efficient Masked Autoencoders with Self-Consistency | | 0 |
| Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger | | 0 |
| FPCD: An Open Aerial VHR Dataset for Farm Pond Change Detection | | 0 |
| Interactive Segmentation as Gaussian Process Classification | Code | 1 |
| Generic-to-Specific Distillation of Masked Autoencoders | Code | 1 |
| Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation | | 0 |
| Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors | | 0 |
| Towards Surgical Context Inference and Translation to Gestures | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 5 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |