SOTAVerified

Semantic Segmentation

Papers

Showing 24512500 of 14763 papers

TitleStatusHype
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic SegmentationCode2
Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes0
Multimodal Learning With Intraoperative CBCT & Variably Aligned Preoperative CT Data To Improve Segmentation0
OoDIS: Anomaly Instance Segmentation BenchmarkCode1
Visually Consistent Hierarchical Image Classification0
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding0
SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic SegmentationCode0
Boosting Medical Image Classification with Segmentation Foundation Model0
Benchmarking Label Noise in Instance Segmentation: Spatial Noise MattersCode0
ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything ModelCode1
PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing ImageryCode5
α-OCC: Uncertainty-Aware Camera-based 3D Semantic Occupancy Prediction0
Microscopy Image Dataset for Deep Learning-Based Quantitative Assessment of Pulmonary Vascular Changes0
Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation0
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR0
A Late-Stage Bitemporal Feature Fusion Network for Semantic Change DetectionCode0
MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor PerceptionCode0
The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences0
Enhancing Incomplete Multi-modal Brain Tumor Segmentation with Intra-modal Asymmetry and Inter-modal DependencyCode0
Open-Vocabulary Semantic Segmentation with Image Embedding BalancingCode1
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision TransformersCode1
D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video0
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain AdaptationCode1
Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather ConditionsCode1
RobustSAM: Segment Anything Robustly on Degraded ImagesCode3
DenoiseRep: Denoising Model for Representation LearningCode1
4M-21: An Any-to-Any Vision Model for Tens of Tasks and ModalitiesCode5
Instance-level quantitative saliency in multiple sclerosis lesion segmentationCode0
A Labeled Array Distance Metric for Measuring Image Segmentation Quality0
Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model0
Dataset Enhancement with Instance-Level AugmentationsCode1
SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image SegmentationCode0
APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation0
A^2-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder0
GRU-Net: Gaussian Attention Aided Dense Skip Connection Based MultiResUNet for Breast Histopathology Image SegmentationCode0
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation0
Real2Code: Reconstruct Articulated Objects via Code Generation0
RMem: Restricted Memory Banks Improve Video Object Segmentation0
Small Scale Data-Free Knowledge DistillationCode1
OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained UnderstandingCode1
Spatial-Frequency Dual Progressive Attention Network For Medical Image SegmentationCode1
A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image FusionCode0
Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos0
LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection0
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph0
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving0
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video SegmentationCode1
Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach0
Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ?Code0
UVIS: Unsupervised Video Instance Segmentation0
Show:102550
← PrevPage 50 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified