SOTAVerified

Semantic Segmentation

Papers

Showing 601650 of 14763 papers

TitleStatusHype
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNsCode2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsCode2
Global Context Vision TransformersCode2
ORFD: A Dataset and Benchmark for Off-Road Freespace DetectionCode2
Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy AutoencodersCode2
Diffusion models as plug-and-play priorsCode2
Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D ConvolutionsCode2
MobileOne: An Improved One millisecond Mobile BackboneCode2
PIDNet: A Real-time Semantic Segmentation Network Inspired by PID ControllersCode2
What Are Expected Queries in End-to-End Object Detection?Code2
You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure CorrectionCode2
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature DistillationCode2
Fast Vision Transformers with HiLo AttentionCode2
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and LocalizationCode2
Surface Representation for Point CloudsCode2
ConvMAE: Masked Convolution Meets Masked AutoencodersCode2
Neural 3D Scene Reconstruction with the Manhattan-world AssumptionCode2
Masked Generative DistillationCode2
Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and AlgorithmsCode2
HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic SegmentationCode2
Understanding The Robustness in Vision TransformersCode2
Toward Fast, Flexible, and Robust Low-Light Image EnhancementCode2
RangeUDF: Semantic Surface Reconstruction from 3D Point CloudsCode2
Temporally Efficient Vision Transformer for Video Instance SegmentationCode2
VSA: Learning Varied-Size Window Attention in Vision TransformersCode2
Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic SegmentationCode2
ResT V2: Simpler, Faster and StrongerCode2
Neighborhood Attention TransformerCode2
Cross-Image Relational Knowledge Distillation for Semantic SegmentationCode2
TopFormer: Token Pyramid Transformer for Mobile Semantic SegmentationCode2
Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface ModelCode2
DaViT: Dual Attention Vision TransformersCode2
An Empirical Study of Remote Sensing PretrainingCode2
FocalClick: Towards Practical Interactive Image SegmentationCode2
Region Rebalance for Long-Tailed Semantic SegmentationCode2
MultiMAE: Multi-modal Multi-task Masked AutoencodersCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object DetectionCode2
Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene SegmentationCode2
Rethinking Semantic Segmentation: A Prototype ViewCode2
Stratified Transformer for 3D Point Cloud SegmentationCode2
Deep Hierarchical Semantic SegmentationCode2
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
Make-A-Scene: Scene-Based Text-to-Image Generation with Human PriorsCode2
Sparse Instance Activation for Real-Time Instance SegmentationCode2
Scalable Video Object Segmentation with Identification MechanismCode2
Focal Modulation NetworksCode2
Scribble-Supervised LiDAR Semantic SegmentationCode2
Unsupervised Semantic Segmentation by Distilling Feature CorrespondencesCode2
Show:102550
← PrevPage 13 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified