Semantic Segmentation

Papers


Showing 601–650 of 14763 papers

Title	Date	Tasks	Status	Hype
Deep Incubation: Training Large Models by Divide-and-Conquering	Dec 8, 2022	Image Segmentation, object-detection	Code Available	2
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization	May 16, 2022	graph partitioning, Segmentation	Code Available	2
Asymmetric Non-local Neural Networks for Semantic Segmentation	Aug 21, 2019	GPU, Segmentation	Code Available	2
Global Context Vision Transformers	Jun 20, 2022	image-classification, Image Classification	Code Available	2
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models	Oct 5, 2022	Out-of-Distribution Detection, Segmentation	Code Available	2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence	May 24, 2023	Dense Pixel Correspondence Estimation, Representation Learning	Code Available	2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs	Mar 28, 2024	Fine-Grained Image Classification, Image Classification	Code Available	2
Ambiguous Medical Image Segmentation using Diffusion Models	Apr 10, 2023	Diagnostic, Diversity	Code Available	2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras	Apr 3, 2024	3D Object Detection, Autonomous Driving	Code Available	2
HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation	Jul 3, 2024	Image Segmentation, Medical Image Segmentation	Code Available	2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	May 7, 2025	object-detection, Object Detection	Code Available	2
Decoupling Features in Hierarchical Propagation for Video Object Segmentation	Oct 18, 2022	Object, Semantic Segmentation	Code Available	2
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Sep 19, 2024	Scene Understanding, Semantic Segmentation	Code Available	2
HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation	Aug 21, 2024	Image Segmentation, Mamba	Code Available	2
DaViT: Dual Attention Vision Transformers	Apr 7, 2022	Computational Efficiency, Image Classification	Code Available	2
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention	Sep 4, 2023	Image Classification, Instance Segmentation	Code Available	2
DDP: Diffusion Model for Dense Visual Prediction	Mar 30, 2023	Denoising, Depth Estimation	Code Available	2
H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation	Mar 20, 2024	Image Segmentation, Lesion Segmentation	Code Available	2
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding	Nov 4, 2020	Multi-Task Learning, Scene Understanding	Code Available	2
IDRNet: Intervention-Driven Relation Network for Semantic Segmentation	Oct 16, 2023	Relation, Relation Network	Code Available	2
Audio-Visual Segmentation with Semantics	Jan 30, 2023	Segmentation, Semantic Segmentation	Code Available	2
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos	Nov 18, 2024	Pose Estimation, Semantic Segmentation	Code Available	2
Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation	Jan 9, 2024	Image Segmentation, Segmentation	Code Available	2
DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Feb 18, 2025	image-classification, Image Classification	Code Available	2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters	Jul 4, 2022	Autonomous Driving, Scene Segmentation	Code Available	2
A Unified Framework for 3D Scene Understanding	Jul 3, 2024	Contrastive Learning, Knowledge Distillation	Code Available	2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models	Aug 11, 2023	Dataset Generation, Decoder	Code Available	2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Aug 9, 2024	Image to text, Object	Code Available	2
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions	Sep 3, 2024	Autonomous Driving, Deep Attention	Code Available	2
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection	Mar 9, 2022	Co-Salient Object Detection, object-detection	Code Available	2
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration	Nov 23, 2022	object-detection, Object Detection	Code Available	2
Customized Segment Anything Model for Medical Image Segmentation	Apr 26, 2023	Decoder, Image Segmentation	Code Available	2
Dataset Quantization	Aug 21, 2023	Dataset Distillation, object-detection	Code Available	2
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding	Mar 15, 2022	Boundary Detection, Human Parsing	Code Available	2
KPConvX: Modernizing Kernel Point Convolution with Kernel Attention	May 21, 2024	3D Point Cloud Classification, Semantic Segmentation	Code Available	2
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts	Jul 2, 2024	Few-Shot Semantic Segmentation, Semantic Segmentation	Code Available	2
LambdaNetworks: Modeling Long-Range Interactions Without Attention	Feb 17, 2021	image-classification, Image Classification	Code Available	2
DeepGCNs: Making GCNs Go as Deep as CNNs	Oct 15, 2019	3D Point Cloud Classification, 3D Semantic Segmentation	Code Available	2
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation	Jul 15, 2025	Image Segmentation, Segmentation	Code Available	2
Boundary-Aware Segmentation Network for Mobile and Web Applications	Jan 12, 2021	Camouflaged Object Segmentation, Decoder	Code Available	2
Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers	Mar 5, 2022	Semantic Segmentation, Weakly supervised Semantic Segmentation	Code Available	2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping	Apr 9, 2024	Image Retrieval, Object	Code Available	2
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling	Jul 6, 2021	Segmentation, Semantic Segmentation	Code Available	2
Learning Vision from Models Rivals Learning Vision from Data	Dec 28, 2023	Contrastive Learning, Image Captioning	Code Available	2
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels	Mar 5, 2024	Pseudo Label, Semantic Segmentation	Code Available	2
Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jan 1, 2024	Segmentation, Semantic Segmentation	Code Available	2
Cross-Image Relational Knowledge Distillation for Semantic Segmentation	Apr 14, 2022	Knowledge Distillation, Segmentation	Code Available	2
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Oct 30, 2024	Domain Adaptation, Domain Generalization	Code Available	2
Locality Alignment Improves Vision-Language Models	Oct 14, 2024	Semantic Segmentation, Spatial Reasoning	Code Available	2
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention	Mar 13, 2023	image-classification, Image Classification	Code Available	2


Datasets: ADE20K, NYU-Depth V2, Cityscapes test, Cityscapes val, ADE20K val, PASCAL Context, S3DIS Area5, PASCAL VOC 2012 test, S3DIS, ScanNet, SUN-RGBD, DensePASS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	InternImage-H (M3I Pre-training)	Params (M)	1,310	—	Unverified
2	ViT-P (InternImage-H)	Validation mIoU	63.6	—	Unverified
3	ONE-PEACE	Validation mIoU	63	—	Unverified
4	InternImage-H	Validation mIoU	62.9	—	Unverified
5	M3I Pre-training (InternImage-H)	Validation mIoU	62.9	—	Unverified
6	BEiT-3	Validation mIoU	62.8	—	Unverified
7	EVA	Validation mIoU	62.3	—	Unverified
8	ViT-P (OneFormer, InternImage-H)	Validation mIoU	61.6	—	Unverified
9	ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)	Validation mIoU	61.5	—	Unverified
10	FD-SwinV2-G	Validation mIoU	61.4	—	Unverified