Semantic Segmentation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 14763 papers

Title	Date	Tasks	Status	Hype	Score
Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement	Apr 14, 2023	Image EnhancementLow-Light Image Enhancement	CodeCode Available	2	5
Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling	Jul 6, 2021	SegmentationSemantic Segmentation	CodeCode Available	2	5
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation	Apr 26, 2023	Domain AdaptationDomain Generalization	CodeCode Available	2	5
Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jan 1, 2024	SegmentationSemantic Segmentation	CodeCode Available	2	5
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D	Aug 13, 2020	Autonomous VehiclesBird's-Eye View Semantic Segmentation	CodeCode Available	2	5
Scalable Video Object Segmentation with Identification Mechanism	Mar 22, 2022	ObjectSegmentation	CodeCode Available	2	5
Distribution-Free, Risk-Controlling Prediction Sets	Jan 7, 2021	BIG-bench Machine LearningClassification	CodeCode Available	2	5
Locality Alignment Improves Vision-Language Models	Oct 14, 2024	Semantic SegmentationSpatial Reasoning	CodeCode Available	2	5
DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Mar 24, 2025	3D Semantic SegmentationLIDAR Semantic Segmentation	CodeCode Available	2	5
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jul 9, 2024	3D ReconstructionAutonomous Navigation	CodeCode Available	2	5
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data	May 16, 2024	Data AugmentationDiversity	CodeCode Available	2	5
LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks	Jan 17, 2025	Change DetectionImage Classification	CodeCode Available	2	5
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors	Mar 24, 2022	Image GenerationSemantic Segmentation	CodeCode Available	2	5
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation	Apr 4, 2025	Domain GeneralizationMamba	CodeCode Available	2	5
DreamColour: Controllable Video Colour Editing without Training	Dec 6, 2024	Instance SegmentationSemantic Segmentation	CodeCode Available	2	5
Diffusion models as plug-and-play priors	Jun 17, 2022	Combinatorial OptimizationDenoising	CodeCode Available	2	5
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation	Apr 25, 2024	Autonomous DrivingEvolutionary Algorithms	CodeCode Available	2	5
An Empirical Study of Remote Sensing Pretraining	Apr 6, 2022	Aerial Scene ClassificationBuilding change detection for remote sensing images	CodeCode Available	2	5
Digital Twin Generation from Visual Data: A Survey	Apr 17, 2025	Semantic SegmentationSurvey	CodeCode Available	2	5
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark	Feb 28, 2022	Image SegmentationInductive Bias	CodeCode Available	2	5
Masked Generative Distillation	May 3, 2022	image-classificationImage Classification	CodeCode Available	2	5
Mask-Free Video Instance Segmentation	Mar 28, 2023	Instance SegmentationOptical Flow Estimation	CodeCode Available	2	5
DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation	Jul 13, 2024	DenoisingImage Segmentation	CodeCode Available	2	5
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion	Aug 23, 2023	SegmentationSemantic Segmentation	CodeCode Available	2	5
MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation	Mar 29, 2024	Image SegmentationMedical Image Analysis	CodeCode Available	2	5
MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation	Sep 28, 2024	Image SegmentationMedical Image Analysis	CodeCode Available	2	5
Dilated Neighborhood Attention Transformer	Sep 29, 2022	Image ClassificationInstance Segmentation	CodeCode Available	2	5
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting	Jan 18, 2024	Instance SegmentationInteractive Segmentation	CodeCode Available	2	5
3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation	Jun 23, 2023	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
Benchmarking the Robustness of LiDAR Semantic Segmentation Models	Jan 3, 2023	Autonomous DrivingBenchmarking	CodeCode Available	2	5
1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop	Feb 6, 2023	Multi-class ClassificationPanoptic Segmentation	CodeCode Available	2	5
BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation	Mar 18, 2024	Decision MakingScene Segmentation	CodeCode Available	2	5
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion	Mar 9, 2025	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation	Jan 3, 2025	ClusteringImage Segmentation	CodeCode Available	2	5
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	May 14, 2025	Anomaly DetectionAnomaly Segmentation	CodeCode Available	2	5
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions	Aug 16, 2023	Motion Expressions Guided Video SegmentationObject	CodeCode Available	2	5
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation	Dec 2, 2022	Domain Adaptationimage-classification	CodeCode Available	2	5
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training	Aug 3, 2022	Instance SegmentationSegmentation	CodeCode Available	2	5
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss	Apr 2, 2024	image-classificationImage Classification	CodeCode Available	2	5
MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features	Sep 30, 2022	Image Classification	CodeCode Available	2	5
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation	Sep 18, 2023	3D geometryDecoder	CodeCode Available	2	5
Model-Based Imitation Learning for Urban Driving	Oct 14, 2022	3D geometryAutonomous Driving	CodeCode Available	2	5
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks	May 5, 2021	image-classificationImage Classification	CodeCode Available	2	5
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes	Feb 3, 2023	ObjectSegmentation	CodeCode Available	2	5
Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation	Aug 31, 2023	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception	Mar 15, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2	5
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model	Oct 22, 2024	DecoderInstance Segmentation	CodeCode Available	2	5
DreamLIP: Language-Image Pre-training with Long Captions	Mar 25, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	2	5
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models	Nov 25, 2024	DenoisingScene Understanding	CodeCode Available	2	5
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation	Jan 15, 2025	Image SegmentationReferring Expression Segmentation	CodeCode Available	2	5

Show:10 25 50

← PrevPage 9 of 296Next →

All datasets ADE20K NYU-Depth V2 Cityscapes test Cityscapes val ADE20K val PASCAL Context S3DIS Area5 PASCAL VOC 2012 test S3DIS ScanNet SUN-RGBD DensePASS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	InternImage-H (M3I Pre-training)	Params (M)	1,310	—	Unverified
2	ViT-P (InternImage-H)	Validation mIoU	63.6	—	Unverified
3	ONE-PEACE	Validation mIoU	63	—	Unverified
4	M3I Pre-training (InternImage-H)	Validation mIoU	62.9	—	Unverified
5	InternImage-H	Validation mIoU	62.9	—	Unverified
6	BEiT-3	Validation mIoU	62.8	—	Unverified
7	EVA	Validation mIoU	62.3	—	Unverified
8	ViT-P (OneFormer, InternImage-H)	Validation mIoU	61.6	—	Unverified
9	ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)	Validation mIoU	61.5	—	Unverified
10	FD-SwinV2-G	Validation mIoU	61.4	—	Unverified