Semantic Segmentation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 14763 papers

Title	Date	Tasks	Status	Hype
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation	Dec 17, 2024	Contrastive LearningImage Segmentation	CodeCode Available	1
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation	Dec 17, 2024	Instance SegmentationSegmentation	CodeCode Available	1
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation	Dec 16, 2024	DiversitySemantic Segmentation	CodeCode Available	1
MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation	Dec 16, 2024	Image SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	1
MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation	Dec 15, 2024	Semantic SegmentationWeakly supervised Semantic Segmentation	CodeCode Available	1
DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting	Dec 14, 2024	3D ReconstructionSegmentation	CodeCode Available	1
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone	Dec 14, 2024	image-classificationImage Classification	CodeCode Available	1
Towards Open-Vocabulary Video Semantic Segmentation	Dec 12, 2024	SegmentationSemantic Segmentation	CodeCode Available	1
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning	Dec 11, 2024	AttributeBenchmarking	CodeCode Available	1
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation	Dec 11, 2024	DecoderGPU	CodeCode Available	1
Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation	Dec 9, 2024	Domain AdaptationImage Segmentation	CodeCode Available	1
MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day	Dec 8, 2024	GPUImage Segmentation	CodeCode Available	1
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts	Dec 7, 2024	Change DetectionImage Comprehension	CodeCode Available	1
MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities	Dec 4, 2024	Image GenerationImage Segmentation	CodeCode Available	1
Active Negative Loss: A Robust Framework for Learning with Noisy Labels	Dec 3, 2024	Image SegmentationLearning with noisy labels	CodeCode Available	1
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	Dec 2, 2024	Self-Supervised LearningSemantic Segmentation	CodeCode Available	1
Multi-Granularity Video Object Segmentation	Dec 2, 2024	ObjectSegmentation	CodeCode Available	1
Referring Video Object Segmentation via Language-aligned Track Selection	Dec 2, 2024	ObjectObject Tracking	CodeCode Available	1
SyncVIS: Synchronized Video Instance Segmentation	Dec 1, 2024	Instance SegmentationSegmentation	CodeCode Available	1
Token Cropr: Faster ViTs for Quite a Few Tasks	Dec 1, 2024	image-classificationImage Classification	CodeCode Available	1
TAROT: Targeted Data Selection via Optimal Transport	Nov 30, 2024	motion predictionSemantic Segmentation	CodeCode Available	1
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Nov 29, 2024	3D geometry3DGS	CodeCode Available	1
Deformable Mamba for Wide Field of View Segmentation	Nov 25, 2024	DecoderMamba	CodeCode Available	1
Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Nov 25, 2024	Instance SegmentationKnowledge Distillation	CodeCode Available	1
A SAM-guided and Match-based Semi-Supervised Segmentation Framework for Medical Imaging	Nov 25, 2024	Image SegmentationMedical Image Segmentation	CodeCode Available	1
MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training	Nov 23, 2024	Computed Tomography (CT)Image Segmentation	CodeCode Available	1
Revisiting the Integration of Convolution and Attention for Vision Backbone	Nov 21, 2024	Semantic SegmentationWeakly supervised Semantic Segmentation	CodeCode Available	1
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Nov 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Nov 20, 2024	3D geometry3D Semantic Segmentation	CodeCode Available	1
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	Nov 18, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1
RETR: Multi-View Radar Detection Transformer for Indoor Perception	Nov 15, 2024	Instance Segmentationobject-detection	CodeCode Available	1
OneNet: A Channel-Wise 1D Convolutional U-Net	Nov 14, 2024	DecoderImage Segmentation	CodeCode Available	1
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction	Nov 11, 2024	Autonomous VehiclesInstance Segmentation	CodeCode Available	1
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification	Nov 11, 2024	BenchmarkingImage Segmentation	CodeCode Available	1
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Nov 7, 2024	SegmentationSemantic Segmentation	CodeCode Available	1
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Nov 6, 2024	Domain GeneralizationOut of Distribution (OOD) Detection	CodeCode Available	1
LiVOS: Light Video Object Segmentation with Gated Linear Matching	Nov 5, 2024	GPUSemantic Segmentation	CodeCode Available	1
Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective	Nov 5, 2024	DecoderSegmentation	CodeCode Available	1
Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors	Nov 1, 2024	Instance SegmentationSemantic Segmentation	CodeCode Available	1
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation	Oct 31, 2024	Image SegmentationMamba	CodeCode Available	1
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Oct 31, 2024	Semantic SegmentationSpecificity	CodeCode Available	1
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Oct 31, 2024	SegmentationSemantic Segmentation	CodeCode Available	1
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation	Oct 29, 2024	Domain AdaptationPseudo Label	CodeCode Available	1
Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Oct 29, 2024	Cross-Domain Few-ShotFew-Shot Semantic Segmentation	CodeCode Available	1
IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Oct 28, 2024	Domain Adaptationobject-detection	CodeCode Available	1
Unlocking Comics: The AI4VA Dataset for Visual Understanding	Oct 27, 2024	Depth EstimationSaliency Detection	CodeCode Available	1
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation	Oct 25, 2024	3D Semantic SegmentationDomain Adaptation	CodeCode Available	1
Context-Based Visual-Language Place Recognition	Oct 25, 2024	Semantic SegmentationVisual Place Recognition	CodeCode Available	1
Gaze-Assisted Medical Image Segmentation	Oct 23, 2024	DiagnosticImage Segmentation	CodeCode Available	1
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting	Oct 23, 2024	Decision MakingMinecraft	CodeCode Available	1

Show:10 25 50

← PrevPage 19 of 296Next →

All datasets ADE20K NYU-Depth V2 Cityscapes test Cityscapes val ADE20K val PASCAL Context S3DIS Area5 PASCAL VOC 2012 test S3DIS ScanNet SUN-RGBD DensePASS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	InternImage-H (M3I Pre-training)	Params (M)	1,310	—	Unverified
2	ViT-P (InternImage-H)	Validation mIoU	63.6	—	Unverified
3	ONE-PEACE	Validation mIoU	63	—	Unverified
4	InternImage-H	Validation mIoU	62.9	—	Unverified
5	M3I Pre-training (InternImage-H)	Validation mIoU	62.9	—	Unverified
6	BEiT-3	Validation mIoU	62.8	—	Unverified
7	EVA	Validation mIoU	62.3	—	Unverified
8	ViT-P (OneFormer, InternImage-H)	Validation mIoU	61.6	—	Unverified
9	ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)	Validation mIoU	61.5	—	Unverified
10	FD-SwinV2-G	Validation mIoU	61.4	—	Unverified