Panoptic Segmentation

Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, like cars and people) are assigned unique instance IDs. ( Image credit: Detectron2 )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 462 papers

Title	Date	Tasks	Status	Hype
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology	Sep 21, 2024	BenchmarkingDepth Estimation	—Unverified	0
Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model	Sep 20, 2024	Image CaptioningPanoptic Segmentation	CodeCode Available	1
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding	Sep 19, 2024	Contrastive LearningPanoptic Segmentation	CodeCode Available	0
Panoptic-Depth Forecasting	Sep 18, 2024	Depth EstimationNavigate	—Unverified	0
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation	Sep 15, 2024	Image SegmentationInstance Segmentation	—Unverified	0
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding	Sep 12, 2024	Panoptic SegmentationSegmentation	—Unverified	0
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data	Sep 10, 2024	3D Plane Detection3d scene graph generation	—Unverified	0
A Simple and Generalist Approach for Panoptic Segmentation	Aug 29, 2024	Missing LabelsPanoptic Segmentation	—Unverified	0
DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Aug 28, 2024	DecoderInstance Segmentation	—Unverified	0
Image Segmentation in Foundation Model Era: A Survey	Aug 23, 2024	Image SegmentationInstance Segmentation	CodeCode Available	2
NuLite -- Lightweight and Fast Model for Nuclei Instance Segmentation and Classification	Aug 3, 2024	Cell DetectionCell Segmentation	CodeCode Available	1
LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels	Jul 25, 2024	Cell SegmentationDecoder	CodeCode Available	1
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation	Jul 23, 2024	Panoptic SegmentationSegmentation	—Unverified	0
Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model	Jul 19, 2024	Image GenerationInstance Segmentation	—Unverified	0
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation	Jul 19, 2024	Domain AdaptationPanoptic Segmentation	CodeCode Available	0
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models	Jul 15, 2024	Graph Generationobject-detection	CodeCode Available	1
A Fair Ranking and New Model for Panoptic Scene Graph Generation	Jul 12, 2024	Graph GenerationPanoptic Scene Graph Generation	CodeCode Available	1
From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation	Jul 12, 2024	Graph GenerationKnowledge Distillation	—Unverified	0
Panoptic Segmentation of Galactic Structures in LSB Images	Jul 10, 2024	Panoptic SegmentationSegmentation	—Unverified	0
Context-Aware Video Instance Segmentation	Jul 3, 2024	Instance SegmentationPanoptic Segmentation	CodeCode Available	2
Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation	Jul 1, 2024	Autonomous DrivingDecoder	CodeCode Available	1
Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks	Jul 1, 2024	Domain Adaptationimage-classification	—Unverified	0
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Jul 1, 2024	3D Panoptic SegmentationInstance Segmentation	—Unverified	0
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations	Jun 14, 2024	Panoptic SegmentationPart-aware Panoptic Segmentation	CodeCode Available	1
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Jun 11, 2024	3D Instance Segmentation3D Scene Reconstruction	—Unverified	0
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Jun 8, 2024	BenchmarkingInstance Segmentation	—Unverified	0
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation	Jun 6, 2024	Panoptic SegmentationSegmentation	—Unverified	0
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation	Jun 1, 2024	Autonomous DrivingPanoptic Segmentation	—Unverified	0
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	May 29, 2024	Autonomous DrivingBoundary Detection	CodeCode Available	1
4D Panoptic Scene Graph Generation	May 16, 2024	4D Panoptic SegmentationGraph Generation	CodeCode Available	3
An Integrated Framework for Multi-Granular Explanation of Video Summarization	May 16, 2024	BenchmarkingPanoptic Segmentation	CodeCode Available	0
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception	May 12, 2024	object-detectionObject Detection	CodeCode Available	1
Panoptic Segmentation and Labelling of Lumbar Spine Vertebrae using Modified Attention Unet	Apr 28, 2024	Panoptic SegmentationSegmentation	—Unverified	0
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies	Apr 15, 2024	Panoptic SegmentationRetrieval	—Unverified	0
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation	Apr 15, 2024	Image SegmentationPanoptic Segmentation	—Unverified	0
COCONut: Modernizing COCO Segmentation	Apr 12, 2024	Panoptic SegmentationSegmentation	—Unverified	0
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation	Apr 4, 2024	Autonomous DrivingDomain Adaptation	—Unverified	0
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments	Apr 2, 2024	Decision MakingPanoptic Segmentation	—Unverified	0
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning	Mar 29, 2024	Continual LearningContinual Panoptic Segmentation	CodeCode Available	2
Using Images as Covariates: Measuring Curb Appeal with Deep Learning	Mar 29, 2024	Deep LearningEconometrics	—Unverified	0
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model	Mar 21, 2024	DecoderGeneralized Referring Expression Segmentation	CodeCode Available	3
Better Call SAL: Towards Learning to Segment Anything in Lidar	Mar 19, 2024	Panoptic SegmentationSegmentation	CodeCode Available	2
PosSAM: Panoptic Open-vocabulary Segment Anything	Mar 14, 2024	DecoderOpen Vocabulary Panoptic Segmentation	CodeCode Available	2
Small, Versatile and Mighty: A Range-View Perception Framework	Mar 1, 2024	Panoptic SegmentationSemantic Segmentation	—Unverified	0
PEM: Prototype-based Efficient MaskFormer for Image Segmentation	Feb 29, 2024	Image SegmentationPanoptic Segmentation	CodeCode Available	2
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving	Feb 23, 2024	BenchmarkingDecision Making	—Unverified	0
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation	Feb 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review	Feb 17, 2024	Panoptic SegmentationScene Segmentation	CodeCode Available	1
Generalizable Entity Grounding via Assistance of Large Language Model	Feb 4, 2024	Language ModelingLanguage Modelling	—Unverified	0
UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models	Jan 25, 2024	Image GenerationImage Segmentation	—Unverified	0

Show:10 25 50

← PrevPage 2 of 10Next →

All datasets COCO test-dev Cityscapes val COCO minival ADE20K val Mapillary val Cityscapes test LaRS S3DIS Area5 ScanNetV2 Indian Driving Dataset KITTI Panoptic Segmentation PanNuke

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Mask DINO (single scale)	PQ	59.5	—	Unverified
2	kMaX-DeepLab (single-scale)	PQ	58.5	—	Unverified
3	Mask2Former (Swin-L)	PQ	58.3	—	Unverified
4	Panoptic SegFormer (Swin-L)	PQ	56.2	—	Unverified
5	Panoptic SegFormer (PVTv2-B5)	PQ	55.8	—	Unverified
6	CMT-DeepLab (single-scale)	PQ	55.7	—	Unverified
7	K-Net (Swin-L)	PQ	55.2	—	Unverified
8	MaskConver (ResNet50, single-scale)	PQ	53.6	—	Unverified
9	MaskFormer (Swin-L)	PQ	53.3	—	Unverified
10	Panoptic FCN* (Swin-L)	PQ	52.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViT-P (OneFormer, InternImage-H)	PQ	70.8	—	Unverified
2	Panoptic FCN* (Swin-L, Cityscapes-fine)	PQst	70.6	—	Unverified
3	OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)	PQ	70.1	—	Unverified
4	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, multi-scale)	PQ	69.6	—	Unverified
5	OneFormer (ConvNeXt-L, single-scale)	PQ	68.51	—	Unverified
6	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, single-scale)	PQ	68.5	—	Unverified
7	Axial-DeepLab-XL (Mapillary Vistas, multi-scale)	PQ	68.5	—	Unverified
8	kMaX-DeepLab (single-scale)	PQ	68.4	—	Unverified
9	OneFormer (ConvNeXt-XL, single-scale)	PQ	68.4	—	Unverified
10	AFF-Base (single-scale, point-based Mask2Former)	PQ	67.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HyperSeg (Swin-B)	PQ	61.2	—	Unverified
2	OneFormer (InternImage-H,single-scale)	PQ	60	—	Unverified
3	OpenSeeD (SwinL, single-scale)	PQ	59.5	—	Unverified
4	UMG-CLIP-E/14	PQ	59.5	—	Unverified
5	MasK DINO (SwinL,single-scale)	PQ	59.4	—	Unverified
6	EoMT (DINOv2-g, single-scale, 1280x1280)	PQ	59.2	—	Unverified
7	UMG-CLIP-L/14	PQ	58.9	—	Unverified
8	Panoptic FCN* (Swin-L, single-scale)	PQth	58.5	—	Unverified
9	DiNAT-L (single-scale, Mask2Former)	PQ	58.5	—	Unverified
10	ViT-Adapter-L (single-scale, BEiTv2 pretrain, Mask2Former)	PQ	58.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (InternImage-H, emb_dim=256, single-scale, 896x896)	PQ	54.5	—	Unverified
2	ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280, COCO_pretrain)	PQ	54	—	Unverified
3	OpenSeed(SwinL, single scale, 1280x1280)	PQ	53.7	—	Unverified
4	OneFormer (DiNAT-L, single-scale, 1280x1280, COCO-Pretrain)	PQ	53.4	—	Unverified
5	EoMT (DINOv2-g, single-scale, 1280x1280, COCO pre-trained)	PQ	52.8	—	Unverified
6	X-Decoder (Davit-d5, Deform, single-scale, 1280x1280)	PQ	52.4	—	Unverified
7	ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280)	PQ	51.9	—	Unverified
8	OneFormer (DiNAT-L, single-scale, 1280x1280)	PQ	51.5	—	Unverified
9	OneFormer (Swin-L, single-scale, 1280x1280)	PQ	51.4	—	Unverified
10	kMaX-DeepLab (ConvNeXt-L, single-scale, 1281x1281)	PQ	50.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (DiNAT-L, single-scale)	PQ	46.7	—	Unverified
2	OneFormer (ConvNeXt-L, single-scale)	PQ	46.4	—	Unverified
3	Panoptic FCN* (Swin-L, single-scale)	PQ	45.7	—	Unverified
4	Panoptic-DeepLab (SWideRNet-(1, 1, 4.5), multi-scale)	PQ	44.8	—	Unverified
5	Panoptic FCN* (ResNet-50-FPN)	PQst	42.3	—	Unverified
6	Mask2Former + Intra-Batch Supervision (ResNet-50)	PQ	42.2	—	Unverified
7	Axial-DeepLab-L (multi-scale)	PQ	41.1	—	Unverified
8	EfficientPS	PQ	40.6	—	Unverified
9	Panoptic-DeepLab (X71)	PQ	40.5	—	Unverified
10	AdaptIS (ResNeXt-101)	PQ	40.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-Pretrained)	PQ	68	—	Unverified
2	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary, multi-scale)	PQ	67.8	—	Unverified
3	EfficientPS	PQ	67.1	—	Unverified
4	Axial-DeepLab-XL (Mapillary Vistas, multi-scale)	PQ	66.6	—	Unverified
5	kMaX-DeepLab (single-scale)	PQ	66.2	—	Unverified
6	Panoptic-Deeplab	PQ	65.5	—	Unverified
7	EfficientPS (Cityscapes-fine)	PQ	62.9	—	Unverified
8	COPS (ResNet-50)	PQ	60	—	Unverified
9	SOGNet (ResNet-50)	PQ	60	—	Unverified
10	Dynamically Instantiated Network	PQ	55.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Mask2Former (Swin-B)	PQ	41.7	—	Unverified
2	Panoptic FPN (ResNet-50)	PQ	40.1	—	Unverified
3	Mask2Former (Swin-T)	PQ	39.2	—	Unverified
4	Panoptic FPN (ResNet-101)	PQ	38.7	—	Unverified
5	Mask2Former (ResNet-50)	PQ	37.6	—	Unverified
6	Mask2Former (ResNet-101)	PQ	37.2	—	Unverified
7	Panoptic Deeplab (ResNet-50)	PQ	34.7	—	Unverified
8	MaX-DeepLab	PQ	31.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SuperCluster	PQ	50.1	—	Unverified
2	PointGroup (Xiang 2023)	PQ	42.3	—	Unverified
3	KPConv (Xiang 2023)	PQ	41.8	—	Unverified
4	MinkowskiNet (Xiang 2023)	PQ	39.2	—	Unverified
5	PointNet++ (Xiang 2023)	PQ	24.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer3D	PQ	71.2	—	Unverified
2	PanopticNDT (10cm)	PQ	59.19	—	Unverified
3	SuperCluster	PQ	58.7	—	Unverified
4	PanopticFusion (with CRF)	PQ	33.5	—	Unverified
5	SceneGraphFusion (NN mapping)	PQ	31.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EfficientPS	PQ	51.1	—	Unverified
2	Seamless	PQ	48.5	—	Unverified
3	UPSNet	PQ	47.1	—	Unverified
4	Panoptic FPN	PQ	46.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EfficientPS	PQ	43.7	—	Unverified
2	Seamless	PQ	42.2	—	Unverified
3	UPSNet	PQ	39.9	—	Unverified
4	Panoptic FPN	PQ	39.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LKCell	PQ	50.8	—	Unverified
2	CellViT-SAM-H	PQ	50.62	—	Unverified
3	TSFD	PQ	50.4	—	Unverified
4	NuLite-H	PQ	49.81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer3D	PQ	71.2	—	Unverified
2	SuperCluster	PQ	58.7	—	Unverified
3	PanopticFusion	PQ	33.5	—	Unverified
4	SceneGraphFusion	PQ	31.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Exchanger+Mask2Former	PQ	52.6	—	Unverified
2	Exchanger+Unet+PaPs	PQ	47.8	—	Unverified
3	U-TAE + PaPs	PQ	40.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VAN-B6*	PQ	58.2	—	Unverified
2	PFPN (ideal number of groups)	PQ	42.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CAFuser (Swin-T)	PQ	59.7	—	Unverified
2	MUSES (Mask2Former /w 4xSwin-T)	PQ	53.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EMSANet (2x ResNet-34 NBt1D, PanopticNDT version, finetuned)	PQ	51.15	—	Unverified
2	EMSANet	PQ	47.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	P3Former	PQ	0.65	—	Unverified
2	DS-Net	PQ	0.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MasQCLIP	PQ	23.3	—	Unverified