Panoptic Segmentation

Panoptic segmentation is a computer vision task that combines semantic segmentation and instance segmentation to give a unified, pixel-level understanding of a scene. The goal is to partition the image into semantically meaningful regions while also detecting and distinguishing the individual object instances within those regions. Every pixel in the image is assigned a semantic label, and pixels belonging to "things" classes (countable objects such as cars and people) are additionally assigned a unique instance ID. (Image credit: Detectron2)
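The labelling scheme described above can be illustrated with a minimal sketch. It assumes a toy label set (road, sky, car) and an illustrative encoding in which each pixel stores `class_id * 1000 + instance_id`; the class IDs, the divisor, and the function name `merge_to_panoptic` are assumptions made for this example, not the format of any particular dataset or library.

```python
import numpy as np

# Hypothetical label set for illustration: 0 = road (stuff), 1 = sky (stuff), 2 = car (thing).
THING_CLASSES = {2}
# Assumed encoding: panoptic_id = class_id * LABEL_DIVISOR + instance_id.
LABEL_DIVISOR = 1000


def merge_to_panoptic(semantic, instance):
    """Combine a semantic map and an instance map into one panoptic map.

    semantic: (H, W) integer array with one class ID per pixel.
    instance: (H, W) integer array with instance IDs (>= 1 on "things" pixels).
    """
    semantic = np.asarray(semantic, dtype=np.int64)
    instance = np.asarray(instance, dtype=np.int64)
    is_thing = np.isin(semantic, list(THING_CLASSES))
    # Stuff pixels get instance ID 0; thing pixels keep their own instance ID.
    instance_part = np.where(is_thing, instance, 0)
    return semantic * LABEL_DIVISOR + instance_part


# A 3x4 toy image: sky on top, two separate cars on a road.
semantic = np.array([[1, 1, 1, 1],
                     [2, 2, 0, 2],
                     [0, 0, 0, 0]])
instance = np.array([[0, 0, 0, 0],
                     [1, 1, 0, 2],
                     [0, 0, 0, 0]])
panoptic = merge_to_panoptic(semantic, instance)
print(panoptic)
```

In this sketch the two cars share the semantic class 2 but end up in distinct segments because their instance IDs differ, while road and sky each form a single segment.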

Papers

Showing 201–250 of 462 papers

Title	Date	Tasks	Status	Hype
Temporal Context for Robust Maritime Obstacle Detection	Mar 10, 2022	Object, Panoptic Segmentation	Code Available	1
Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving	Jan 1, 2023	Autonomous Driving, Panoptic Segmentation	Code Available	1
PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation	Jun 1, 2022	Depth Estimation, Depth Prediction	Code Available	1
EDAPS: Enhanced Domain-Adaptive Panoptic Segmentation	Apr 27, 2023	Domain Adaptation, Instance Segmentation	Code Available	1
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments	Jul 10, 2022	Instance Segmentation, Panoptic Segmentation	Code Available	1
Efficient Multi-Task Scene Analysis with RGB-D Transformers	Jun 8, 2023	Panoptic Segmentation, Scene Classification	Code Available	1
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation	Jan 2, 2023	Caption Generation, Instance Segmentation	Code Available	1
EfficientPS: Efficient Panoptic Segmentation	Apr 5, 2020	Instance Segmentation, Panoptic Segmentation	Code Available	1
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations	Jun 14, 2024	Panoptic Segmentation, Part-aware Panoptic Segmentation	Code Available	1
Panoptic Narrative Grounding	Sep 10, 2021	Natural Language Visual Grounding, Panoptic Segmentation	Code Available	1
Panoptic Narrative Grounding	Jan 1, 2021	Natural Language Visual Grounding, Panoptic Segmentation	Code Available	1
PanopticNDT: Efficient and Robust Panoptic Mapping	Sep 24, 2023	2D Panoptic Segmentation, 3D Panoptic Segmentation	Code Available	1
ElC-OIS: Ellipsoidal Clustering for Open-World Instance Segmentation on LiDAR Data	Mar 8, 2023	Autonomous Navigation, Clustering	Code Available	1
End-to-End Object Detection with Transformers	May 26, 2020	2D Object Detection, Decoder	Code Available	1
Zero-Shot 4D Lidar Panoptic Segmentation	Apr 1, 2025	Diversity, Panoptic Segmentation	Unverified	0
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Jun 8, 2024	Benchmarking, Instance Segmentation	Unverified	0
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation	Jun 1, 2024	Autonomous Driving, Panoptic Segmentation	Unverified	0
3D detection of roof sections from a single satellite image and application to LOD2-building reconstruction	Jul 11, 2023	3D Reconstruction, Panoptic Segmentation	Unverified	0
3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation	Jan 4, 2024	3D Panoptic Segmentation, Autonomous Driving	Unverified	0
3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation	Jun 11, 2023	Instance Segmentation, Panoptic Segmentation	Unverified	0
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation	Jun 6, 2024	Panoptic Segmentation, Segmentation	Unverified	0
4D-Former: Multimodal 4D Panoptic Segmentation	Nov 2, 2023	4D Panoptic Segmentation, Panoptic Segmentation	Unverified	0
4D Panoptic Segmentation as Invariant and Equivariant Field Prediction	Mar 28, 2023	4D Panoptic Segmentation, Autonomous Driving	Unverified	0
7th AI Driving Olympics: 1st Place Report for Panoptic Tracking	Dec 9, 2021	Benchmarking, Panoptic Segmentation	Unverified	0
A Benchmark for LiDAR-based Panoptic Segmentation based on KITTI	Mar 4, 2020	Instance Segmentation, Panoptic Segmentation	Unverified	0
ACDC: The Adverse Conditions Dataset with Correspondences for Robust Semantic Driving Scene Perception	Apr 27, 2021	Instance Segmentation, object-detection	Unverified	0
A Compositional Approach to Occlusion in Panoptic Segmentation	Sep 29, 2021	Image Segmentation, object-detection	Unverified	0
A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects	Jun 16, 2025	Benchmarking, Instance Segmentation	Unverified	0
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation	Dec 7, 2020	Instance Segmentation, Panoptic Segmentation	Unverified	0
A Generalist Framework for Panoptic Segmentation of Images and Videos	Oct 12, 2022	Inductive Bias, Panoptic Segmentation	Unverified	0
Agricultural Landscape Understanding At Country-Scale	Nov 8, 2024	Decision Making, Panoptic Segmentation	Unverified	0
Amodal Panoptic Segmentation	Feb 23, 2022	Amodal Panoptic Segmentation, Instance Segmentation	Unverified	0
An End-to-End Network for Panoptic Segmentation	Mar 12, 2019	Panoptic Segmentation, Segmentation	Unverified	0
An End-to-End Trainable Video Panoptic Segmentation Method using Transformers	Oct 8, 2021	Multi-Object Tracking, Object Tracking	Unverified	0
AOP-Net: All-in-One Perception Network for Joint LiDAR-based 3D Object Detection and Panoptic Segmentation	Feb 2, 2023	3D Object Detection, All	Unverified	0
A SAM-based Solution for Hierarchical Panoptic Segmentation of Crops and Weeds Competition	Sep 24, 2023	Instance Segmentation, object-detection	Unverified	0
A Simple and Generalist Approach for Panoptic Segmentation	Aug 29, 2024	Missing Labels, Panoptic Segmentation	Unverified	0
ASSIST: Interactive Scene Nodes for Scalable and Realistic Indoor Simulation	Nov 10, 2023	Panoptic Segmentation	Unverified	0
A Survey on Label-efficient Deep Image Segmentation: Bridging the Gap between Weak Supervision and Dense Prediction	Jul 4, 2022	Image Segmentation, Instance Segmentation	Unverified	0
Attention-guided Unified Network for Panoptic Segmentation	Dec 10, 2018	Panoptic Segmentation, Segmentation	Unverified	0
Automated processing of X-ray computed tomography images via panoptic segmentation for modeling woven composite textiles	Feb 2, 2022	Computed Tomography (CT), Panoptic Segmentation	Unverified	0
A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation	Mar 4, 2022	3D Object Detection, 3D Panoptic Segmentation	Unverified	0
Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Dec 10, 2024	Decoder, Depth-aware Video Panoptic Segmentation	Unverified	0
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology	Sep 21, 2024	Benchmarking, Depth Estimation	Unverified	0
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving	Feb 23, 2024	Benchmarking, Decision Making	Unverified	0
Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation	Dec 13, 2023	Panoptic Segmentation, Position	Unverified	0
Bidirectional Graph Reasoning Network for Panoptic Segmentation	Apr 14, 2020	Instance Segmentation, Panoptic Segmentation	Unverified	0
Boosting Supervised Learning Performance with Co-training	Nov 18, 2021	Domain Adaptation, object-detection	Unverified	0
CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings	Dec 10, 2024	Diversity, Panoptic Segmentation	Unverified	0
Can we cover navigational perception needs of the visually impaired by panoptic segmentation?	Jul 20, 2020	Deep Learning, Instance Segmentation	Unverified	0

Datasets: COCO test-dev, Cityscapes val, COCO minival, ADE20K val, Mapillary val, Cityscapes test, LaRS, S3DIS Area5, ScanNetV2, Indian Driving Dataset, KITTI Panoptic Segmentation, PanNuke

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Mask DINO (single scale)	PQ	59.5	—	Unverified
2	kMaX-DeepLab (single-scale)	PQ	58.5	—	Unverified
3	Mask2Former (Swin-L)	PQ	58.3	—	Unverified
4	Panoptic SegFormer (Swin-L)	PQ	56.2	—	Unverified
5	Panoptic SegFormer (PVTv2-B5)	PQ	55.8	—	Unverified
6	CMT-DeepLab (single-scale)	PQ	55.7	—	Unverified
7	K-Net (Swin-L)	PQ	55.2	—	Unverified
8	MaskConver (ResNet50, single-scale)	PQ	53.6	—	Unverified
9	MaskFormer (Swin-L)	PQ	53.3	—	Unverified
10	Panoptic FCN* (Swin-L)	PQ	52.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViT-P (OneFormer, InternImage-H)	PQ	70.8	—	Unverified
2	Panoptic FCN* (Swin-L, Cityscapes-fine)	PQst	70.6	—	Unverified
3	OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)	PQ	70.1	—	Unverified
4	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, multi-scale)	PQ	69.6	—	Unverified
5	OneFormer (ConvNeXt-L, single-scale)	PQ	68.51	—	Unverified
6	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, single-scale)	PQ	68.5	—	Unverified
7	Axial-DeepLab-XL (Mapillary Vistas, multi-scale)	PQ	68.5	—	Unverified
8	kMaX-DeepLab (single-scale)	PQ	68.4	—	Unverified
9	OneFormer (ConvNeXt-XL, single-scale)	PQ	68.4	—	Unverified
10	AFF-Base (single-scale, point-based Mask2Former)	PQ	67.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HyperSeg (Swin-B)	PQ	61.2	—	Unverified
2	OneFormer (InternImage-H, single-scale)	PQ	60	—	Unverified
3	UMG-CLIP-E/14	PQ	59.5	—	Unverified
4	OpenSeeD (SwinL, single-scale)	PQ	59.5	—	Unverified
5	Mask DINO (SwinL, single-scale)	PQ	59.4	—	Unverified
6	EoMT (DINOv2-g, single-scale, 1280x1280)	PQ	59.2	—	Unverified
7	UMG-CLIP-L/14	PQ	58.9	—	Unverified
8	Panoptic FCN* (Swin-L, single-scale)	PQth	58.5	—	Unverified
9	DiNAT-L (single-scale, Mask2Former)	PQ	58.5	—	Unverified
10	ViT-Adapter-L (single-scale, BEiTv2 pretrain, Mask2Former)	PQ	58.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (InternImage-H, emb_dim=256, single-scale, 896x896)	PQ	54.5	—	Unverified
2	ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280, COCO_pretrain)	PQ	54	—	Unverified
3	OpenSeeD (SwinL, single-scale, 1280x1280)	PQ	53.7	—	Unverified
4	OneFormer (DiNAT-L, single-scale, 1280x1280, COCO-Pretrain)	PQ	53.4	—	Unverified
5	EoMT (DINOv2-g, single-scale, 1280x1280, COCO pre-trained)	PQ	52.8	—	Unverified
6	X-Decoder (Davit-d5, Deform, single-scale, 1280x1280)	PQ	52.4	—	Unverified
7	ViT-P (OneFormer, DiNAT-L, single-scale, 1280x1280)	PQ	51.9	—	Unverified
8	OneFormer (DiNAT-L, single-scale, 1280x1280)	PQ	51.5	—	Unverified
9	OneFormer (Swin-L, single-scale, 1280x1280)	PQ	51.4	—	Unverified
10	kMaX-DeepLab (ConvNeXt-L, single-scale, 1281x1281)	PQ	50.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (DiNAT-L, single-scale)	PQ	46.7	—	Unverified
2	OneFormer (ConvNeXt-L, single-scale)	PQ	46.4	—	Unverified
3	Panoptic FCN* (Swin-L, single-scale)	PQ	45.7	—	Unverified
4	Panoptic-DeepLab (SWideRNet-(1, 1, 4.5), multi-scale)	PQ	44.8	—	Unverified
5	Panoptic FCN* (ResNet-50-FPN)	PQst	42.3	—	Unverified
6	Mask2Former + Intra-Batch Supervision (ResNet-50)	PQ	42.2	—	Unverified
7	Axial-DeepLab-L (multi-scale)	PQ	41.1	—	Unverified
8	EfficientPS	PQ	40.6	—	Unverified
9	Panoptic-DeepLab (X71)	PQ	40.5	—	Unverified
10	AdaptIS (ResNeXt-101)	PQ	40.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-Pretrained)	PQ	68	—	Unverified
2	Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary, multi-scale)	PQ	67.8	—	Unverified
3	EfficientPS	PQ	67.1	—	Unverified
4	Axial-DeepLab-XL (Mapillary Vistas, multi-scale)	PQ	66.6	—	Unverified
5	kMaX-DeepLab (single-scale)	PQ	66.2	—	Unverified
6	Panoptic-DeepLab	PQ	65.5	—	Unverified
7	EfficientPS (Cityscapes-fine)	PQ	62.9	—	Unverified
8	COPS (ResNet-50)	PQ	60	—	Unverified
9	SOGNet (ResNet-50)	PQ	60	—	Unverified
10	Dynamically Instantiated Network	PQ	55.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Mask2Former (Swin-B)	PQ	41.7	—	Unverified
2	Panoptic FPN (ResNet-50)	PQ	40.1	—	Unverified
3	Mask2Former (Swin-T)	PQ	39.2	—	Unverified
4	Panoptic FPN (ResNet-101)	PQ	38.7	—	Unverified
5	Mask2Former (ResNet-50)	PQ	37.6	—	Unverified
6	Mask2Former (ResNet-101)	PQ	37.2	—	Unverified
7	Panoptic-DeepLab (ResNet-50)	PQ	34.7	—	Unverified
8	MaX-DeepLab	PQ	31.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SuperCluster	PQ	50.1	—	Unverified
2	PointGroup (Xiang 2023)	PQ	42.3	—	Unverified
3	KPConv (Xiang 2023)	PQ	41.8	—	Unverified
4	MinkowskiNet (Xiang 2023)	PQ	39.2	—	Unverified
5	PointNet++ (Xiang 2023)	PQ	24.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer3D	PQ	71.2	—	Unverified
2	PanopticNDT (10cm)	PQ	59.19	—	Unverified
3	SuperCluster	PQ	58.7	—	Unverified
4	PanopticFusion (with CRF)	PQ	33.5	—	Unverified
5	SceneGraphFusion (NN mapping)	PQ	31.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EfficientPS	PQ	51.1	—	Unverified
2	Seamless	PQ	48.5	—	Unverified
3	UPSNet	PQ	47.1	—	Unverified
4	Panoptic FPN	PQ	46.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EfficientPS	PQ	43.7	—	Unverified
2	Seamless	PQ	42.2	—	Unverified
3	UPSNet	PQ	39.9	—	Unverified
4	Panoptic FPN	PQ	39.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LKCell	PQ	50.8	—	Unverified
2	CellViT-SAM-H	PQ	50.62	—	Unverified
3	TSFD	PQ	50.4	—	Unverified
4	NuLite-H	PQ	49.81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OneFormer3D	PQ	71.2	—	Unverified
2	SuperCluster	PQ	58.7	—	Unverified
3	PanopticFusion	PQ	33.5	—	Unverified
4	SceneGraphFusion	PQ	31.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Exchanger+Mask2Former	PQ	52.6	—	Unverified
2	Exchanger+Unet+PaPs	PQ	47.8	—	Unverified
3	U-TAE + PaPs	PQ	40.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VAN-B6*	PQ	58.2	—	Unverified
2	PFPN (ideal number of groups)	PQ	42.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CAFuser (Swin-T)	PQ	59.7	—	Unverified
2	MUSES (Mask2Former w/ 4xSwin-T)	PQ	53.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EMSANet (2x ResNet-34 NBt1D, PanopticNDT version, finetuned)	PQ	51.15	—	Unverified
2	EMSANet	PQ	47.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	P3Former	PQ	0.65	—	Unverified
2	DS-Net	PQ	0.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MasQCLIP	PQ	23.3	—	Unverified