SOTAVerified

Video Panoptic Segmentation

Video Panoptic Segmentation is a computer vision task that extends panoptic segmentation by incorporating temporal dimension. That is, given a video sequence, the goal is to predict the semantic class of each pixel while consistently tracking object instances. Here, the pixels belonging to the same object instance should be assigned the same instance ID throughout the video sequence.

Papers

Showing 125 of 42 papers

TitleStatusHype
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
Tracking Anything with Decoupled Video SegmentationCode3
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical EnvironmentsCode2
Context-Aware Video Instance SegmentationCode2
PVO: Panoptic Visual OdometryCode2
ViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic SegmentationCode1
Uni-DVPS: Unified Model for Depth-Aware Video Panoptic SegmentationCode1
1st Place Solution for PVUW Challenge 2023: Video Panoptic SegmentationCode1
Tube-Link: A Flexible Cross Tube Framework for Universal Video SegmentationCode1
A Simple Video Segmenter by Tracking Objects Along Axial TrajectoriesCode1
TarViS: A Unified Approach for Target-based Video SegmentationCode1
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic SegmentationCode1
Large-Scale Video Panoptic Segmentation in the Wild: A BenchmarkCode1
DVIS: Decoupled Video Instance Segmentation FrameworkCode1
DVIS++: Improved Decoupled Framework for Universal Video SegmentationCode1
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic SegmentationCode1
Video Panoptic SegmentationCode1
Video K-Net: A Simple, Strong, and Unified Baseline for Video SegmentationCode1
Waymo Open Dataset: Panoramic Video Panoptic SegmentationCode0
An Integrated Framework for Multi-Granular Explanation of Video SummarizationCode0
MGNiceNet: Unified Monocular Geometric Scene UnderstandingCode0
STEP: Segmenting and Tracking Every PixelCode0
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training0
MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmentation0
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.