SOTAVerified

Video Segmentation

Papers

Showing 1-25 of 388 papers

Title | Status | Hype
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation |  | 0
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation |  | 0
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Code | 1
CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video |  | 0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment |  | 0
A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects |  | 0
Q-SAM2: Accurate Quantization for Segment Anything Model 2 |  | 0
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost | Code | 1
OmniFall: A Unified Staged-to-Wild Benchmark for Human Fall Detection | Code | 0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | Code | 0
Unlocking the Power of SAM 2 for Few-Shot Segmentation | Code | 1
FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching |  | 0
VolE: A Point-cloud Framework for Food 3D Reconstruction and Volume Estimation |  | 0
TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | Code | 1
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Code | 1
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild |  | 0
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Code | 5
MedSAM2: Segment Anything in 3D Medical Images and Videos | Code | 4
Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation |  | 0
Online Reasoning Video Segmentation with Just-in-Time Digital Twins |  | 0
CamSAM2: Segment Anything Accurately in Camouflaged Videos | Code | 1
Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency Relearning | Code | 0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking | Code | 0
SAM2 for Image and Video Segmentation: A Comprehensive Survey |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GDHF | Accuracy | 86.86 |  | Unverified