SOTAVerified

Video Segmentation

Papers

Showing 1-50 of 388 papers

Title | Status | Hype
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation | | 0
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | | 0
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Code | 1
CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video | | 0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | | 0
A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects | | 0
Q-SAM2: Accurate Quantization for Segment Anything Model 2 | | 0
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost | Code | 1
OmniFall: A Unified Staged-to-Wild Benchmark for Human Fall Detection | Code | 0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | Code | 0
Unlocking the Power of SAM 2 for Few-Shot Segmentation | Code | 1
FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | | 0
VolE: A Point-cloud Framework for Food 3D Reconstruction and Volume Estimation | | 0
TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | Code | 1
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Code | 1
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild | | 0
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Code | 5
MedSAM2: Segment Anything in 3D Medical Images and Videos | Code | 4
Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation | | 0
Online Reasoning Video Segmentation with Just-in-Time Digital Twins | | 0
CamSAM2: Segment Anything Accurately in Camouflaged Videos | Code | 1
Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency Relearning | Code | 0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking | Code | 0
SAM2 for Image and Video Segmentation: A Comprehensive Survey | | 0
Open-World Skill Discovery from Unsegmented Demonstrations | | 0
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation | | 0
Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching | | 0
Parameter-free Video Segmentation for Vision and Language Understanding | | 0
BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Code | 1
An Analysis of Data Transformation Effects on Segment Anything 2 | | 0
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | | 0
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | | 0
Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation | | 0
SASVi - Segment Any Surgical Video | Code | 1
Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors | | 0
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Code | 1
Efficient Frame Extraction: A Novel Approach Through Frame Similarity and Surgical Tool Tracking for Video Segmentation | Code | 0
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Code | 1
EdgeTAM: On-Device Track Anything Model | Code | 4
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning | Code | 1
Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation | | 0
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Code | 5
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | Code | 0
EntitySAM: Segment Everything in Video | | 0
Decoupled Motion Expression Video Segmentation | | 0
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | | 0
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Code | 2
Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation | | 0
Generative Video Propagation | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GDHF | Accuracy | 86.86 | | Unverified