SOTAVerified

Video Segmentation

Papers

Showing 101150 of 388 papers

TitleStatusHype
Physarum Powered Differentiable Linear Programming Layers and ApplicationsCode1
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain KnowledgeCode1
Efficient Semantic Video Segmentation with Per-frame InferenceCode1
Zero-Shot Video Object Segmentation via Attentive Graph Neural NetworksCode1
Separable Convolutional LSTMs for Faster Video SegmentationCode1
Semantic Segmentation of Video Sequences with Convolutional LSTMsCode1
YouTube-VOS: Sequence-to-Sequence Video Object SegmentationCode1
Actor and Action Video Segmentation from a SentenceCode1
One-Shot Video Object SegmentationCode1
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation0
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation0
CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects0
Q-SAM2: Accurate Quantization for Segment Anything Model 20
OmniFall: A Unified Staged-to-Wild Benchmark for Human Fall DetectionCode0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of ThoughtsCode0
FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching0
VolE: A Point-cloud Framework for Food 3D Reconstruction and Volume Estimation0
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild0
Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation0
Online Reasoning Video Segmentation with Just-in-Time Digital Twins0
Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency RelearningCode0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and TrackingCode0
SAM2 for Image and Video Segmentation: A Comprehensive Survey0
Open-World Skill Discovery from Unsegmented Demonstrations0
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation0
Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching0
Parameter-free Video Segmentation for Vision and Language Understanding0
An Analysis of Data Transformation Effects on Segment Anything 20
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review0
Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field0
Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation0
Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors0
Efficient Frame Extraction: A Novel Approach Through Frame Similarity and Surgical Tool Tracking for Video SegmentationCode0
Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation0
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron MicroscopyCode0
EntitySAM: Segment Everything in Video0
VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
Decoupled Motion Expression Video Segmentation0
Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation0
Generative Video Propagation0
When SAM2 Meets Video Shadow and Mirror DetectionCode0
Collaborative Hybrid Propagator for Temporal Misalignment in Audio-Visual Segmentation0
RoMo: Robust Motion Segmentation Improves Structure from Motion0
Geometric Algebra Planes: Convex Implicit Neural Volumes0
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level0
Zero-shot capability of SAM-family models for bone segmentation in CT scans0
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection DataCode0
GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified