SOTAVerified

Video Segmentation

Papers

Showing 151200 of 388 papers

TitleStatusHype
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters0
VideoSAM: A Large Vision Foundation Model for High-Speed Video SegmentationCode0
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation0
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation0
VideoSAM: Open-World Video Segmentation0
Shift and matching queries for video semantic segmentation0
Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision0
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation0
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?0
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track0
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions0
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM20
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation0
Is SAM 2 Better than SAM in Medical Image Segmentation?0
Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation0
Biomedical SAM 2: Segment Anything in Biomedical Images and VideosCode0
FoodMem: Near Real-time and Precise Food Video Segmentation0
DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-ResolutionCode0
Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging0
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation0
Multimodal Segmentation for Vocal Tract Modeling0
2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation0
Visual Representation Learning with Stochastic Frame Prediction0
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data0
Training-Free Robust Interactive Video Object Segmentation0
3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation0
Automatic Dance Video Segmentation for Understanding Choreography0
arcjetCV: an open-source software to analyze material ablationCode0
Triple Component Matrix Factorization: Untangling Global, Local, and Noisy Components0
Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation0
PolypNextLSTM: A lightweight and fast polyp video segmentation network using ConvNext and ConvLSTMCode0
Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound0
Infer from What You Have Seen Before: Temporally-dependent Classifier for Semi-supervised Video SegmentationCode0
Appearance-Based Refinement for Object-Centric Motion Segmentation0
Hierarchical Graph Pattern Understanding for Zero-Shot VOSCode0
GenDeF: Learning Generative Deformation Field for Video Generation0
DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception0
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance FieldsCode0
Correlation-aware active learning for surgery video segmentation0
Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability0
CoralVOS: Dataset and Benchmark for Coral Video Segmentation0
SimLVSeg: Simplifying Left Ventricular Segmentation in 2D+Time Echocardiograms with Self- and Weakly-Supervised LearningCode0
Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric RepresentationCode0
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset0
MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic SegmentationCode0
GL-Fusion: Global-Local Fusion Network for Multi-view Echocardiogram Video SegmentationCode0
Robotic Scene Segmentation with Memory Network for Runtime Surgical Context InferenceCode0
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation0
Immersive Human-Machine Teleoperation Framework for Precision Agriculture: Integrating UAV-based Digital Mapping and Virtual Reality Control0
Show:102550
← PrevPage 4 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified