SOTAVerified

Video Segmentation

Papers

Showing 151175 of 388 papers

TitleStatusHype
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters0
VideoSAM: A Large Vision Foundation Model for High-Speed Video SegmentationCode0
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation0
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation0
VideoSAM: Open-World Video Segmentation0
Shift and matching queries for video semantic segmentation0
Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision0
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation0
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?0
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track0
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation0
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions0
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM20
Is SAM 2 Better than SAM in Medical Image Segmentation?0
Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation0
Biomedical SAM 2: Segment Anything in Biomedical Images and VideosCode0
FoodMem: Near Real-time and Precise Food Video Segmentation0
DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-ResolutionCode0
Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging0
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation0
Multimodal Segmentation for Vocal Tract Modeling0
2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation0
Visual Representation Learning with Stochastic Frame Prediction0
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data0
Show:102550
← PrevPage 7 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified