SOTAVerified

Video Segmentation

Papers

Showing 51100 of 388 papers

TitleStatusHype
When SAM2 Meets Video Shadow and Mirror DetectionCode0
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language ModelsCode2
Collaborative Hybrid Propagator for Temporal Misalignment in Audio-Visual Segmentation0
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any GranularityCode2
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different ScenesCode3
Multi-Granularity Video Object SegmentationCode1
Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2Code2
Efficient Track AnythingCode7
RoMo: Robust Motion Segmentation Improves Structure from Motion0
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video SegmentationCode3
Geometric Algebra Planes: Convex Implicit Neural Volumes0
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level0
Zero-shot capability of SAM-family models for bone segmentation in CT scans0
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection DataCode0
GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting0
Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters0
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos0
SMITE: Segment Me In TimECode3
VideoSAM: A Large Vision Foundation Model for High-Speed Video SegmentationCode0
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory TreeCode4
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation0
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation0
VideoSAM: Open-World Video Segmentation0
Shift and matching queries for video semantic segmentation0
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2Code5
Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision0
Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 ModelCode2
LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation0
Unleashing the Potential of SAM2 for Biomedical Images and Videos: A SurveyCode5
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?0
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track0
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame PruningCode2
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM20
Is SAM 2 Better than SAM in Medical Image Segmentation?0
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions0
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation0
Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation0
Segment Anything in Medical Images and Videos: Benchmark and DeploymentCode7
Biomedical SAM 2: Segment Anything in Biomedical Images and VideosCode0
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2Code3
SAM 2: Segment Anything in Images and VideosCode11
ViLLa: Video Reasoning Segmentation with Large Language ModelCode1
FoodMem: Near Real-time and Precise Food Video Segmentation0
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
General and Task-Oriented Video SegmentationCode1
Uni-DVPS: Unified Model for Depth-Aware Video Panoptic SegmentationCode1
DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-ResolutionCode0
Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging0
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation0
PVUW 2024 Challenge on Complex Video Understanding: Methods and ResultsCode4
Show:102550
← PrevPage 2 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified