SOTAVerified

Video Segmentation

Papers

Showing 150 of 388 papers

TitleStatusHype
SAM 2: Segment Anything in Images and VideosCode11
Efficient Track AnythingCode7
Segment Anything in Medical Images and Videos: Benchmark and DeploymentCode7
The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video SegmentationCode5
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2Code5
Unleashing the Potential of SAM2 for Biomedical Images and Videos: A SurveyCode5
MedSAM2: Segment Anything in 3D Medical Images and VideosCode4
EdgeTAM: On-Device Track Anything ModelCode4
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory TreeCode4
PVUW 2024 Challenge on Complex Video Understanding: Methods and ResultsCode4
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different ScenesCode3
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video SegmentationCode3
SMITE: Segment Me In TimECode3
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2Code3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
RAP-SAM: Towards Real-Time All-Purpose Segment AnythingCode3
Tracking Anything with Decoupled Video SegmentationCode3
VideoCutLER: Surprisingly Simple Unsupervised Video Instance SegmentationCode3
Segment Anything Meets Point TrackingCode3
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools SegmentationCode3
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual PerceiverCode2
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language ModelsCode2
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any GranularityCode2
Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2Code2
Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 ModelCode2
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame PruningCode2
Decoupling Static and Hierarchical Motion Perception for Referring Video SegmentationCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
MemSAM: Taming Segment Anything Model for Echocardiography Video SegmentationCode2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
XMem++: Production-level Video Segmentation From Few Annotated FramesCode2
InstMove: Instance Motion for Object-centric Video SegmentationCode2
Mask2Former for Video Instance SegmentationCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language TranslationCode2
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training CostCode1
Unlocking the Power of SAM 2 for Few-Shot SegmentationCode1
TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in ActionCode1
DC-SAM: In-Context Segment Anything in Images and Videos via Dual ConsistencyCode1
CamSAM2: Segment Anything Accurately in Camouflaged VideosCode1
BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket SportsCode1
SASVi - Segment Any Surgical VideoCode1
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object SegmentationCode1
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural NetworksCode1
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video CaptioningCode1
Multi-Granularity Video Object SegmentationCode1
Show:102550
← PrevPage 1 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified