SOTAVerified

Video Segmentation

Papers

Showing 2650 of 388 papers

TitleStatusHype
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual PerceiverCode2
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any GranularityCode2
TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language TranslationCode2
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame PruningCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
XMem++: Production-level Video Segmentation From Few Annotated FramesCode2
Decoupling Static and Hierarchical Motion Perception for Referring Video SegmentationCode2
Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 ModelCode2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
MemSAM: Taming Segment Anything Model for Echocardiography Video SegmentationCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2Code2
A Survey on Deep Learning Technique for Video SegmentationCode1
Local-Global Context Aware Transformer for Language-Guided Video SegmentationCode1
Making a Case for 3D Convolutions for Object Segmentation in VideosCode1
In-N-Out Generative Learning for Dense Unsupervised Video SegmentationCode1
CamSAM2: Segment Anything Accurately in Camouflaged VideosCode1
BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket SportsCode1
Global Knowledge Calibration for Fast Open-Vocabulary SegmentationCode1
GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video SegmentationCode1
A Simple Video Segmenter by Tracking Objects Along Axial TrajectoriesCode1
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video SegmentationCode1
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object RelationsCode1
CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video SegmentationCode1
Show:102550
← PrevPage 2 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDHFAccuracy86.86Unverified