| Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation | Jul 13, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation | Jul 10, 2025 | NeRFObject | —Unverified | 0 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image SegmentationLarge Language Model | CodeCode Available | 1 |
| CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video | Jun 25, 2025 | Knowledge TracingVideo Segmentation | —Unverified | 0 |
| Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment | Jun 17, 2025 | Autonomous DrivingInstance Segmentation | —Unverified | 0 |
| A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects | Jun 16, 2025 | BenchmarkingInstance Segmentation | —Unverified | 0 |
| Q-SAM2: Accurate Quantization for Segment Anything Model 2 | Jun 11, 2025 | QuantizationVideo Segmentation | —Unverified | 0 |
| SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost | Jun 2, 2025 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 |
| OmniFall: A Unified Staged-to-Wild Benchmark for Human Fall Detection | May 26, 2025 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 0 |
| ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | May 24, 2025 | Image SegmentationInstance Segmentation | CodeCode Available | 0 |