| Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters | Nov 7, 2024 | Image Segmentation, Optical Flow Estimation | Unverified | 0 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Nov 7, 2024 | Decoder, Language Modeling | Unverified | 0 |
| VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation | Oct 22, 2024 | Segmentation, Video Segmentation | Code Available | 0 |
| Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation | Oct 17, 2024 | Multi-Object Tracking, Multi-Object Tracking and Segmentation | Unverified | 0 |
| Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Oct 16, 2024 | Benchmarking, Panoptic Segmentation | Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous Driving, Decoder | Unverified | 0 |
| Shift and matching queries for video semantic segmentation | Oct 10, 2024 | Image Segmentation, Segmentation | Unverified | 0 |
| Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Sep 14, 2024 | Video Segmentation, Video Semantic Segmentation | Unverified | 0 |
| LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation | Sep 9, 2024 | Object, Referring Video Object Segmentation | Unverified | 0 |
| Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Aug 20, 2024 | Image Segmentation, Segmentation | Unverified | 0 |