| Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation | Oct 17, 2024 | Multi-Object Tracking, Multi-Object Tracking and Segmentation | —Unverified | 0 |
| Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Oct 16, 2024 | Benchmarking, Panoptic Segmentation | —Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous Driving, Decoder | —Unverified | 0 |
| Shift and matching queries for video semantic segmentation | Oct 10, 2024 | Image Segmentation, Segmentation | —Unverified | 0 |
| Underwater Camouflaged Object Tracking Meets Vision-Language SAM2 | Sep 25, 2024 | Object, Object Tracking | Code Available | 5 |
| Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model | Sep 14, 2024 | Medical Image Segmentation, Polyp Segmentation | Code Available | 2 |
| Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Sep 14, 2024 | Video Segmentation, Video Semantic Segmentation | —Unverified | 0 |
| LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation | Sep 9, 2024 | Object, Referring Video Object Segmentation | —Unverified | 0 |
| Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey | Aug 23, 2024 | Image Segmentation, Segmentation | Code Available | 5 |
| Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Aug 20, 2024 | Image Segmentation, Segmentation | —Unverified | 0 |