| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Nov 7, 2024 | Decoder, Language Modeling | Unverified | 0 |
| Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters | Nov 7, 2024 | Image Segmentation, Optical Flow Estimation | Unverified | 0 |
| VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation | Oct 22, 2024 | Segmentation, Video Segmentation | Code Available | 0 |
| Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation | Oct 17, 2024 | Multi-Object Tracking, Multi-Object Tracking and Segmentation | Unverified | 0 |
| Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Oct 16, 2024 | Benchmarking, Panoptic Segmentation | Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous Driving, Decoder | Unverified | 0 |
| Shift and matching queries for video semantic segmentation | Oct 10, 2024 | Image Segmentation, Segmentation | Unverified | 0 |
| Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Sep 14, 2024 | Video Segmentation, Video Semantic Segmentation | Unverified | 0 |
| LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation | Sep 9, 2024 | Object, Referring Video Object Segmentation | Unverified | 0 |
| Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Aug 20, 2024 | Image Segmentation, Segmentation | Unverified | 0 |
| Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | Aug 19, 2024 | Object, Segmentation | Unverified | 0 |
| SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation | Aug 8, 2024 | Decoder, Interactive Segmentation | Unverified | 0 |
| Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions | Aug 8, 2024 | Information Retrieval, Saliency Detection | Unverified | 0 |
| Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2 | Aug 8, 2024 | Image Segmentation, Medical Image Analysis | Unverified | 0 |
| Is SAM 2 Better than SAM in Medical Image Segmentation? | Aug 8, 2024 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Aug 7, 2024 | Adversarial Robustness, Image Segmentation | Unverified | 0 |
| Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Aug 6, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| FoodMem: Near Real-time and Precise Food Video Segmentation | Jul 16, 2024 | Segmentation, Semantic Segmentation | Unverified | 0 |
| DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution | Jul 1, 2024 | Deblurring, Super-Resolution | Code Available | 0 |
| Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging | Jun 28, 2024 | Denoising, Video Segmentation | Unverified | 0 |
| MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation | Jun 27, 2024 | Anomaly Detection, Graph Generation | Unverified | 0 |
| Multimodal Segmentation for Vocal Tract Modeling | Jun 22, 2024 | Segmentation, Video Segmentation | Unverified | 0 |
| 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 20, 2024 | Instance Segmentation, Referring Video Object Segmentation | Unverified | 0 |
| Visual Representation Learning with Stochastic Frame Prediction | Jun 11, 2024 | Decoder, Pose Tracking | Unverified | 0 |
| I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data | Jun 10, 2024 | Navigate, Object | Unverified | 0 |