| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Nov 7, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters | Nov 7, 2024 | Image SegmentationOptical Flow Estimation | —Unverified | 0 |
| VideoSAM: A Large Vision Foundation Model for High-Speed Video Segmentation | Oct 22, 2024 | SegmentationVideo Segmentation | CodeCode Available | 0 |
| Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation | Oct 17, 2024 | Multi-Object TrackingMulti-Object Tracking and Segmentation | —Unverified | 0 |
| Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation | Oct 16, 2024 | BenchmarkingPanoptic Segmentation | —Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous DrivingDecoder | —Unverified | 0 |
| Shift and matching queries for video semantic segmentation | Oct 10, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Sep 14, 2024 | Video SegmentationVideo Semantic Segmentation | —Unverified | 0 |
| LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation | Sep 9, 2024 | ObjectReferring Video Object Segmentation | —Unverified | 0 |
| Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? | Aug 20, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | Aug 19, 2024 | ObjectSegmentation | —Unverified | 0 |
| Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions | Aug 8, 2024 | Information RetrievalSaliency Detection | —Unverified | 0 |
| Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2 | Aug 8, 2024 | Image SegmentationMedical Image Analysis | —Unverified | 0 |
| SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation | Aug 8, 2024 | DecoderInteractive Segmentation | —Unverified | 0 |
| Is SAM 2 Better than SAM in Medical Image Segmentation? | Aug 8, 2024 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| Performance and Non-adversarial Robustness of the Segment Anything Model 2 in Surgical Video Segmentation | Aug 7, 2024 | Adversarial RobustnessImage Segmentation | —Unverified | 0 |
| Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Aug 6, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| FoodMem: Near Real-time and Precise Food Video Segmentation | Jul 16, 2024 | SegmentationSemantic Segmentation | —Unverified | 0 |
| DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution | Jul 1, 2024 | DeblurringSuper-Resolution | CodeCode Available | 0 |
| Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging | Jun 28, 2024 | DenoisingVideo Segmentation | —Unverified | 0 |
| MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation | Jun 27, 2024 | Anomaly DetectionGraph Generation | —Unverified | 0 |
| Multimodal Segmentation for Vocal Tract Modeling | Jun 22, 2024 | SegmentationVideo Segmentation | —Unverified | 0 |
| 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 20, 2024 | Instance SegmentationReferring Video Object Segmentation | —Unverified | 0 |
| Visual Representation Learning with Stochastic Frame Prediction | Jun 11, 2024 | DecoderPose Tracking | —Unverified | 0 |
| I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data | Jun 10, 2024 | NavigateObject | —Unverified | 0 |
| Training-Free Robust Interactive Video Object Segmentation | Jun 8, 2024 | Interactive Video Object SegmentationObject | —Unverified | 0 |
| 3rd Place Solution for MeViS Track in CVPR 2024 PVUW workshop: Motion Expression guided Video Segmentation | Jun 7, 2024 | Referring Video Object SegmentationSemantic Segmentation | —Unverified | 0 |
| Automatic Dance Video Segmentation for Understanding Choreography | May 30, 2024 | SegmentationVideo Segmentation | —Unverified | 0 |
| arcjetCV: an open-source software to analyze material ablation | Apr 17, 2024 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 0 |
| Triple Component Matrix Factorization: Untangling Global, Local, and Noisy Components | Mar 21, 2024 | Anomaly DetectionVideo Segmentation | —Unverified | 0 |
| Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation | Mar 5, 2024 | Optical Flow EstimationSegmentation | —Unverified | 0 |
| PolypNextLSTM: A lightweight and fast polyp video segmentation network using ConvNext and ConvLSTM | Feb 18, 2024 | SegmentationVideo Segmentation | CodeCode Available | 0 |
| Is Two-shot All You Need? A Label-efficient Approach for Video Segmentation in Breast Ultrasound | Feb 7, 2024 | AllLesion Segmentation | —Unverified | 0 |
| Infer from What You Have Seen Before: Temporally-dependent Classifier for Semi-supervised Video Segmentation | Jan 1, 2024 | Representation LearningSemantic Segmentation | CodeCode Available | 0 |
| Appearance-Based Refinement for Object-Centric Motion Segmentation | Dec 18, 2023 | Motion SegmentationObject | —Unverified | 0 |
| Hierarchical Graph Pattern Understanding for Zero-Shot VOS | Dec 15, 2023 | DecoderGraph Neural Network | CodeCode Available | 0 |
| GenDeF: Learning Generative Deformation Field for Video Generation | Dec 7, 2023 | DisentanglementVideo Editing | —Unverified | 0 |
| DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception | Dec 6, 2023 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields | Nov 18, 2023 | DecoderPoint Cloud Segmentation | CodeCode Available | 0 |
| Correlation-aware active learning for surgery video segmentation | Nov 15, 2023 | Active LearningContrastive Learning | —Unverified | 0 |
| Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability | Oct 18, 2023 | SegmentationVideo Segmentation | —Unverified | 0 |
| CoralVOS: Dataset and Benchmark for Coral Video Segmentation | Oct 3, 2023 | SegmentationSemantic Segmentation | —Unverified | 0 |
| SimLVSeg: Simplifying Left Ventricular Segmentation in 2D+Time Echocardiograms with Self- and Weakly-Supervised Learning | Sep 30, 2023 | Left Ventricle SegmentationLV Segmentation | CodeCode Available | 0 |
| Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation | Sep 23, 2023 | ObjectVideo Segmentation | CodeCode Available | 0 |
| SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset | Sep 21, 2023 | Autonomous VehiclesDepth Estimation | —Unverified | 0 |
| MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation | Sep 21, 2023 | Domain AdaptationImage Segmentation | CodeCode Available | 0 |
| GL-Fusion: Global-Local Fusion Network for Multi-view Echocardiogram Video Segmentation | Sep 20, 2023 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 0 |
| Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference | Aug 24, 2023 | Scene SegmentationSegmentation | CodeCode Available | 0 |
| MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation | Aug 22, 2023 | Scene SegmentationSegmentation | —Unverified | 0 |
| Immersive Human-Machine Teleoperation Framework for Precision Agriculture: Integrating UAV-based Digital Mapping and Virtual Reality Control | Aug 14, 2023 | Video SegmentationVideo Semantic Segmentation | —Unverified | 0 |