| Unlocking the Power of SAM 2 for Few-Shot Segmentation | May 20, 2025 | Segmentation, Video Segmentation | Code Available | 1 |
| FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching | May 19, 2025 | Instance Segmentation, Segmentation | Unverified | 0 |
| VolE: A Point-cloud Framework for Food 3D Reconstruction and Volume Estimation | May 15, 2025 | 3D Reconstruction, Camera Calibration | Unverified | 0 |
| TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | May 2, 2025 | Dense Captioning, Highlight Detection | Code Available | 1 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot Learning, Interactive Segmentation | Code Available | 1 |
| PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild | Apr 15, 2025 | Segmentation, Semantic Segmentation | Unverified | 0 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Apr 7, 2025 | Inference Optimization, Referring Video Object Segmentation | Code Available | 5 |
| MedSAM2: Segment Anything in 3D Medical Images and Videos | Apr 4, 2025 | Segmentation, Video Segmentation | Code Available | 4 |
| Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation | Mar 27, 2025 | Binary Classification, Video Segmentation | Unverified | 0 |