| ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Jun 26, 2025 | Autonomous NavigationDepth Estimation | —Unverified | 0 |
| Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis | May 31, 2025 | Scene SegmentationSegmentation | —Unverified | 0 |
| JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation | May 15, 2025 | BenchmarkingDepth Estimation | —Unverified | 0 |
| FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization | May 14, 2025 | Scene SegmentationSegmentation | —Unverified | 0 |
| ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking | May 13, 2025 | DiversityMamba | CodeCode Available | 1 |
| Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Apr 23, 2025 | Action Triplet RecognitionFederated Learning | —Unverified | 0 |
| Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation | Apr 18, 2025 | Scene SegmentationScene Understanding | —Unverified | 0 |
| Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation | Mar 28, 2025 | Dataset GenerationDomain Generalization | —Unverified | 0 |
| The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs | Mar 25, 2025 | BenchmarkingScene Segmentation | CodeCode Available | 1 |
| MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps | Mar 23, 2025 | Scene SegmentationVideo Understanding | CodeCode Available | 1 |
| SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints | Mar 19, 2025 | NeRFScene Segmentation | —Unverified | 0 |
| Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Mar 18, 2025 | Instance SegmentationObject | CodeCode Available | 2 |
| Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Mar 17, 2025 | Scene SegmentationTask Planning | —Unverified | 0 |
| Point Cloud Based Scene Segmentation: A Survey | Mar 16, 2025 | 3D Object Detection3D Semantic Segmentation | —Unverified | 0 |
| Long-Video Audio Synthesis with Multi-Agent Collaboration | Mar 13, 2025 | Audio SynthesisScene Segmentation | —Unverified | 0 |
| SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection | Mar 5, 2025 | AnatomyScene Segmentation | —Unverified | 0 |
| Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Feb 22, 2025 | 2D Panoptic Segmentation3D Scene Reconstruction | —Unverified | 0 |
| Activation-wise Propagation: A Universal Strategy to Break Timestep Constraints in Spiking Neural Networks for 3D Data Processing | Feb 18, 2025 | Action RecognitionAutonomous Driving | —Unverified | 0 |
| NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation | Jan 1, 2025 | Scene Segmentation | —Unverified | 0 |
| Classification Drives Geographic Bias in Street Scene Segmentation | Dec 15, 2024 | ClassificationDiversity | —Unverified | 0 |
| Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation | Dec 4, 2024 | 3D Object ClassificationClassification | —Unverified | 0 |
| Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications | Nov 18, 2024 | Scene SegmentationScene Understanding | —Unverified | 0 |
| Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model | Nov 13, 2024 | DecoderScene Segmentation | —Unverified | 0 |
| OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | Nov 5, 2024 | Scene ParsingScene Segmentation | —Unverified | 0 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |