| ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Jun 26, 2025 | Autonomous Navigation, Depth Estimation | Unverified | 0 |
| Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis | May 31, 2025 | Scene Segmentation, Segmentation | Unverified | 0 |
| JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation | May 15, 2025 | Benchmarking, Depth Estimation | Unverified | 0 |
| FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization | May 14, 2025 | Scene Segmentation, Segmentation | Unverified | 0 |
| ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking | May 13, 2025 | Diversity, Mamba | Code Available | 1 |
| Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Apr 23, 2025 | Action Triplet Recognition, Federated Learning | Unverified | 0 |
| Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation | Apr 18, 2025 | Scene Segmentation, Scene Understanding | Unverified | 0 |
| Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation | Mar 28, 2025 | Dataset Generation, Domain Generalization | Unverified | 0 |
| The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs | Mar 25, 2025 | Benchmarking, Scene Segmentation | Code Available | 1 |
| MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps | Mar 23, 2025 | Scene Segmentation, Video Understanding | Code Available | 1 |
| SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints | Mar 19, 2025 | NeRF, Scene Segmentation | Unverified | 0 |
| Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Mar 18, 2025 | Instance Segmentation, Object | Code Available | 2 |
| Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Mar 17, 2025 | Scene Segmentation, Task Planning | Unverified | 0 |
| Point Cloud Based Scene Segmentation: A Survey | Mar 16, 2025 | 3D Object Detection, 3D Semantic Segmentation | Unverified | 0 |
| Long-Video Audio Synthesis with Multi-Agent Collaboration | Mar 13, 2025 | Audio Synthesis, Scene Segmentation | Unverified | 0 |
| SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection | Mar 5, 2025 | Anatomy, Scene Segmentation | Unverified | 0 |
| Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Feb 22, 2025 | 2D Panoptic Segmentation, 3D Scene Reconstruction | Unverified | 0 |
| Activation-wise Propagation: A Universal Strategy to Break Timestep Constraints in Spiking Neural Networks for 3D Data Processing | Feb 18, 2025 | Action Recognition, Autonomous Driving | Unverified | 0 |
| NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation | Jan 1, 2025 | Scene Segmentation | Unverified | 0 |
| Classification Drives Geographic Bias in Street Scene Segmentation | Dec 15, 2024 | Classification, Diversity | Unverified | 0 |
| Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation | Dec 4, 2024 | 3D Object Classification, Classification | Unverified | 0 |
| Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications | Nov 18, 2024 | Scene Segmentation, Scene Understanding | Unverified | 0 |
| Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model | Nov 13, 2024 | Decoder, Scene Segmentation | Unverified | 0 |
| OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | Nov 5, 2024 | Scene Parsing, Scene Segmentation | Unverified | 0 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous Driving, Benchmarking | Code Available | 1 |
| Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation | Oct 31, 2024 | Image Generation, Scene Segmentation | Unverified | 0 |
| Surgical Scene Segmentation by Transformer With Asymmetric Feature Enhancement | Oct 23, 2024 | Anatomy, Scene Segmentation | Code Available | 0 |
| Scene-Segmentation-Based Exposure Compensation for Tone Mapping of High Dynamic Range Scenes | Oct 21, 2024 | Multi-Exposure Image Fusion, Scene Segmentation | Unverified | 0 |
| Tackling domain generalization for out-of-distribution endoscopic imaging | Oct 18, 2024 | Domain Generalization, Scene Segmentation | Unverified | 0 |
| Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation | Oct 11, 2024 | Autonomous Vehicles, Out-of-Distribution Detection | Unverified | 0 |
| Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models | Oct 10, 2024 | Anatomy, Data Augmentation | Code Available | 0 |
| Individuation of 3D perceptual units from neurogeometry of binocular cells | Oct 3, 2024 | Scene Segmentation | Unverified | 0 |
| BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Sep 25, 2024 | image-classification, Image Classification | Code Available | 1 |
| Deep intra-operative illumination calibration of hyperspectral cameras | Sep 11, 2024 | parameter estimation, Scene Segmentation | Unverified | 0 |
| Real-Time Multi-Scene Visibility Enhancement for Promoting Navigational Safety of Vessels Under Complex Weather Conditions | Sep 2, 2024 | Computational Efficiency, object-detection | Code Available | 0 |
| Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images | Aug 27, 2024 | Organ Segmentation, Scene Segmentation | Code Available | 0 |
| Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian | Aug 7, 2024 | Autonomous Driving, object-detection | Code Available | 1 |
| Multimodal Fusion and Coherence Modeling for Video Topic Segmentation | Aug 1, 2024 | Contrastive Learning, Mixture-of-Experts | Unverified | 0 |
| SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception Networks | Jul 26, 2024 | Data Augmentation, Scene Segmentation | Code Available | 1 |
| CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM | Jul 9, 2024 | One-Shot Segmentation, Scene Segmentation | Unverified | 0 |
| RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation | Jul 8, 2024 | Autonomous Driving, Scene Segmentation | Unverified | 0 |
| Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation | Jul 1, 2024 | 3DGS, NeRF | Code Available | 0 |
| Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Jun 17, 2024 | feature selection, NeRF | Unverified | 0 |
| MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion | May 30, 2024 | Decision Making, Scene Segmentation | Code Available | 0 |
| Improved Convex Decomposition with Ensembling and Boolean Primitives | May 29, 2024 | regression, Scene Segmentation | Unverified | 0 |
| 3D Learnable Supertoken Transformer for LiDAR Point Cloud Scene Segmentation | May 23, 2024 | Clustering, Scene Segmentation | Unverified | 0 |
| TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System | May 22, 2024 | 3D Object Detection, 3D Semantic Segmentation | Unverified | 0 |
| Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation | Apr 19, 2024 | Clustering, Contrastive Learning | Unverified | 0 |
| Gaga: Group Any Gaussians via 3D-aware Memory Bank | Apr 11, 2024 | Contrastive Learning, Object Tracking | Unverified | 0 |
| No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Apr 5, 2024 | Few-Shot Learning, Scene Segmentation | Code Available | 4 |