| No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Apr 5, 2024 | Few-Shot LearningScene Segmentation | CodeCode Available | 4 |
| BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation | May 26, 2022 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 4 |
| Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Mar 18, 2025 | Instance SegmentationObject | CodeCode Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 |
| Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters | Jul 4, 2022 | Autonomous DrivingScene Segmentation | CodeCode Available | 2 |
| Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation | Mar 29, 2022 | Instance SegmentationNeRF | CodeCode Available | 2 |
| UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Sep 18, 2021 | Change DetectionDecoder | CodeCode Available | 2 |
| Simplifying Object Segmentation with PixelLib Library | Jan 20, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking | May 13, 2025 | DiversityMamba | CodeCode Available | 1 |
| The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs | Mar 25, 2025 | BenchmarkingScene Segmentation | CodeCode Available | 1 |
| MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps | Mar 23, 2025 | Scene SegmentationVideo Understanding | CodeCode Available | 1 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Sep 25, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian | Aug 7, 2024 | Autonomous Drivingobject-detection | CodeCode Available | 1 |
| SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception Networks | Jul 26, 2024 | Data AugmentationScene Segmentation | CodeCode Available | 1 |
| GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction | Mar 26, 2024 | 3D Human Pose EstimationImage Reconstruction | CodeCode Available | 1 |
| Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Feb 17, 2024 | Panoptic SegmentationScene Segmentation | CodeCode Available | 1 |
| Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation | Jan 30, 2024 | Autonomous VehiclesScene Segmentation | CodeCode Available | 1 |
| Neighbor Relations Matter in Video Scene Detection | Jan 1, 2024 | Scene Segmentation | CodeCode Available | 1 |
| The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark | Dec 19, 2023 | AnatomyInstance Segmentation | CodeCode Available | 1 |
| SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation | Nov 29, 2023 | Scene SegmentationScene Understanding | CodeCode Available | 1 |
| Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation | Nov 21, 2023 | Depth EstimationDomain Adaptation | CodeCode Available | 1 |
| CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Nov 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| Uncertainty Estimation for Safety-critical Scene Segmentation via Fine-grained Reward Maximization | Nov 5, 2023 | Scene SegmentationSegmentation | CodeCode Available | 1 |
| GNeSF: Generalizable Neural Semantic Fields | Oct 24, 2023 | 3D Semantic SegmentationScene Segmentation | CodeCode Available | 1 |
| APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds | Sep 29, 2023 | Scene SegmentationSemantic Segmentation | CodeCode Available | 1 |
| MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation | Sep 16, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance | Sep 15, 2023 | 2k4k | CodeCode Available | 1 |
| AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation | Aug 7, 2023 | Scene SegmentationSegmentation | CodeCode Available | 1 |
| Unmasking Anomalies in Road-Scene Segmentation | Jul 25, 2023 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 1 |
| VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions | May 30, 2023 | Dialogue GenerationDialogue Understanding | CodeCode Available | 1 |
| SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery | Apr 19, 2023 | Question AnsweringScene Segmentation | CodeCode Available | 1 |
| Self-positioning Point-based Transformer for Point Cloud Understanding | Mar 29, 2023 | 3D Part Segmentation3D Point Cloud Classification | CodeCode Available | 1 |
| Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies | Mar 26, 2023 | Highlight DetectionLearning with noisy labels | CodeCode Available | 1 |
| Neural Implicit Vision-Language Feature Fields | Mar 20, 2023 | Image SegmentationLanguage Modeling | CodeCode Available | 1 |
| Paced-Curriculum Distillation with Prediction and Label Uncertainty for Image Segmentation | Feb 2, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction | Jan 1, 2023 | 3D Scene ReconstructionImage Segmentation | CodeCode Available | 1 |
| Efficient Movie Scene Detection using State-Space Transformers | Dec 29, 2022 | GPUScene Segmentation | CodeCode Available | 1 |
| Push-the-Boundary: Boundary-aware Feature Propagation for Semantic Segmentation of 3D Point Clouds | Dec 23, 2022 | Multi-Task LearningObject | CodeCode Available | 1 |
| Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation | Nov 26, 2022 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 1 |
| Unsupervised RGB-to-Thermal Domain Adaptation via Multi-Domain Attention Network | Oct 9, 2022 | Domain Adaptationimage-classification | CodeCode Available | 1 |
| Diffusion Unit: Interpretable Edge Enhancement and Suppression Learning for 3D Point Cloud Segmentation | Sep 20, 2022 | 3D Part SegmentationPoint Cloud Segmentation | CodeCode Available | 1 |
| DenseHybrid: Hybrid Anomaly Detection for Dense Open-set Recognition | Jul 6, 2022 | Anomaly DetectionOpen Set Learning | CodeCode Available | 1 |
| IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments | Jun 27, 2022 | Autonomous VehiclesScene Segmentation | CodeCode Available | 1 |
| Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You Need | Jun 23, 2022 | AllDeep Learning | CodeCode Available | 1 |
| Scene Consistency Representation Learning for Video Scene Segmentation | May 11, 2022 | Data AugmentationInductive Bias | CodeCode Available | 1 |
| FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation | Apr 4, 2022 | Domain AdaptationFoggy Scene Segmentation | CodeCode Available | 1 |
| Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation | Mar 29, 2022 | Contrastive LearningRelation | CodeCode Available | 1 |
| Test-time Adaptation with Slot-Centric Models | Mar 21, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Boundary-aware Self-supervised Learning for Video Scene Segmentation | Jan 14, 2022 | Scene SegmentationSelf-Supervised Learning | CodeCode Available | 1 |