| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Apr 14, 2023 | Image Enhancement, Low-Light Image Enhancement | Code Available | 2 | 5 |
| Learning Semantic Segmentation of Large-Scale Point Clouds with Random Sampling | Jul 6, 2021 | Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain Adaptation, Domain Generalization | Code Available | 2 | 5 |
| Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jan 1, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Aug 13, 2020 | Autonomous Vehicles, Bird's-Eye View Semantic Segmentation | Code Available | 2 | 5 |
| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | Object, Segmentation | Code Available | 2 | 5 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine Learning, Classification | Code Available | 2 | 5 |
| Locality Alignment Improves Vision-Language Models | Oct 14, 2024 | Semantic Segmentation, Spatial Reasoning | Code Available | 2 | 5 |
| DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Mar 24, 2025 | 3D Semantic Segmentation, LIDAR Semantic Segmentation | Code Available | 2 | 5 |
| LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jul 9, 2024 | 3D Reconstruction, Autonomous Navigation | Code Available | 2 | 5 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data Augmentation, Diversity | Code Available | 2 | 5 |
| LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Jan 17, 2025 | Change Detection, Image Classification | Code Available | 2 | 5 |
| Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Mar 24, 2022 | Image Generation, Semantic Segmentation | Code Available | 2 | 5 |
| Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Apr 4, 2025 | Domain Generalization, Mamba | Code Available | 2 | 5 |
| DreamColour: Controllable Video Colour Editing without Training | Dec 6, 2024 | Instance Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial Optimization, Denoising | Code Available | 2 | 5 |
| A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Apr 25, 2024 | Autonomous Driving, Evolutionary Algorithms | Code Available | 2 | 5 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene Classification, Building change detection for remote sensing images | Code Available | 2 | 5 |
| Digital Twin Generation from Visual Data: A Survey | Apr 17, 2025 | Semantic Segmentation, Survey | Code Available | 2 | 5 |
| A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark | Feb 28, 2022 | Image Segmentation, Inductive Bias | Code Available | 2 | 5 |
| Masked Generative Distillation | May 3, 2022 | image-classification, Image Classification | Code Available | 2 | 5 |
| Mask-Free Video Instance Segmentation | Mar 28, 2023 | Instance Segmentation, Optical Flow Estimation | Code Available | 2 | 5 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | Denoising, Image Segmentation | Code Available | 2 | 5 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | Segmentation, Semantic Segmentation | Code Available | 2 | 5 |
| MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Mar 29, 2024 | Image Segmentation, Medical Image Analysis | Code Available | 2 | 5 |
| MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Sep 28, 2024 | Image Segmentation, Medical Image Analysis | Code Available | 2 | 5 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image Classification, Instance Segmentation | Code Available | 2 | 5 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance Segmentation, Interactive Segmentation | Code Available | 2 | 5 |
| 3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation | Jun 23, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Jan 3, 2023 | Autonomous Driving, Benchmarking | Code Available | 2 | 5 |
| 1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Feb 6, 2023 | Multi-class Classification, Panoptic Segmentation | Code Available | 2 | 5 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision Making, Scene Segmentation | Code Available | 2 | 5 |
| DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Mar 9, 2025 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| Merging Context Clustering with Visual State Space Models for Medical Image Segmentation | Jan 3, 2025 | Clustering, Image Segmentation | Code Available | 2 | 5 |
| MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning | May 14, 2025 | Anomaly Detection, Anomaly Segmentation | Code Available | 2 | 5 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video Segmentation, Object | Code Available | 2 | 5 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptation, image-classification | Code Available | 2 | 5 |
| MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training | Aug 3, 2022 | Instance Segmentation, Segmentation | Code Available | 2 | 5 |
| Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Apr 2, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | Code Available | 2 | 5 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometry, Decoder | Code Available | 2 | 5 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometry, Autonomous Driving | Code Available | 2 | 5 |
| Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks | May 5, 2021 | image-classification, Image Classification | Code Available | 2 | 5 |
| MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Feb 3, 2023 | Object, Segmentation | Code Available | 2 | 5 |
| Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation | Aug 31, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | Decoder, Instance Segmentation | Code Available | 2 | 5 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive Learning, Image-text Retrieval | Code Available | 2 | 5 |
| An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Nov 25, 2024 | Denoising, Scene Understanding | Code Available | 2 | 5 |
| Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Jan 15, 2025 | Image Segmentation, Referring Expression Segmentation | Code Available | 2 | 5 |