| Breaking reCAPTCHAv2 | Sep 13, 2024 | Image Segmentation, Semantic Segmentation | Code Available | 3 |
| MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model | Nov 1, 2022 | Anomaly Detection, Brain Tumor Segmentation | Code Available | 3 |
| 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Aug 15, 2024 | image-classification, Image Classification | Code Available | 3 |
| Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Jun 10, 2024 | 3D Semantic Segmentation, Computed Tomography (CT) | Code Available | 3 |
| Anything-3D: Towards Single-view Anything Reconstruction in the Wild | Apr 19, 2023 | 3D Reconstruction, Diversity | Code Available | 3 |
| Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving | May 8, 2024 | Autonomous Driving, LIDAR Semantic Segmentation | Code Available | 3 |
| Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Jan 1, 2024 | Segmentation, Semantic Segmentation | Code Available | 3 |
| EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation | May 11, 2024 | Computational Efficiency, Decoder | Code Available | 3 |
| FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Mar 24, 2023 | 3D Hand Pose Estimation, GPU | Code Available | 3 |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Jan 1, 2023 | Image Generation, Image Segmentation | Code Available | 3 |
| FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | May 7, 2024 | 3D Point Cloud Classification, 3D Semantic Segmentation | Code Available | 3 |
| Personalize Segment Anything Model with One Shot | May 4, 2023 | Image Generation, model | Code Available | 3 |
| Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Nov 19, 2024 | Image Segmentation, Interactive Segmentation | Code Available | 3 |
| Point Transformer V3: Simpler, Faster, Stronger | Dec 15, 2023 | 3D Semantic Segmentation, LIDAR Semantic Segmentation | Code Available | 3 |
| Transformers in Medical Imaging: A Survey | Jan 24, 2022 | Image Classification, Image Segmentation | Code Available | 3 |
| CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | May 17, 2024 | Decoder, Mamba | Code Available | 3 |
| RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Jan 18, 2024 | All, Decoder | Code Available | 3 |
| Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform | Apr 9, 2018 | Image Super-Resolution, Semantic Segmentation | Code Available | 3 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose Estimation, Image Classification | Code Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | Diagnostic, Diversity | Code Available | 2 |
| UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Sep 18, 2021 | Change Detection, Decoder | Code Available | 2 |
| Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation | Jul 5, 2022 | Autonomous Driving, Collision Avoidance | Code Available | 2 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classification, object-detection | Code Available | 2 |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Mar 29, 2024 | Continual Learning, Continual Panoptic Segmentation | Code Available | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image Classification, Object Detection | Code Available | 2 |
| AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Sep 3, 2024 | Autonomous Driving, Deep Attention | Code Available | 2 |
| EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Apr 26, 2023 | Diversity, Domain Generalization | Code Available | 2 |
| ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Oct 8, 2019 | Dimensionality Reduction, image-classification | Code Available | 2 |
| Efficient 3D Semantic Segmentation with Superpoint Transformer | Jun 13, 2023 | 3D Semantic Segmentation, GPU | Code Available | 2 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image Segmentation, Segmentation | Code Available | 2 |
| DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Sep 17, 2022 | Motion Segmentation, Semantic Segmentation | Code Available | 2 |
| E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation | Mar 8, 2022 | GPU, Instance Segmentation | Code Available | 2 |
| Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation | Apr 21, 2024 | Semantic Segmentation, Video Object Segmentation | Code Available | 2 |
| DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Mar 17, 2024 | Semantic Segmentation, Weakly supervised Semantic Segmentation | Code Available | 2 |
| Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Mar 18, 2024 | Mixture-of-Experts, parameter-efficient fine-tuning | Code Available | 2 |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Mar 3, 2024 | Object, Representation Learning | Code Available | 2 |
| DreamColour: Controllable Video Colour Editing without Training | Dec 6, 2024 | Instance Segmentation, Semantic Segmentation | Code Available | 2 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive Learning, Image-text Retrieval | Code Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic Segmentation, Semi-Supervised Semantic Segmentation | Code Available | 2 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Jun 6, 2024 | Real-Time Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Apr 8, 2025 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPU, Object | Code Available | 2 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data Augmentation, Diversity | Code Available | 2 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine Learning, Classification | Code Available | 2 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| AiTLAS: Artificial Intelligence Toolbox for Earth Observation | Jan 21, 2022 | Benchmarking, Earth Observation | Code Available | 2 |
| 1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Feb 6, 2023 | Multi-class Classification, Panoptic Segmentation | Code Available | 2 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance Segmentation, Salient Object Detection | Code Available | 2 |