| ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | May 18, 2023 | 1 Image, 2*2 StitchiAction Classification | CodeCode Available | 3 |
| Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Mar 30, 2023 | Human ParsingPedestrian Attribute Recognition | CodeCode Available | 3 |
| MA-Net: A Multi-Scale Attention Network for Liver and Tumor Segmentation | Sep 21, 2020 | Image SegmentationMedical Image Segmentation | CodeCode Available | 3 |
| Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Nov 19, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 3 |
| LangSplat: 3D Language Gaussian Splatting | Dec 26, 2023 | NeRFObject Localization | CodeCode Available | 3 |
| InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Aug 28, 2024 | Cell SegmentationGPU | CodeCode Available | 3 |
| CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation | May 17, 2024 | DecoderMamba | CodeCode Available | 3 |
| PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Mar 26, 2024 | Image ClassificationInstance Segmentation | CodeCode Available | 3 |
| Point-SAM: Promptable 3D Segmentation Model for Point Clouds | Jun 25, 2024 | Image SegmentationSegmentation | CodeCode Available | 3 |
| Point Transformer V3: Simpler, Faster, Stronger | Dec 15, 2023 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 3 |
| LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation | Mar 8, 2024 | Image SegmentationMamba | CodeCode Available | 3 |
| RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Jan 18, 2024 | AllDecoder | CodeCode Available | 3 |
| 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Aug 15, 2024 | image-classificationImage Classification | CodeCode Available | 3 |
| ResNeSt: Split-Attention Networks | Apr 19, 2020 | image-classificationImage Classification | CodeCode Available | 3 |
| How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? | Jan 20, 2025 | Computed Tomography (CT)GPU | CodeCode Available | 3 |
| ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features | Feb 6, 2025 | Image SegmentationSegmentation | CodeCode Available | 3 |
| A Survey of Camouflaged Object Detection and Beyond | Aug 26, 2024 | Instance SegmentationObject | CodeCode Available | 3 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Dec 1, 2023 | Contrastive LearningFew-Shot Learning | CodeCode Available | 3 |
| A Simple Framework for Open-Vocabulary Segmentation and Detection | Mar 14, 2023 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 3 |
| SAM-Med2D | Aug 30, 2023 | DecoderImage Segmentation | CodeCode Available | 3 |
| ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders | Jan 2, 2023 | Object DetectionRepresentation Learning | CodeCode Available | 3 |
| Generalized Decoding for Pixel, Image, and Language | Dec 21, 2022 | DecoderImage Segmentation | CodeCode Available | 3 |
| How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model | Apr 15, 2024 | DecoderImage Segmentation | CodeCode Available | 3 |
| FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Mar 24, 2023 | 3D Hand Pose EstimationGPU | CodeCode Available | 3 |
| Anything-3D: Towards Single-view Anything Reconstruction in the Wild | Apr 19, 2023 | 3D ReconstructionDiversity | CodeCode Available | 3 |