YOLO-World: Real-Time Open-Vocabulary Object Detection Jan 30, 2024 Instance Segmentation Language Modeling
Code Code Available 9MambaVision: A Hybrid Mamba-Transformer Vision Backbone Jul 10, 2024 Image Classification Instance Segmentation
Code Code Available 7MambaOut: Do We Really Need Mamba for Vision? May 13, 2024 image-classification Image Classification
Code Code Available 74M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Jun 13, 2024 Instance Segmentation multimodal generation
Code Code Available 5YOLOR-Based Multi-Task Learning Sep 29, 2023 Image Captioning Instance Segmentation
Code Code Available 5Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Jun 25, 2023 CPU Decoder
Code Code Available 5OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Feb 27, 2025 Image Classification Instance Segmentation
Code Code Available 4EmbodiedSAM: Online Segment Any 3D Thing in Real Time Aug 21, 2024 3D Instance Segmentation GPU
Code Code Available 4InstanceDiffusion: Instance-level Control for Image Generation Feb 5, 2024 Conditional Text-to-Image Synthesis Image Generation
Code Code Available 4LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model Dec 28, 2023 Instance Segmentation Language Modeling
Code Code Available 4EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything Dec 1, 2023 Decoder image-classification
Code Code Available 4RTMDet: An Empirical Study of Designing Real-Time Object Detectors Dec 14, 2022 GPU Instance Segmentation
Code Code Available 4InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions Nov 10, 2022 2D Object Detection Classification
Code Code Available 4GLIPv2: Unifying Localization and Vision-Language Understanding Jun 12, 2022 2D Object Detection Contrastive Learning
Code Code Available 4Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation Jun 6, 2022 Image Segmentation Instance Segmentation
Code Code Available 4EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction May 29, 2022 Autonomous Driving CPU
Code Code Available 4Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN May 27, 2022 Image Classification Instance Segmentation
Code Code Available 4Visual Attention Network Feb 20, 2022 image-classification Image Classification
Code Code Available 4Detectron2 Object Detection & Manipulating Images using Cartoonization Aug 1, 2021 Autonomous Vehicles Data Visualization
Code Code Available 4Panoptic Feature Pyramid Networks Jan 8, 2019 Instance Segmentation Panoptic Segmentation
Code Code Available 4No time to train! Training-Free Reference-Based Instance Segmentation Jul 3, 2025 Cross-Domain Few-Shot Object Detection Few-Shot Object Detection
Code Code Available 3UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface Mar 3, 2025 Instance Segmentation Reasoning Segmentation
Code Code Available 3InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation Aug 28, 2024 Cell Segmentation GPU
Code Code Available 3A Survey of Camouflaged Object Detection and Beyond Aug 26, 2024 Instance Segmentation Object
Code Code Available 35%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks Aug 15, 2024 image-classification Image Classification
Code Code Available 3Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation Jun 4, 2024 2D Object Detection 3D Instance Segmentation
Code Code Available 3PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Mar 26, 2024 Image Classification Instance Segmentation
Code Code Available 3MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining Mar 20, 2024 Aerial Scene Classification Building change detection for remote sensing images
Code Code Available 3ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions Mar 13, 2024 Instance Segmentation Object Detection
Code Code Available 3General Object Foundation Model for Images and Videos at Scale Dec 14, 2023 Instance Segmentation Long-tail Video Object Segmentation
Code Code Available 3Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment Dec 1, 2023 Contrastive Learning Few-Shot Learning
Code Code Available 3VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation Aug 28, 2023 Instance Segmentation Optical Flow Estimation
Code Code Available 3A Simple Framework for Open-Vocabulary Segmentation and Detection Mar 14, 2023 Instance Segmentation Panoptic Segmentation
Code Code Available 3Universal Instance Perception as Object Discovery and Retrieval Mar 12, 2023 Described Object Detection Generalized Referring Expression Comprehension
Code Code Available 3Cut and Learn for Unsupervised Object Detection and Instance Segmentation Jan 26, 2023 Instance Segmentation object-detection
Code Code Available 3Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling Jan 9, 2023 2D Object Detection Contrastive Learning
Code Code Available 3Generalized Decoding for Pixel, Image, and Language Dec 21, 2022 Decoder Image Segmentation
Code Code Available 3DETRs with Collaborative Hybrid Assignments Training Nov 22, 2022 Decoder Instance Segmentation
Code Code Available 3OneFormer: One Transformer to Rule Universal Image Segmentation Nov 10, 2022 Instance Segmentation Panoptic Segmentation
Code Code Available 3Vision Transformers: From Semantic Segmentation to Dense Prediction Jul 19, 2022 image-classification Image Classification
Code Code Available 3Vision Transformer Adapter for Dense Predictions May 17, 2022 Instance Segmentation Object Detection
Code Code Available 3Nuclei instance segmentation and classification in histopathology images with StarDist Mar 3, 2022 Classification Instance Segmentation
Code Code Available 3XCiT: Cross-Covariance Image Transformers Jun 17, 2021 image-classification Image Classification
Code Code Available 3ResNeSt: Split-Attention Networks Apr 19, 2020 image-classification Image Classification
Code Code Available 3The Missing Point in Vision Transformers for Universal Image Segmentation May 26, 2025 Image Segmentation Instance Segmentation
Code Code Available 2P2Object: Single Point Supervised Object Detection and Instance Segmentation Apr 10, 2025 Instance Segmentation Multiple Instance Learning
Code Code Available 2Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery Apr 3, 2025 Field Boundary Delineation Instance Segmentation
Code Code Available 2Scene-Centric Unsupervised Panoptic Segmentation Apr 2, 2025 Instance Segmentation Panoptic Segmentation
Code Code Available 2Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Mar 18, 2025 Instance Segmentation Object
Code Code Available 2DAMamba: Vision State Space Model with Dynamic Adaptive Scan Feb 18, 2025 image-classification Image Classification
Code Code Available 2