OMG-Seg: Is One Model Good Enough For All Segmentation? Jan 18, 2024 All Decoder
Code Code Available 55 Faster Segment Anything: Towards Lightweight SAM for Mobile Applications Jun 25, 2023 CPU Decoder
Code Code Available 55 Detectron2 Object Detection & Manipulating Images using Cartoonization Aug 1, 2021 Autonomous Vehicles Data Visualization
Code Code Available 45 Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering Jan 12, 2024 3D Panoptic Segmentation 3D Semantic Segmentation
Code Code Available 45 SegGPT: Segmenting Everything In Context Apr 6, 2023 Few-Shot Semantic Segmentation In-Context Learning
Code Code Available 45 Panoptic Feature Pyramid Networks Jan 8, 2019 Instance Segmentation Panoptic Segmentation
Code Code Available 45 Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation Jun 6, 2022 Image Segmentation Instance Segmentation
Code Code Available 45 Visual Attention Network Feb 20, 2022 image-classification Image Classification
Code Code Available 45 4D Panoptic Scene Graph Generation May 16, 2024 4D Panoptic Segmentation Graph Generation
Code Code Available 35 A Simple Framework for Open-Vocabulary Segmentation and Detection Mar 14, 2023 Instance Segmentation Panoptic Segmentation
Code Code Available 35 Generalized Decoding for Pixel, Image, and Language Dec 21, 2022 Decoder Image Segmentation
Code Code Available 35 ResNeSt: Split-Attention Networks Apr 19, 2020 image-classification Image Classification
Code Code Available 35 Vision Transformer Adapter for Dense Predictions May 17, 2022 Instance Segmentation Object Detection
Code Code Available 35 Tracking Anything with Decoupled Video Segmentation Sep 7, 2023 Open-Vocabulary Video Segmentation Open-World Video Segmentation
Code Code Available 35 OneFormer: One Transformer to Rule Universal Image Segmentation Nov 10, 2022 Instance Segmentation Panoptic Segmentation
Code Code Available 35 PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model Mar 21, 2024 Decoder Generalized Referring Expression Segmentation
Code Code Available 35 RAP-SAM: Towards Real-Time All-Purpose Segment Anything Jan 18, 2024 All Decoder
Code Code Available 35 Aligning and Prompting Everything All at Once for Universal Visual Perception Dec 4, 2023 All Object
Code Code Available 25 DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution Jun 3, 2020 Instance Segmentation Object
Code Code Available 25 Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration Jan 23, 2024 3D Semantic Segmentation Autonomous Driving
Code Code Available 25 PVO: Panoptic Visual Odometry Jul 4, 2022 Camera Pose Estimation Optical Flow Estimation
Code Code Available 25 PosSAM: Panoptic Open-vocabulary Segment Anything Mar 14, 2024 Decoder Open Vocabulary Panoptic Segmentation
Code Code Available 25 PEM: Prototype-based Efficient MaskFormer for Image Segmentation Feb 29, 2024 Image Segmentation Panoptic Segmentation
Code Code Available 25 Per-Pixel Classification is Not All You Need for Semantic Segmentation Jul 13, 2021 All Classification
Code Code Available 25 Scalable SoftGroup for 3D Instance Segmentation on Point Clouds Sep 17, 2022 3D Instance Segmentation Instance Segmentation
Code Code Available 25 Open-World Entity Segmentation Jul 29, 2021 Image Manipulation Image Segmentation
Code Code Available 25 CellViT: Vision Transformers for Precise Cell Segmentation and Classification Jun 27, 2023 Cell Detection Cell Segmentation
Code Code Available 25 SAD: Segment Any RGBD May 23, 2023 3D Panoptic Segmentation Open Vocabulary Semantic Segmentation
Code Code Available 25 Context-Aware Video Instance Segmentation Jul 3, 2024 Instance Segmentation Panoptic Segmentation
Code Code Available 25 Scene-Centric Unsupervised Panoptic Segmentation Apr 2, 2025 Instance Segmentation Panoptic Segmentation
Code Code Available 25 A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting Jan 18, 2024 Instance Segmentation Interactive Segmentation
Code Code Available 25 Mask2Former for Video Instance Segmentation Dec 20, 2021 Image Segmentation Instance Segmentation
Code Code Available 25 OneFormer3D: One Transformer for Unified Point Cloud Segmentation Nov 24, 2023 3D Instance Segmentation 3D Object Detection
Code Code Available 25 Panoptic Lifting for 3D Scene Understanding with Neural Fields Dec 19, 2022 2D Panoptic Segmentation Panoptic Segmentation
Code Code Available 25 HyperSeg: Towards Universal Visual Segmentation with Large Language Model Nov 26, 2024 Language Modeling Large Language Model
Code Code Available 25 Better Call SAL: Towards Learning to Segment Anything in Lidar Mar 19, 2024 Panoptic Segmentation Segmentation
Code Code Available 25 Image Segmentation in Foundation Model Era: A Survey Aug 23, 2024 Image Segmentation Instance Segmentation
Code Code Available 25 Hierarchical Multi-Scale Attention for Semantic Segmentation May 21, 2020 Panoptic Segmentation Semantic Segmentation
Code Code Available 25 Hierarchical Open-vocabulary Universal Image Segmentation Jul 3, 2023 Image Comprehension Image Segmentation
Code Code Available 25 Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model May 18, 2023 Image Generation Language Modeling
Code Code Available 25 BatchFormerV2: Exploring Sample Relationships for Dense Representation Learning Apr 4, 2022 image-classification Image Classification
Code Code Available 25 Masked-attention Mask Transformer for Universal Image Segmentation Dec 2, 2021 2D Semantic Segmentation Image Segmentation
Code Code Available 25 Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation Mar 17, 2020 image-classification Image Classification
Code Code Available 25 A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future Jul 18, 2023 Knowledge Distillation object-detection
Code Code Available 25 Dilated Neighborhood Attention Transformer Sep 29, 2022 Image Classification Instance Segmentation
Code Code Available 25 Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Mar 8, 2023 Open Vocabulary Panoptic Segmentation Open Vocabulary Semantic Segmentation
Code Code Available 25 1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop Feb 6, 2023 Multi-class Classification Panoptic Segmentation
Code Code Available 25 ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning Mar 29, 2024 Continual Learning Continual Panoptic Segmentation
Code Code Available 25 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Oct 2, 2023 image-classification Image Classification
Code Code Available 25 Focal Modulation Networks Mar 22, 2022 image-classification Image Classification
Code Code Available 25