| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 | 5 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 | 5 |
| OpenScene: 3D Scene Understanding with Open Vocabularies | Nov 28, 2022 | 3D Open-Vocabulary Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 | 5 |
| Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Mar 21, 2024 | Image GenerationSemantic Segmentation | CodeCode Available | 2 | 5 |
| A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | May 24, 2023 | Dense Pixel Correspondence EstimationRepresentation Learning | CodeCode Available | 2 | 5 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 | 5 |
| Open-World Entity Segmentation | Jul 29, 2021 | Image ManipulationImage Segmentation | CodeCode Available | 2 | 5 |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Nov 15, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 | 5 |
| Is Your LiDAR Placement Optimized for 3D Scene Understanding? | Mar 25, 2024 | 3D Object DetectionLIDAR Semantic Segmentation | CodeCode Available | 2 | 5 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 2 | 5 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| Coordinate Attention for Efficient Mobile Network Design | Mar 4, 2021 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking | Sep 8, 2021 | BenchmarkingDiversity | CodeCode Available | 2 | 5 |
| PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation | Jan 23, 2024 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | ObjectSegmentation | CodeCode Available | 2 | 5 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image SegmentationPanoptic Segmentation | CodeCode Available | 2 | 5 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 | 5 |
| Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Apr 2, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Sep 6, 2024 | Benchmarkingimage-classification | CodeCode Available | 2 | 5 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks | May 5, 2021 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Co-Occurrent Features in Semantic Segmentation | Jun 1, 2019 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |