| Spherical Transformer for LiDAR-based 3D Recognition | Mar 22, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Generative Semantic Segmentation | Mar 20, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| M^2SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation | Mar 20, 2023 | Computed Tomography (CT)Decoder | CodeCode Available | 2 |
| Towards Diverse Binary Segmentation via A Simple yet General Gated Network | Mar 18, 2023 | DecoderSegmentation | CodeCode Available | 2 |
| MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Mar 17, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Large Selective Kernel Network for Remote Sensing Object Detection | Mar 16, 2023 | Objectobject-detection | CodeCode Available | 2 |
| BiFormer: Vision Transformer with Bi-Level Routing Attention | Mar 15, 2023 | Computational EfficiencyGPU | CodeCode Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Mar 15, 2023 | DecoderInstance Segmentation | CodeCode Available | 2 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Mar 8, 2023 | Semantic Segmentation | CodeCode Available | 2 |
| Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern Analysis | Mar 4, 2023 | BenchmarkingContrastive Learning | CodeCode Available | 2 |
| Unleashing Text-to-Image Diffusion Models for Visual Perception | Mar 3, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Feb 23, 2023 | Language ModellingOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| 1st Place Solution for PSG competition with ECCV'22 SenseHuman Workshop | Feb 6, 2023 | Multi-class ClassificationPanoptic Segmentation | CodeCode Available | 2 |
| MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Feb 3, 2023 | ObjectSegmentation | CodeCode Available | 2 |
| Audio-Visual Segmentation with Semantics | Jan 30, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsamplingimage-classification | CodeCode Available | 2 |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Jan 12, 2023 | Semantic SegmentationTime Series | CodeCode Available | 2 |
| Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Jan 3, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| XNet: Wavelet-Based Low and High Frequency Fusion Networks for Fully- and Semi-Supervised Semantic Segmentation of Biomedical Images | Jan 1, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Reversible Column Networks | Dec 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |