| LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs | Jun 21, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| ORFD: A Dataset and Benchmark for Off-Road Freespace Detection | Jun 20, 2022 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 2 |
| Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders | Jun 20, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial OptimizationDenoising | CodeCode Available | 2 |
| Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions | Jun 8, 2022 | Autonomous VehiclesNavigate | CodeCode Available | 2 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 |
| PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers | Jun 4, 2022 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| What Are Expected Queries in End-to-End Object Detection? | Jun 2, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure CorrectionImage Enhancement | CodeCode Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization | May 16, 2022 | graph partitioningSegmentation | CodeCode Available | 2 |
| Surface Representation for Point Clouds | May 11, 2022 | 3D Object Detection3D Point Cloud Classification | CodeCode Available | 2 |
| ConvMAE: Masked Convolution Meets Masked Autoencoders | May 8, 2022 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Neural 3D Scene Reconstruction with the Manhattan-world Assumption | May 5, 2022 | 2D Semantic Segmentation3D Reconstruction | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and Algorithms | Apr 28, 2022 | ArticlesSegmentation | CodeCode Available | 2 |
| HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Apr 27, 2022 | Domain AdaptationGPU | CodeCode Available | 2 |
| Understanding The Robustness in Vision Transformers | Apr 26, 2022 | Domain GeneralizationImage Classification | CodeCode Available | 2 |
| Toward Fast, Flexible, and Robust Low-Light Image Enhancement | Apr 21, 2022 | Computational EfficiencyFace Detection | CodeCode Available | 2 |
| RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds | Apr 19, 2022 | Semantic SegmentationSurface Reconstruction | CodeCode Available | 2 |
| Temporally Efficient Vision Transformer for Video Instance Segmentation | Apr 18, 2022 | Instance SegmentationSemantic Segmentation | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |
| Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation | Apr 15, 2022 | 3D Semantic SegmentationColorization | CodeCode Available | 2 |
| ResT V2: Simpler, Faster and Stronger | Apr 15, 2022 | Semantic Segmentation | CodeCode Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Apr 14, 2022 | Knowledge DistillationSegmentation | CodeCode Available | 2 |
| TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation | Apr 12, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface Model | Apr 8, 2022 | Semantic Segmentation | CodeCode Available | 2 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |
| FocalClick: Towards Practical Interactive Image Segmentation | Apr 6, 2022 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Region Rebalance for Long-Tailed Semantic Segmentation | Apr 5, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| MultiMAE: Multi-modal Multi-task Masked Autoencoders | Apr 4, 2022 | Depth Estimationimage-classification | CodeCode Available | 2 |
| Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data | Mar 30, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection | Mar 30, 2022 | 2D Object DetectionBilevel Optimization | CodeCode Available | 2 |
| Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation | Mar 29, 2022 | Instance SegmentationNeRF | CodeCode Available | 2 |
| Rethinking Semantic Segmentation: A Prototype View | Mar 28, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Stratified Transformer for 3D Point Cloud Segmentation | Mar 28, 2022 | Point Cloud SegmentationPosition | CodeCode Available | 2 |
| Deep Hierarchical Semantic Segmentation | Mar 27, 2022 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | CodeCode Available | 2 |
| Video Polyp Segmentation: A Deep Learning Perspective | Mar 27, 2022 | AttributeDeep Learning | CodeCode Available | 2 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Mar 25, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Mar 24, 2022 | Image GenerationSemantic Segmentation | CodeCode Available | 2 |
| Sparse Instance Activation for Real-Time Instance Segmentation | Mar 24, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | ObjectSegmentation | CodeCode Available | 2 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Scribble-Supervised LiDAR Semantic Segmentation | Mar 16, 2022 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Unsupervised Semantic Segmentation by Distilling Feature Correspondences | Mar 16, 2022 | FormSemantic Segmentation | CodeCode Available | 2 |