| LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs | Jun 21, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders | Jun 20, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| ORFD: A Dataset and Benchmark for Off-Road Freespace Detection | Jun 20, 2022 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 2 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial OptimizationDenoising | CodeCode Available | 2 |
| Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions | Jun 8, 2022 | Autonomous VehiclesNavigate | CodeCode Available | 2 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 |
| PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers | Jun 4, 2022 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| What Are Expected Queries in End-to-End Object Detection? | Jun 2, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure CorrectionImage Enhancement | CodeCode Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization | May 16, 2022 | graph partitioningSegmentation | CodeCode Available | 2 |
| Surface Representation for Point Clouds | May 11, 2022 | 3D Object Detection3D Point Cloud Classification | CodeCode Available | 2 |
| ConvMAE: Masked Convolution Meets Masked Autoencoders | May 8, 2022 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Neural 3D Scene Reconstruction with the Manhattan-world Assumption | May 5, 2022 | 2D Semantic Segmentation3D Reconstruction | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and Algorithms | Apr 28, 2022 | ArticlesSegmentation | CodeCode Available | 2 |
| HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Apr 27, 2022 | Domain AdaptationGPU | CodeCode Available | 2 |
| Understanding The Robustness in Vision Transformers | Apr 26, 2022 | Domain GeneralizationImage Classification | CodeCode Available | 2 |
| Toward Fast, Flexible, and Robust Low-Light Image Enhancement | Apr 21, 2022 | Computational EfficiencyFace Detection | CodeCode Available | 2 |
| RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds | Apr 19, 2022 | Semantic SegmentationSurface Reconstruction | CodeCode Available | 2 |
| Temporally Efficient Vision Transformer for Video Instance Segmentation | Apr 18, 2022 | Instance SegmentationSemantic Segmentation | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |