| HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Jul 28, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| ALBench: A Framework for Evaluating Active Learning in Object Detection | Jul 27, 2022 | Active Learningimage-classification | CodeCode Available | 2 |
| ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization | Jul 27, 2022 | 3D Shape Reconstruction3D Shape Reconstruction From A Single 2D Image | CodeCode Available | 2 |
| Monocular 3D Object Detection with Depth from Motion | Jul 26, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones | Jul 26, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection | Jul 21, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Jul 21, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Fully Sparse 3D Object Detection | Jul 20, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Jul 14, 2022 | AttributeMultiple Instance Learning | CodeCode Available | 2 |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Jul 11, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity | Jul 7, 2022 | Object DetectionSemantic Segmentation | CodeCode Available | 2 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection | Jul 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 |
| BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection | Jun 21, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs | Jun 21, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders | Jun 20, 2022 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| 3D Object Detection for Autonomous Driving: A Comprehensive Survey | Jun 19, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions | Jun 16, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| LinK3D: Linear Keypoints Representation for 3D LiDAR Point Cloud | Jun 13, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 |
| Tutel: Adaptive Mixture-of-Experts at Scale | Jun 7, 2022 | Mixture-of-ExpertsObject Detection | CodeCode Available | 2 |
| Slim-neck by GSConv: A lightweight-design for real-time detector architectures | Jun 6, 2022 | Autonomous VehiclesEdge-computing | CodeCode Available | 2 |
| What Are Expected Queries in End-to-End Object Detection? | Jun 2, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| Unifying Voxel-based Representation with Transformer for 3D Object Detection | Jun 1, 2022 | 3D Object DetectionDecoder | CodeCode Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure CorrectionImage Enhancement | CodeCode Available | 2 |
| Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training | May 28, 2022 | 3D Object Detection3D Point Cloud Classification | CodeCode Available | 2 |
| BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework | May 27, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving | May 19, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection | May 16, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| ConvMAE: Masked Convolution Meets Masked Autoencoders | May 8, 2022 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Deep PCB To COCO Convertor | May 1, 2022 | ClassificationData Augmentation | CodeCode Available | 2 |
| TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving | Apr 28, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Understanding The Robustness in Vision Transformers | Apr 26, 2022 | Domain GeneralizationImage Classification | CodeCode Available | 2 |
| Focal Sparse Convolutional Networks for 3D Object Detection | Apr 26, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| K-LITE: Learning Transferable Visual Models with External Knowledge | Apr 20, 2022 | BenchmarkingDescriptive | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Objectobject-detection | CodeCode Available | 2 |
| YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss | Apr 14, 2022 | Multi-Person Pose Estimationobject-detection | CodeCode Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection | Apr 12, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Localization Distillation for Object Detection | Apr 12, 2022 | Knowledge DistillationObject | CodeCode Available | 2 |
| HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detection | Apr 7, 2022 | Objectobject-detection | CodeCode Available | 2 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 |
| Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection | Apr 6, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |