| Unifying Voxel-based Representation with Transformer for 3D Object Detection | Jun 1, 2022 | 3D Object DetectionDecoder | CodeCode Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure CorrectionImage Enhancement | CodeCode Available | 2 |
| Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training | May 28, 2022 | 3D Object Detection3D Point Cloud Classification | CodeCode Available | 2 |
| BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework | May 27, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation | May 27, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 |
| BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving | May 19, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection | May 16, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| ConvMAE: Masked Convolution Meets Masked Autoencoders | May 8, 2022 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Deep PCB To COCO Convertor | May 1, 2022 | ClassificationData Augmentation | CodeCode Available | 2 |
| TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving | Apr 28, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Understanding The Robustness in Vision Transformers | Apr 26, 2022 | Domain GeneralizationImage Classification | CodeCode Available | 2 |
| Focal Sparse Convolutional Networks for 3D Object Detection | Apr 26, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| K-LITE: Learning Transferable Visual Models with External Knowledge | Apr 20, 2022 | BenchmarkingDescriptive | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Objectobject-detection | CodeCode Available | 2 |
| YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss | Apr 14, 2022 | Multi-Person Pose Estimationobject-detection | CodeCode Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection | Apr 12, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Localization Distillation for Object Detection | Apr 12, 2022 | Knowledge DistillationObject | CodeCode Available | 2 |
| HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detection | Apr 7, 2022 | Objectobject-detection | CodeCode Available | 2 |
| DaViT: Dual Attention Vision Transformers | Apr 7, 2022 | Computational EfficiencyImage Classification | CodeCode Available | 2 |
| Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection | Apr 6, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |