| H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-Domain Weakly Supervised Object Detection | Jan 1, 2022 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| Accurate and Real-time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network | Dec 31, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks | Dec 30, 2021 | CPUimage-classification | CodeCode Available | 1 |
| MetaGraspNet_v0: A Large-Scale Benchmark Dataset for Vision-driven Robotic Grasping via Physics-based Metaverse Synthesis | Dec 29, 2021 | Objectobject-detection | CodeCode Available | 1 |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Dec 29, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Augmenting Convolutional networks with attention-based aggregation | Dec 27, 2021 | ClassificationImage Classification | CodeCode Available | 1 |
| ELSA: Enhanced Local Self-Attention for Vision Transformer | Dec 23, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 1 |
| Class-aware Sounding Objects Localization via Audiovisual Correspondence | Dec 22, 2021 | Objectobject-detection | CodeCode Available | 1 |
| YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles | Dec 22, 2021 | Autonomous RacingAutonomous Vehicles | CodeCode Available | 1 |
| Leveraging Synthetic Data in Object Detection on Unmanned Aerial Vehicles | Dec 22, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Learned Queries for Efficient Local Attention | Dec 21, 2021 | Image ClassificationObject Detection | CodeCode Available | 1 |
| EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection | Dec 21, 2021 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| MPViT: Multi-Path Vision Transformer for Dense Prediction | Dec 21, 2021 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition | Dec 17, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks | Dec 17, 2021 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |
| Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds | Dec 16, 2021 | Objectobject-detection | CodeCode Available | 1 |
| RegionCLIP: Region-based Language-Image Pretraining | Dec 16, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| IS-COUNT: Large-scale Object Counting from Satellite Images with Covariate-based Importance Sampling | Dec 16, 2021 | ObjectObject Counting | CodeCode Available | 1 |
| QAHOI: Query-Based Anchors for Human-Object Interaction Detection | Dec 16, 2021 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| CPPE-5: Medical Personal Protective Equipment Dataset | Dec 15, 2021 | Object Detection | CodeCode Available | 1 |
| Towards General and Efficient Active Learning | Dec 15, 2021 | Active LearningDepth Estimation | CodeCode Available | 1 |
| Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions | Dec 15, 2021 | Image Enhancementobject-detection | CodeCode Available | 1 |
| TRACER: Extreme Attention Guided Salient Object Tracing Network | Dec 14, 2021 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| WOOD: Wasserstein-based Out-of-Distribution Detection | Dec 13, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images | Dec 13, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Visual Transformers with Primal Object Queries for Multi-Label Image Classification | Dec 10, 2021 | Decoderimage-classification | CodeCode Available | 1 |
| Monitoring and Adapting the Physical State of a Camera for Autonomous Vehicles | Dec 10, 2021 | Autonomous Vehiclesobject-detection | CodeCode Available | 1 |
| Label, Verify, Correct: A Simple Few Shot Object Detection Method | Dec 10, 2021 | BenchmarkingFew-Shot Object Detection | CodeCode Available | 1 |
| Searching Parameterized AP Loss for Object Detection | Dec 9, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Recurrent Glimpse-based Decoder for Detection with Transformer | Dec 9, 2021 | DecoderObject Detection | CodeCode Available | 1 |
| Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection | Dec 9, 2021 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 1 |
| SoK: Vehicle Orientation Representations for Deep Rotation Estimation | Dec 8, 2021 | 3D Object DetectionDeep Learning | CodeCode Available | 1 |
| Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection | Dec 8, 2021 | Adversarial Attack DetectionAdversarial Defense | CodeCode Available | 1 |
| Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant Parts | Dec 8, 2021 | 3D Shape ModelingBenchmarking | CodeCode Available | 1 |
| Dilated convolution with learnable spacings | Dec 7, 2021 | Image ClassificationObject Detection | CodeCode Available | 1 |
| Activation to Saliency: Forming High-Quality Labels for Completely Unsupervised Salient Object Detection | Dec 7, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| A Contrastive Distillation Approach for Incremental Semantic Segmentation in Aerial Images | Dec 7, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| GaTector: A Unified Framework for Gaze Object Prediction | Dec 7, 2021 | Gaze EstimationGaze Prediction | CodeCode Available | 1 |
| Context-Aware Transfer Attacks for Object Detection | Dec 6, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Joint Learning of Localized Representations from Medical Images and Reports | Dec 6, 2021 | Contrastive Learningimage-classification | CodeCode Available | 1 |
| Dynamic Token Normalization Improves Vision Transformers | Dec 5, 2021 | Inductive BiasListOps | CodeCode Available | 1 |
| Behind the Curtain: Learning Occluded Shapes for 3D Object Detection | Dec 4, 2021 | 3D Object DetectionObject | CodeCode Available | 1 |
| CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection | Dec 4, 2021 | Decoderobject-detection | CodeCode Available | 1 |
| The Box Size Confidence Bias Harms Your Object Detector | Dec 3, 2021 | Objectobject-detection | CodeCode Available | 1 |
| SGM3D: Stereo Guided Monocular 3D Object Detection | Dec 3, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection | Dec 3, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration | Dec 3, 2021 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |
| MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention | Dec 2, 2021 | Object DetectionRepresentation Learning | CodeCode Available | 1 |
| MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Dec 2, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting | Dec 2, 2021 | Image-text matchingInstance Segmentation | CodeCode Available | 1 |