| A General Divergence Modeling Strategy for Salient Object Detection | Nov 23, 2021 | Objectobject-detection | —Unverified | 0 |
| Focal and Global Knowledge Distillation for Detectors | Nov 23, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Metamorphic Adversarial Detection Pipeline for Face Recognition Systems | Nov 22, 2021 | Adversarial AttackFace Recognition | —Unverified | 0 |
| Lightweight Transformer Backbone for Medical Object Detection | Nov 22, 2021 | Lesion DetectionMedical Object Detection | —Unverified | 0 |
| FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Nov 22, 2021 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images | Nov 22, 2021 | Instance SegmentationObject Detection | CodeCode Available | 1 |
| MetaFormer Is Actually What You Need for Vision | Nov 22, 2021 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture | Nov 22, 2021 | Handwritten Text Recognitionobject-detection | CodeCode Available | 1 |
| L-Verse: Bidirectional Generation Between Image and Text | Nov 22, 2021 | Image CaptioningImage Generation | CodeCode Available | 1 |
| Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras | Nov 22, 2021 | Multi-Object Trackingobject-detection | CodeCode Available | 1 |
| Benchmarking Detection Transfer Learning with Vision Transformers | Nov 22, 2021 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model | Nov 22, 2021 | Attributeobject-detection | —Unverified | 0 |
| MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection | Nov 22, 2021 | Data Augmentationobject-detection | CodeCode Available | 1 |
| Class-agnostic Object Detection with Multi-modal Transformer | Nov 22, 2021 | Class-agnostic Object DetectionObject | CodeCode Available | 1 |
| Conifer Seedling Detection in UAV-Imagery with RGB-Depth Information | Nov 22, 2021 | object-detectionObject Detection | —Unverified | 0 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection | Nov 21, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Efficient Softmax Approximation for Deep Neural Networks with Attention Mechanism | Nov 21, 2021 | Machine Translationobject-detection | —Unverified | 0 |
| HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning | Nov 21, 2021 | Contrastive LearningInstance Segmentation | —Unverified | 0 |
| FBNetV5: Neural Architecture Search for Multiple Tasks in One Run | Nov 19, 2021 | Classificationimage-classification | —Unverified | 0 |
| Grounded Situation Recognition with Transformers | Nov 19, 2021 | DecoderGrounded Situation Recognition | CodeCode Available | 1 |
| Swin Transformer V2: Scaling Up Capacity and Resolution | Nov 18, 2021 | Action Classificationimage-classification | CodeCode Available | 1 |
| Boosting Supervised Learning Performance with Co-training | Nov 18, 2021 | Domain Adaptationobject-detection | —Unverified | 0 |
| Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level Estimation | Nov 18, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| LiDAR Cluster First and Camera Inference Later: A New Perspective Towards Autonomous Driving | Nov 18, 2021 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| TransMix: Attend to Mix for Vision Transformers | Nov 18, 2021 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Nov 18, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking Trackers | Nov 17, 2021 | Adversarial AttackMulti-Object Tracking | CodeCode Available | 1 |
| ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data | Nov 17, 2021 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining | Nov 17, 2021 | Contrastive LearningImage Restoration | CodeCode Available | 1 |
| TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Nov 17, 2021 | Data AugmentationImage Augmentation | CodeCode Available | 1 |
| Single-stage uav detection and classification with yolov5: Mosaic data augmentation and panet | Nov 16, 2021 | 2D Object DetectionData Augmentation | CodeCode Available | 0 |
| On Vision Features in Multimodal Machine Translation | Nov 16, 2021 | Image CaptioningMachine Translation | —Unverified | 0 |
| TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts | Nov 16, 2021 | Data AugmentationGPU | —Unverified | 0 |
| Postdisaster image-based damage detection and repair cost estimation of reinforced concrete buildings using dual convolutional neural networks | Nov 16, 2021 | Managementobject-detection | —Unverified | 0 |
| Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Nov 16, 2021 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 1 |
| Single Image Object Counting and Localizing using Active-Learning | Nov 16, 2021 | Active LearningObject | —Unverified | 0 |
| Semantically Grounded Object Matching for Robust Robotic Scene Rearrangement | Nov 15, 2021 | Language ModellingObject | CodeCode Available | 0 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Robust and Accurate Object Detection via Self-Knowledge Distillation | Nov 14, 2021 | Adversarial RobustnessKnowledge Distillation | CodeCode Available | 0 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Fracture Detection in Wrist X-ray Images Using Deep Learning-Based Object Detection Models | Nov 14, 2021 | Ensemble LearningFracture detection | CodeCode Available | 0 |
| Factorial Convolution Neural Networks | Nov 13, 2021 | Objectobject-detection | —Unverified | 0 |
| Visual Understanding of Complex Table Structures from Document Images | Nov 13, 2021 | Novel Object Detectionobject-detection | —Unverified | 0 |
| Can neural networks predict dynamics they have never seen? | Nov 12, 2021 | Machine Translationobject-detection | —Unverified | 0 |
| Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection | Nov 12, 2021 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |
| Masked Autoencoders Are Scalable Vision Learners | Nov 11, 2021 | DecoderDomain Generalization | CodeCode Available | 1 |
| Indian Licence Plate Dataset in the wild | Nov 11, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Towards Live Video Analytics with On-Drone Deeper-yet-Compatible Compression | Nov 10, 2021 | object-detectionObject Detection | —Unverified | 0 |