| TransMix: Attend to Mix for Vision Transformers | Nov 18, 2021 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Nov 18, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking Trackers | Nov 17, 2021 | Adversarial AttackMulti-Object Tracking | CodeCode Available | 1 |
| ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data | Nov 17, 2021 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining | Nov 17, 2021 | Contrastive LearningImage Restoration | CodeCode Available | 1 |
| TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Nov 17, 2021 | Data AugmentationImage Augmentation | CodeCode Available | 1 |
| Single-stage uav detection and classification with yolov5: Mosaic data augmentation and panet | Nov 16, 2021 | 2D Object DetectionData Augmentation | CodeCode Available | 0 |
| On Vision Features in Multimodal Machine Translation | Nov 16, 2021 | Image CaptioningMachine Translation | —Unverified | 0 |
| TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts | Nov 16, 2021 | Data AugmentationGPU | —Unverified | 0 |
| Postdisaster image-based damage detection and repair cost estimation of reinforced concrete buildings using dual convolutional neural networks | Nov 16, 2021 | Managementobject-detection | —Unverified | 0 |
| Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Nov 16, 2021 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 1 |
| Single Image Object Counting and Localizing using Active-Learning | Nov 16, 2021 | Active LearningObject | —Unverified | 0 |
| Semantically Grounded Object Matching for Robust Robotic Scene Rearrangement | Nov 15, 2021 | Language ModellingObject | CodeCode Available | 0 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Attention Mechanisms in Computer Vision: A Survey | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Robust and Accurate Object Detection via Self-Knowledge Distillation | Nov 14, 2021 | Adversarial RobustnessKnowledge Distillation | CodeCode Available | 0 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Fracture Detection in Wrist X-ray Images Using Deep Learning-Based Object Detection Models | Nov 14, 2021 | Ensemble LearningFracture detection | CodeCode Available | 0 |
| Factorial Convolution Neural Networks | Nov 13, 2021 | Objectobject-detection | —Unverified | 0 |
| Visual Understanding of Complex Table Structures from Document Images | Nov 13, 2021 | Novel Object Detectionobject-detection | —Unverified | 0 |
| Can neural networks predict dynamics they have never seen? | Nov 12, 2021 | Machine Translationobject-detection | —Unverified | 0 |
| Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection | Nov 12, 2021 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |
| Masked Autoencoders Are Scalable Vision Learners | Nov 11, 2021 | DecoderDomain Generalization | CodeCode Available | 1 |
| Indian Licence Plate Dataset in the wild | Nov 11, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Towards Live Video Analytics with On-Drone Deeper-yet-Compatible Compression | Nov 10, 2021 | object-detectionObject Detection | —Unverified | 0 |