| Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond | Oct 19, 2023 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection | Oct 18, 2023 | Long-tailed Object Detectionobject-detection | CodeCode Available | 0 |
| VST++: Efficient and Stronger Visual Saliency Transformer | Oct 18, 2023 | object-detectionObject Detection | —Unverified | 0 |
| GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment | Oct 17, 2023 | AttributeObject | CodeCode Available | 2 |
| Multi Self-supervised Pre-fine-tuned Transformer Fusion for Better Intelligent Transportation Detection | Oct 17, 2023 | object-detectionObject Detection | —Unverified | 0 |
| An empirical study of automatic wildlife detection using drone thermal imaging and object detection | Oct 17, 2023 | Managementobject-detection | —Unverified | 0 |
| MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient | Oct 17, 2023 | 3D Object DetectionGPU | CodeCode Available | 1 |
| Language Models as Zero-Shot Trajectory Generators | Oct 17, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing | Oct 17, 2023 | 3D Object DetectionDomain Adaptation | CodeCode Available | 1 |
| A Machine Learning-based Algorithm for Automated Detection of Frequency-based Events in Recorded Time Series of Sensor Data | Oct 16, 2023 | Event Detectionobject-detection | —Unverified | 0 |
| Towards Open-World Co-Salient Object Detection with Generative Uncertainty-aware Group Selective Exchange-Masking | Oct 16, 2023 | Co-Salient Object Detectionobject-detection | CodeCode Available | 0 |
| A Non-monotonic Smooth Activation Function | Oct 16, 2023 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| Open-CRB: Towards Open World Active Learning for 3D Object Detection | Oct 16, 2023 | 3D Object DetectionActive Learning | CodeCode Available | 1 |
| RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets | Oct 16, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| RoboLLM: Robotic Vision Tasks Grounded on Multimodal Large Language Models | Oct 16, 2023 | Instance SegmentationModel Selection | CodeCode Available | 1 |
| Multimodal Object Query Initialization for 3D Object Detection | Oct 16, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Mask wearing object detection algorithm based on improved YOLOv5 | Oct 16, 2023 | Face Detectionobject-detection | —Unverified | 0 |
| Object Detection in Aerial Images in Scarce Data Regimes | Oct 16, 2023 | Few-Shot Object DetectionMetric Learning | —Unverified | 0 |
| MAC: ModAlity Calibration for Object Detection | Oct 14, 2023 | Objectobject-detection | —Unverified | 0 |
| Detecting Moving Objects Using a Novel Optical-Flow-Based Range-Independent Invariant | Oct 14, 2023 | Moving Object Detectionobject-detection | —Unverified | 0 |
| Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context | Oct 14, 2023 | Contrastive Learningobject-detection | —Unverified | 0 |
| MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments | Oct 13, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| Efficient Apple Maturity and Damage Assessment: A Lightweight Detection Model with GAN and Attention Mechanism | Oct 13, 2023 | Decision MakingModel Compression | —Unverified | 0 |
| VCL Challenges 2023 at ICCV 2023 Technical Report: Bi-level Adaptation Method for Test-time Adaptive Object Detection | Oct 13, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Rank-DETR for High Quality Object Detection | Oct 13, 2023 | Objectobject-detection | CodeCode Available | 1 |