| ELA: Efficient Local Attention for Deep Convolutional Neural Networks | Mar 2, 2024 | Dimensionality Reductionimage-classification | —Unverified | 0 |
| TUMTraf V2X Cooperative Perception Dataset | Mar 2, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 4 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Objectobject-detection | CodeCode Available | 2 |
| Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Mar 1, 2024 | Objectobject-detection | —Unverified | 0 |
| Learning Causal Features for Incremental Object Detection | Mar 1, 2024 | Incremental LearningObject | —Unverified | 0 |
| VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks | Mar 1, 2024 | Image ClassificationImage Generation | CodeCode Available | 3 |
| YOLO-MED : Multi-Task Interaction Network for Biomedical Images | Mar 1, 2024 | object-detectionObject Detection | —Unverified | 0 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 |
| Privacy-Preserving Autoencoder for Collaborative Object Detection | Feb 29, 2024 | License Plate RecognitionObject | CodeCode Available | 0 |
| LLMs in Political Science: Heralding a New Era of Visual Analysis | Feb 29, 2024 | Caption GenerationFace Identification | —Unverified | 0 |
| Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering | Feb 29, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| SeMoLi: What Moves Together Belongs Together | Feb 29, 2024 | ClusteringObject | —Unverified | 0 |
| HyenaPixel: Global Image Context with Convolutions | Feb 29, 2024 | Image ClassificationObject Detection | CodeCode Available | 0 |
| Boosting Semi-Supervised Object Detection in Remote Sensing Images With Active Teaching | Feb 29, 2024 | Active LearningObject | —Unverified | 0 |
| Theoretically Achieving Continuous Representation of Oriented Bounding Boxes | Feb 29, 2024 | Fairnessobject-detection | CodeCode Available | 3 |
| ProtoP-OD: Explainable Object Detection with Prototypical Parts | Feb 29, 2024 | Objectobject-detection | —Unverified | 0 |
| Debiased Novel Category Discovering and Localization | Feb 29, 2024 | Contrastive LearningNovel Class Discovery | —Unverified | 0 |
| A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection | Feb 29, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Zero-Shot Aerial Object Detection with Visual Description Regularization | Feb 28, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization | Feb 28, 2024 | Domain Generalizationimage-classification | —Unverified | 0 |
| Spatial Coherence Loss: All Objects Matter in Salient and Camouflaged Object Detection | Feb 28, 2024 | AllObject | —Unverified | 0 |
| Detection of Micromobility Vehicles in Urban Traffic Videos | Feb 28, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection | Feb 28, 2024 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| Towards Unified 3D Object Detection via Algorithm and Data Unification | Feb 28, 2024 | 3D Object DetectionMonocular 3D Object Detection | —Unverified | 0 |
| Deployment Prior Injection for Run-time Calibratable Object Detection | Feb 27, 2024 | Objectobject-detection | —Unverified | 0 |
| A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track | Feb 27, 2024 | 3D Object DetectionContinual Learning | —Unverified | 0 |
| SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection | Feb 27, 2024 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| Probing Multimodal Large Language Models for Global and Local Semantic Representations | Feb 27, 2024 | Image to textobject-detection | CodeCode Available | 0 |
| AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding | Feb 27, 2024 | 3D Object Detection3D Part Segmentation | CodeCode Available | 0 |
| Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking | Feb 26, 2024 | channel selectionimage-classification | —Unverified | 0 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 |
| SaRPFF: A Self-Attention with Register-based Pyramid Feature Fusion module for enhanced RLD detection | Feb 26, 2024 | Objectobject-detection | —Unverified | 0 |
| Real-Time Vehicle Detection and Urban Traffic Behavior Analysis Based on UAV Traffic Videos on Mobile Devices | Feb 26, 2024 | Objectobject-detection | —Unverified | 0 |
| Semi-supervised Open-World Object Detection | Feb 25, 2024 | Incremental LearningObject | CodeCode Available | 1 |
| MMW-Carry: Enhancing Carry Object Detection through Millimeter-Wave Radar-Camera Fusion | Feb 24, 2024 | Human Detectionobject-detection | —Unverified | 0 |
| State Space Models for Event Cameras | Feb 23, 2024 | Event-based visionObject Detection | CodeCode Available | 3 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Feb 23, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends | Feb 23, 2024 | 6D Visionimage-classification | —Unverified | 0 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| S^2Former-OR: Single-Stage Bi-Modal Transformer for Scene Graph Generation in OR | Feb 22, 2024 | Graph Generationobject-detection | CodeCode Available | 0 |
| High-Speed Detector For Low-Powered Devices In Aerial Grasping | Feb 22, 2024 | object-detectionObject Detection | —Unverified | 0 |
| YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5 | Feb 22, 2024 | Objectobject-detection | —Unverified | 0 |
| Unsupervised learning based object detection using Contrastive Learning | Feb 21, 2024 | Contrastive LearningObject | —Unverified | 0 |
| YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information | Feb 21, 2024 | object-detectionObject Detection | CodeCode Available | 16 |
| TransGOP: Transformer-Based Gaze Object Prediction | Feb 21, 2024 | Gaze EstimationObject | CodeCode Available | 1 |
| Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer | Feb 20, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Efficient Parameter Mining and Freezing for Continual Object Detection | Feb 20, 2024 | Continual LearningIncremental Learning | —Unverified | 0 |
| CST: Calibration Side-Tuning for Parameter and Memory Efficient Transfer Learning | Feb 20, 2024 | GPUObject | —Unverified | 0 |
| YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection | Feb 20, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| GOOD: Towards Domain Generalized Orientated Object Detection | Feb 20, 2024 | HallucinationObject | —Unverified | 0 |