| A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data | Nov 20, 2024 | Autonomous DrivingDecoder | CodeCode Available | 0 |
| YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Nov 20, 2024 | GPUMME | CodeCode Available | 3 |
| Physics-Guided Detector for SAR Airplanes | Nov 19, 2024 | Object DetectionSelf-Supervised Learning | CodeCode Available | 1 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster | Nov 18, 2024 | Deep Learningobject-detection | —Unverified | 0 |
| WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images | Nov 18, 2024 | Novel Object Detectionobject-detection | —Unverified | 0 |
| SL-YOLO: A Stronger and Lighter Drone Target Detection Model | Nov 18, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Nov 18, 2024 | image-classificationImage Classification | —Unverified | 0 |
| EVT: Efficient View Transformation for Multi-Modal 3D Object Detection | Nov 16, 2024 | 3D Object DetectionDecoder | —Unverified | 0 |
| Structure Tensor Representation for Robust Oriented Object Detection | Nov 15, 2024 | Objectobject-detection | —Unverified | 0 |
| Vision Eagle Attention: a new lens for advancing image classification | Nov 15, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions | Nov 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Nov 15, 2024 | energy managementManagement | —Unverified | 0 |
| Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Nov 15, 2024 | DescriptiveObject | —Unverified | 0 |
| RETR: Multi-View Radar Detection Transformer for Indoor Perception | Nov 15, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Diachronic Document Dataset for Semantic Layout Analysis | Nov 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering | Nov 14, 2024 | Depth EstimationImage Classification | —Unverified | 0 |
| Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Nov 14, 2024 | Contrastive LearningLong-tailed Object Detection | —Unverified | 0 |
| Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Nov 14, 2024 | Computational EfficiencyObject | CodeCode Available | 1 |
| DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Nov 14, 2024 | Multi-class Classificationobject-detection | —Unverified | 0 |
| Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks | Nov 14, 2024 | Infrared And Visible Image Fusionobject-detection | —Unverified | 0 |
| Cross-Modal Consistency in Multimodal Large Language Models | Nov 14, 2024 | Image Captioningobject-detection | —Unverified | 0 |
| LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection | Nov 14, 2024 | Objectobject-detection | —Unverified | 0 |
| UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Nov 13, 2024 | DecoderFew-Shot Object Detection | —Unverified | 0 |