| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Nov 20, 2024 | GPUMME | CodeCode Available | 3 |
| YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation | Nov 20, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Physics-Guided Detector for SAR Airplanes | Nov 19, 2024 | Object DetectionSelf-Supervised Learning | CodeCode Available | 1 |
| Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster | Nov 18, 2024 | Deep Learningobject-detection | —Unverified | 0 |
| SL-YOLO: A Stronger and Lighter Drone Target Detection Model | Nov 18, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Nov 18, 2024 | image-classificationImage Classification | —Unverified | 0 |
| WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images | Nov 18, 2024 | Novel Object Detectionobject-detection | —Unverified | 0 |
| EVT: Efficient View Transformation for Multi-Modal 3D Object Detection | Nov 16, 2024 | 3D Object DetectionDecoder | —Unverified | 0 |
| Vision Eagle Attention: a new lens for advancing image classification | Nov 15, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Structure Tensor Representation for Robust Oriented Object Detection | Nov 15, 2024 | Objectobject-detection | —Unverified | 0 |
| Diachronic Document Dataset for Semantic Layout Analysis | Nov 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions | Nov 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Nov 15, 2024 | DescriptiveObject | —Unverified | 0 |
| RETR: Multi-View Radar Detection Transformer for Indoor Perception | Nov 15, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Nov 15, 2024 | energy managementManagement | —Unverified | 0 |
| RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering | Nov 14, 2024 | Depth EstimationImage Classification | —Unverified | 0 |
| Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks | Nov 14, 2024 | Infrared And Visible Image Fusionobject-detection | —Unverified | 0 |
| Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Nov 14, 2024 | Contrastive LearningLong-tailed Object Detection | —Unverified | 0 |
| Cross-Modal Consistency in Multimodal Large Language Models | Nov 14, 2024 | Image Captioningobject-detection | —Unverified | 0 |
| DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Nov 14, 2024 | Multi-class Classificationobject-detection | —Unverified | 0 |
| LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection | Nov 14, 2024 | Objectobject-detection | —Unverified | 0 |
| Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Nov 14, 2024 | Computational EfficiencyObject | CodeCode Available | 1 |
| V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Nov 13, 2024 | 3D Object DetectionDenoising | CodeCode Available | 2 |
| Multimodal Object Detection using Depth and Image Data for Manufacturing Parts | Nov 13, 2024 | Objectobject-detection | —Unverified | 0 |
| UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Nov 13, 2024 | DecoderFew-Shot Object Detection | —Unverified | 0 |
| Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Nov 13, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning | Nov 12, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 12, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Depthwise Separable Convolutions with Deep Residual Convolutions | Nov 12, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs | Nov 11, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images | Nov 11, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Multi-scale Frequency Enhancement Network for Blind Image Deblurring | Nov 11, 2024 | Blind Image DeblurringDeblurring | —Unverified | 0 |
| LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection | Nov 11, 2024 | Mambaobject-detection | CodeCode Available | 0 |
| Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction | Nov 11, 2024 | Autonomous VehiclesInstance Segmentation | CodeCode Available | 1 |
| FuzzRisk: Online Collision Risk Estimation for Autonomous Vehicles based on Depth-Aware Object Detection via Fuzzy Inference | Nov 9, 2024 | Autonomous VehiclesObject | —Unverified | 0 |
| AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | Nov 9, 2024 | Adversarial Robustnessimage-classification | —Unverified | 0 |
| LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Nov 9, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing | Nov 9, 2024 | Change DetectionLand Cover Classification | —Unverified | 0 |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Nov 9, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Nov 8, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent | Nov 8, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Open-set object detection: towards unified problem formulation and benchmarking | Nov 8, 2024 | Autonomous DrivingBenchmarking | —Unverified | 0 |
| SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Nov 8, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Exploring the Feasibility of Affordable Sonar Technology: Object Detection in Underwater Environments Using the Ping 360 | Nov 7, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion | Nov 7, 2024 | object-detectionObject Detection | —Unverified | 0 |
| On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data | Nov 7, 2024 | Dimensionality ReductionObject | CodeCode Available | 0 |
| Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory | Nov 7, 2024 | object-detectionObject Detection | —Unverified | 0 |
| UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection | Nov 7, 2024 | Active Object DetectionDeep Reinforcement Learning | CodeCode Available | 0 |