| 3D Object Detection and High-Resolution Traffic Parameters Extraction Using Low-Resolution LiDAR Data | Jan 13, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| UniVision: A Unified Framework for Vision-Centric 3D Perception | Jan 13, 2024 | Autonomous Driving, Data Augmentation | Code Available | 0 |
| Dense Optical Flow Estimation Using Sparse Regularizers from Reduced Measurements | Jan 12, 2024 | Activity Recognition, Motion Estimation | Unverified | 0 |
| Improving the Detection of Small Oriented Objects in Aerial Images | Jan 12, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| Embedded Planogram Compliance Control System | Jan 12, 2024 | Management, NVIDIA Jetson Orin Nano | Unverified | 0 |
| Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook | Jan 12, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Object-Centric Diffusion for Efficient Video Editing | Jan 11, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| YOLO-Former: YOLO Shakes Hand With ViT | Jan 11, 2024 | Object, object-detection | Unverified | 0 |
| Wasserstein Distance-based Expansion of Low-Density Latent Regions for Unknown Class Detection | Jan 10, 2024 | Metric Learning, Novelty Detection | Code Available | 0 |
| Consensus Focus for Object Detection and minority classes | Jan 10, 2024 | Domain Adaptation, Long-tailed Object Detection | Code Available | 0 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Jan 10, 2024 | Domain Adaptation, Object | Code Available | 1 |
| Optimising Graph Representation for Hardware Implementation of Graph Convolutional Networks for Event-based Vision | Jan 10, 2024 | Event-based vision, Graph Generation | Unverified | 0 |
| Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection | Jan 10, 2024 | 3D Object Detection, Data Augmentation | Code Available | 0 |
| Generic Knowledge Boosted Pre-training For Remote Sensing Images | Jan 9, 2024 | Change Detection, Deep Learning | Code Available | 1 |
| Integrity Assessment of Maritime Object Detection Impacted by Partial Camera Obstruction | Jan 8, 2024 | Decision Making, Object | Unverified | 0 |
| SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling | Jan 8, 2024 | 3D Object Detection, Domain Adaptation | Unverified | 0 |
| MS-DETR: Efficient DETR Training with Mixed Supervision | Jan 8, 2024 | Decoder, Object | Code Available | 2 |
| A Flying Bird Object Detection Method for Surveillance Video | Jan 8, 2024 | Object, object-detection | Code Available | 1 |
| UFO: Unidentified Foreground Object Detection in 3D Point Cloud | Jan 8, 2024 | Autonomous Driving, Object | Unverified | 0 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Dr^2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning | Jan 8, 2024 | object-detection, Object Detection | Code Available | 0 |
| A Multi-objective Newton Optimization Algorithm for Hyper-Parameter Search | Jan 7, 2024 | Bayesian Optimization, object-detection | Unverified | 0 |
| SeTformer is What You Need for Vision and Language | Jan 7, 2024 | Computational Efficiency, Language Modeling | Unverified | 0 |
| Real Time Human Detection by Unmanned Aerial Vehicles | Jan 6, 2024 | Human Detection, Object | Unverified | 0 |
| Multimodal Data Curation via Object Detection and Filter Ensembles | Jan 5, 2024 | Object, object-detection | Unverified | 0 |
| VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection | Jan 5, 2024 | 3D Object Detection, Feature Importance | Unverified | 0 |
| HyperSense: Hyperdimensional Intelligent Sensing for Energy-Efficient Sparse Data Processing | Jan 4, 2024 | object-detection, Object Detection | Unverified | 0 |
| ShapeAug: Occlusion Augmentation for Event Camera Data | Jan 4, 2024 | Data Augmentation, Object | Unverified | 0 |
| FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding | Jan 3, 2024 | object-detection, Object Detection | Unverified | 0 |
| DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models | Jan 3, 2024 | Denoising, object-detection | Unverified | 0 |
| Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection | Jan 3, 2024 | 3D Object Detection, Knowledge Distillation | Unverified | 0 |
| Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning | Jan 2, 2024 | Data Augmentation, Object Counting | Unverified | 0 |
| Depth-discriminative Metric Learning for Monocular 3D Object Detection | Jan 2, 2024 | 3D Object Detection, Depth Estimation | Unverified | 0 |
| Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.) | Jan 2, 2024 | Articles, Deep Learning | Unverified | 0 |
| Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object Detection | Jan 2, 2024 | Object, object-detection | Unverified | 0 |
| Exploring Orthogonality in Open World Object Detection | Jan 1, 2024 | Incremental Learning, Object | Code Available | 2 |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Jan 1, 2024 | 3D Object Detection, 3D Reconstruction | Code Available | 1 |
| CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Jan 1, 2024 | 3D Object Detection, Knowledge Distillation | Code Available | 1 |
| Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and Baseline | Jan 1, 2024 | Crowd Counting, object-detection | Code Available | 3 |
| Few-Shot Object Detection with Foundation Models | Jan 1, 2024 | Few-Shot Learning, Few-Shot Object Detection | Unverified | 0 |
| Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection | Jan 1, 2024 | Decoder, object-detection | Unverified | 0 |
| Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action | Jan 1, 2024 | Image Generation, Instruction Following | Unverified | 0 |
| Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness | Jan 1, 2024 | Human-Object Interaction Detection, object-detection | Unverified | 0 |
| Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning | Jan 1, 2024 | object-detection, Object Detection | Code Available | 1 |
| Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Jan 1, 2024 | Adversarial Attack, image-classification | Code Available | 1 |
| AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One | Jan 1, 2024 | All, Benchmarking | Unverified | 0 |
| Multi-agent Collaborative Perception via Motion-aware Robust Communication Network | Jan 1, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| On Scaling Up a Multilingual Vision and Language Model | Jan 1, 2024 | document understanding, In-Context Learning | Unverified | 0 |
| M3-UDA: A New Benchmark for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection | Jan 1, 2024 | object-detection, Object Detection | Code Available | 0 |