| Multimodal Data Curation via Object Detection and Filter Ensembles | Jan 5, 2024 | Objectobject-detection | —Unverified | 0 |
| VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection | Jan 5, 2024 | 3D Object DetectionFeature Importance | —Unverified | 0 |
| HyperSense: Hyperdimensional Intelligent Sensing for Energy-Efficient Sparse Data Processing | Jan 4, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ShapeAug: Occlusion Augmentation for Event Camera Data | Jan 4, 2024 | Data AugmentationObject | —Unverified | 0 |
| FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding | Jan 3, 2024 | object-detectionObject Detection | —Unverified | 0 |
| DiffYOLO: Object Detection for Anti-Noise via YOLO and Diffusion Models | Jan 3, 2024 | Denoisingobject-detection | —Unverified | 0 |
| Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection | Jan 3, 2024 | 3D Object DetectionKnowledge Distillation | —Unverified | 0 |
| Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning | Jan 2, 2024 | Data AugmentationObject Counting | —Unverified | 0 |
| Depth-discriminative Metric Learning for Monocular 3D Object Detection | Jan 2, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.) | Jan 2, 2024 | ArticlesDeep Learning | —Unverified | 0 |
| Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object Detection | Jan 2, 2024 | Objectobject-detection | —Unverified | 0 |
| Exploring Orthogonality in Open World Object Detection | Jan 1, 2024 | Incremental LearningObject | CodeCode Available | 2 |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Jan 1, 2024 | 3D Object Detection3D Reconstruction | CodeCode Available | 1 |
| CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Jan 1, 2024 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and Baseline | Jan 1, 2024 | Crowd Countingobject-detection | CodeCode Available | 3 |
| Few-Shot Object Detection with Foundation Models | Jan 1, 2024 | Few-Shot LearningFew-Shot Object Detection | —Unverified | 0 |
| Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection | Jan 1, 2024 | Decoderobject-detection | —Unverified | 0 |
| Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action | Jan 1, 2024 | Image GenerationInstruction Following | —Unverified | 0 |
| Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness | Jan 1, 2024 | Human-Object Interaction Detectionobject-detection | —Unverified | 0 |
| Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning | Jan 1, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Jan 1, 2024 | Adversarial Attackimage-classification | CodeCode Available | 1 |
| AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One | Jan 1, 2024 | AllBenchmarking | —Unverified | 0 |
| Multi-agent Collaborative Perception via Motion-aware Robust Communication Network | Jan 1, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| On Scaling Up a Multilingual Vision and Language Model | Jan 1, 2024 | document understandingIn-Context Learning | —Unverified | 0 |
| M3-UDA: A New Benchmark for Unsupervised Domain Adaptive Fetal Cardiac Structure Detection | Jan 1, 2024 | object-detectionObject Detection | CodeCode Available | 0 |