| From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents | Jun 25, 2025 | Document Layout Analysisobject-detection | —Unverified | 0 |
| Feature Hallucination for Self-supervised Action Recognition | Jun 25, 2025 | Action RecognitionHallucination | —Unverified | 0 |
| A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Jun 24, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | Jun 22, 2025 | image-classificationImage Classification | —Unverified | 0 |
| YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception | Jun 21, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 5 |
| Class Agnostic Instance-level Descriptor for Visual Instance Search | Jun 20, 2025 | Content-Based Image RetrievalImage Retrieval | —Unverified | 0 |
| Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation | Jun 19, 2025 | AstronomyMorphology classification | CodeCode Available | 0 |
| Retrospective Memory for Camouflaged Object Detection | Jun 18, 2025 | Objectobject-detection | —Unverified | 0 |
| YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework | Jun 17, 2025 | Multispectral Object Detectionobject-detection | CodeCode Available | 4 |
| VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning | Jun 17, 2025 | object-detectionObject Detection | CodeCode Available | 0 |