| QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation | Oct 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Mero Nagarikta: Advanced Nepali Citizenship Data Extractor with Deep Learning-Powered Text Detection and OCR | Oct 8, 2024 | object-detectionObject Detection | —Unverified | 0 |
| PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAM | Oct 8, 2024 | Disentanglementobject-detection | CodeCode Available | 0 |
| Believing is Seeing: Unobserved Object Detection using Generative Models | Oct 8, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga | Oct 8, 2024 | ColorizationData Augmentation | —Unverified | 0 |
| Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Oct 8, 2024 | Autonomous VehiclesObject | —Unverified | 0 |
| SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection | Oct 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future | Oct 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Oct 8, 2024 | Instance SegmentationObject | —Unverified | 0 |
| Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Oct 8, 2024 | Data CompressionFacial Landmark Detection | —Unverified | 0 |
| CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection | Oct 8, 2024 | Attributeobject-detection | —Unverified | 0 |
| Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading | Oct 8, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Rethinking Weak-to-Strong Augmentation in Source-Free Domain Adaptive Object Detection | Oct 7, 2024 | Contrastive Learningobject-detection | —Unverified | 0 |
| Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava | Oct 7, 2024 | Autonomous Vehiclesobject-detection | —Unverified | 0 |
| Improving Object Detection via Local-global Contrastive Learning | Oct 7, 2024 | Contrastive LearningImage-to-Image Translation | —Unverified | 0 |
| Improved detection of discarded fish species through BoxAL active learning | Oct 7, 2024 | Active Learningobject-detection | CodeCode Available | 0 |
| Learning De-Biased Representations for Remote-Sensing Imagery | Oct 6, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Cross Resolution Encoding-Decoding For Detection Transformers | Oct 5, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Fast Object Detection with a Machine Learning Edge Device | Oct 5, 2024 | Autonomous NavigationCPU | —Unverified | 0 |
| Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection | Oct 5, 2024 | Mambaobject-detection | CodeCode Available | 0 |
| STONE: A Submodular Optimization Framework for Active 3D Object Detection | Oct 4, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 0 |
| Learning 3D Perception from Others' Predictions | Oct 3, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | Oct 3, 2024 | Contrastive LearningImage Classification | CodeCode Available | 0 |
| BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization | Oct 3, 2024 | Bilevel Optimizationimage-classification | —Unverified | 0 |
| Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker | Oct 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |