| On Moving Object Segmentation from Monocular Video with Transformers | Nov 28, 2024 | 3D geometry, Motion Segmentation | Unverified | 0 |
| HDI-Former: Hybrid Dynamic Interaction ANN-SNN Transformer for Object Detection Using Frames and Events | Nov 27, 2024 | object-detection, Object Detection | Unverified | 0 |
| ROICtrl: Boosting Instance Control for Visual Generation | Nov 27, 2024 | Attribute, object-detection | Unverified | 0 |
| Deep Fourier-embedded Network for Bi-modal Salient Object Detection | Nov 27, 2024 | object-detection, Object Detection | Code Available | 1 |
| RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos | Nov 27, 2024 | Head Detection, object-detection | Unverified | 0 |
| Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks | Nov 27, 2024 | Multispectral Object Detection, Object | Unverified | 0 |
| From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Nov 27, 2024 | Autonomous Driving, Object | Code Available | 1 |
| OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Nov 26, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models | Nov 26, 2024 | Object, object-detection | Unverified | 0 |
| TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Nov 26, 2024 | image-classification, Image Classification | Code Available | 2 |
| Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation | Nov 26, 2024 | Articles, object-detection | Code Available | 1 |
| Interpretable Dynamic Graph Neural Networks for Small Occluded Object Detection and Tracking | Nov 26, 2024 | Decision Making, object-detection | Unverified | 0 |
| Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Nov 26, 2024 | Object, object-detection | Code Available | 0 |
| Open Vocabulary Monocular 3D Object Detection | Nov 25, 2024 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 2 |
| Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Nov 25, 2024 | object-detection, Object Detection | Code Available | 2 |
| CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Nov 25, 2024 | Instance Segmentation, object-detection | Unverified | 0 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Nov 25, 2024 | Explainable Artificial Intelligence (XAI), Object | Code Available | 2 |
| Leverage Task Context for Object Affordance Ranking | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks | Nov 25, 2024 | Benchmarking, object-detection | Code Available | 1 |
| CIA: Controllable Image Augmentation Framework Based on Stable Diffusion | Nov 25, 2024 | Image Augmentation, Object | Code Available | 0 |
| Imperceptible Adversarial Examples in the Physical World | Nov 25, 2024 | object-detection, Object Detection | Unverified | 0 |
| Diagnosis of diabetic retinopathy using machine learning & deep learning technique | Nov 25, 2024 | Deep Learning, object-detection | Unverified | 0 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance Segmentation, Knowledge Distillation | Code Available | 1 |
| AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | Nov 24, 2024 | Few-Shot Object Detection, Image Generation | Unverified | 0 |
| LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 24, 2024 | Ensemble Learning, Object | Code Available | 1 |
| Towards RAW Object Detection in Diverse Conditions | Nov 24, 2024 | Object, object-detection | Code Available | 1 |
| Highly Efficient and Unsupervised Framework for Moving Object Detection in Satellite Videos | Nov 24, 2024 | Moving Object Detection, object-detection | Code Available | 1 |
| AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Nov 23, 2024 | Data Augmentation, Diversity | Code Available | 2 |
| Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation | Nov 23, 2024 | Object, object-detection | Unverified | 0 |
| Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data | Nov 23, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Nov 23, 2024 | image-classification, Image Classification | Unverified | 0 |
| Enhancing Object Detection Accuracy in Autonomous Vehicles Using Synthetic Data | Nov 23, 2024 | Autonomous Vehicles, object-detection | Unverified | 0 |
| OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Nov 23, 2024 | Keypoint Detection, Object | Code Available | 1 |
| A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles | Nov 22, 2024 | Autonomous Vehicles, Object | Unverified | 0 |
| MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Nov 22, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Nov 22, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Beneath the Surface: The Role of Underwater Image Enhancement in Object Detection | Nov 21, 2024 | Image Enhancement, object-detection | Code Available | 0 |
| Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation | Nov 21, 2024 | Denoising, object-detection | Unverified | 0 |
| DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Nov 21, 2024 | Long-tailed Object Detection, Object | Code Available | 5 |
| AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection | Nov 21, 2024 | Backdoor Attack, Multi-Task Learning | Code Available | 0 |
| Transforming Static Images Using Generative Models for Video Salient Object Detection | Nov 21, 2024 | object-detection, Object Detection | Unverified | 0 |
| WARLearn: Weather-Adaptive Representation Learning | Nov 21, 2024 | 2D Object Detection, Adversarial Robustness | Code Available | 0 |
| Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection | Nov 20, 2024 | Contrastive Learning, object-detection | Unverified | 0 |
| MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection | Nov 20, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors | Nov 20, 2024 | Model extraction, object-detection | Unverified | 0 |
| RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Nov 20, 2024 | Image Generation, object-detection | Code Available | 2 |
| VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation | Nov 20, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Nov 20, 2024 | GPU, MME | Code Available | 3 |