| On Moving Object Segmentation from Monocular Video with Transformers | Nov 28, 2024 | 3D geometryMotion Segmentation | —Unverified | 0 |
| HDI-Former: Hybrid Dynamic Interaction ANN-SNN Transformer for Object Detection Using Frames and Events | Nov 27, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ROICtrl: Boosting Instance Control for Visual Generation | Nov 27, 2024 | Attributeobject-detection | —Unverified | 0 |
| Deep Fourier-embedded Network for Bi-modal Salient Object Detection | Nov 27, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos | Nov 27, 2024 | Head Detectionobject-detection | —Unverified | 0 |
| Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks | Nov 27, 2024 | Multispectral Object DetectionObject | —Unverified | 0 |
| From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Nov 27, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Nov 26, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models | Nov 26, 2024 | Objectobject-detection | —Unverified | 0 |
| Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Nov 26, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Interpretable Dynamic Graph Neural Networks for Small Occluded Object Detection and Tracking | Nov 26, 2024 | Decision Makingobject-detection | —Unverified | 0 |
| Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation | Nov 26, 2024 | Articlesobject-detection | CodeCode Available | 1 |
| TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Nov 26, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Open Vocabulary Monocular 3D Object Detection | Nov 25, 2024 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 2 |
| Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment | Nov 25, 2024 | Objectobject-detection | —Unverified | 0 |
| Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory | Nov 25, 2024 | Objectobject-detection | —Unverified | 0 |
| Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Nov 25, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| Leverage Task Context for Object Affordance Ranking | Nov 25, 2024 | Objectobject-detection | —Unverified | 0 |
| CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Nov 25, 2024 | Instance Segmentationobject-detection | —Unverified | 0 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Nov 25, 2024 | Explainable Artificial Intelligence (XAI)Object | CodeCode Available | 2 |
| Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks | Nov 25, 2024 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Imperceptible Adversarial Examples in the Physical World | Nov 25, 2024 | object-detectionObject Detection | —Unverified | 0 |
| CIA: Controllable Image Augmentation Framework Based on Stable Diffusion | Nov 25, 2024 | Image AugmentationObject | CodeCode Available | 0 |
| Diagnosis of diabetic retinopathy using machine learning & deep learning technique | Nov 25, 2024 | Deep Learningobject-detection | —Unverified | 0 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance SegmentationKnowledge Distillation | CodeCode Available | 1 |