| Vision Transformer with Sparse Scan Prior | May 22, 2024 | Instance Segmentationobject-detection | CodeCode Available | 0 |
| Class-Conditional self-reward mechanism for improved Text-to-Image models | May 22, 2024 | Image Captioningobject-detection | CodeCode Available | 0 |
| Collaboration of Teachers for Semi-supervised Object Detection | May 22, 2024 | Objectobject-detection | —Unverified | 0 |
| Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation | May 22, 2024 | CPUobject-detection | —Unverified | 0 |
| Transfer Learning Approach for Railway Technical Map (RTM) Component Identification | May 21, 2024 | Managementobject-detection | —Unverified | 0 |
| Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis | May 21, 2024 | 3D Object DetectionManagement | —Unverified | 0 |
| FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | May 21, 2024 | 3D Object DetectionObject | CodeCode Available | 0 |
| Mutual Information Analysis in Multimodal Learning Systems | May 21, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once | May 21, 2024 | AllImage Segmentation | —Unverified | 0 |
| Active Object Detection with Knowledge Aggregation and Distillation from Large Models | May 21, 2024 | Active Object DetectionDecision Making | CodeCode Available | 0 |
| Multi-View Attentive Contextualization for Multi-View 3D Object Detection | May 20, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation | May 20, 2024 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| Bangladeshi Native Vehicle Detection in Wild | May 20, 2024 | Autonomous Navigationobject-detection | CodeCode Available | 0 |
| DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | May 20, 2024 | Contrastive LearningDomain Adaptation | CodeCode Available | 2 |
| FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | May 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | May 19, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | May 18, 2024 | Few-Shot Object DetectionIncremental Learning | CodeCode Available | 1 |
| Visible and Clear: Finding Tiny Objects in Difference Map | May 18, 2024 | Objectobject-detection | CodeCode Available | 1 |
| DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection | May 17, 2024 | 3D Object DetectionDecoder | —Unverified | 0 |
| A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability | May 17, 2024 | AttributeDomain Adaptation | CodeCode Available | 1 |
| A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model | May 17, 2024 | AstronomyFew-Shot Learning | —Unverified | 0 |
| Drone-type-Set: Drone types detection benchmark for drone detection and tracking | May 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense CaptioningDiversity | CodeCode Available | 2 |
| Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection | May 16, 2024 | Edge-computingFew-Shot Object Detection | CodeCode Available | 7 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data AugmentationDiversity | CodeCode Available | 2 |