| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text RecognitionHTR | CodeCode Available | 1 |
| QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Oct 9, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection | Oct 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future | Oct 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles | Oct 2, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| OSSA: Unsupervised One-Shot Style Adaptation | Oct 1, 2024 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images | Sep 29, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| A Confidence-Aware Matching Strategy For Generalized Multi-Object Tracking | Sep 27, 2024 | Multi-Object Trackingobject-detection | CodeCode Available | 1 |
| BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices | Sep 25, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation | Sep 25, 2024 | DecoderImage Generation | CodeCode Available | 1 |
| Neuromorphic Drone Detection: an Event-RGB Multimodal Approach | Sep 24, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| PDT: Uav Target Detection Dataset for Pests and Diseases Tree | Sep 24, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule | Sep 21, 2024 | Lung Cancer Diagnosisobject-detection | CodeCode Available | 1 |
| STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Sep 17, 2024 | Multiple Object TrackingObject | CodeCode Available | 1 |
| Towards Physically Realizable Adversarial Attacks in Embodied Vision Navigation | Sep 16, 2024 | Adversarial Robustnessobject-detection | CodeCode Available | 1 |
| GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection | Sep 15, 2024 | Decoderobject-detection | CodeCode Available | 1 |
| SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks | Sep 15, 2024 | Image ClassificationObject Detection | CodeCode Available | 1 |
| When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking | Sep 10, 2024 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 1 |
| LEROjD: Lidar Extended Radar-Only Object Detection | Sep 9, 2024 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| Visual Grounding with Multi-modal Conditional Adaptation | Sep 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Can OOD Object Detectors Learn from Foundation Models? | Sep 8, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection | Sep 7, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| SSFam: Scribble Supervised Salient Object Detection Family | Sep 7, 2024 | DecoderObject | CodeCode Available | 1 |
| LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones | Sep 5, 2024 | CPUGPU | CodeCode Available | 1 |
| Latent Distillation for Continual Object Detection at the Edge | Sep 3, 2024 | Class-Incremental Object DetectionContinual Learning | CodeCode Available | 1 |
| Frequency-Spatial Entanglement Learning for Camouflaged Object Detection | Sep 3, 2024 | Objectobject-detection | CodeCode Available | 1 |
| GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection | Sep 3, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Fisher Information guided Purification against Backdoor Attacks | Sep 1, 2024 | Action Recognitionbackdoor defense | CodeCode Available | 1 |
| Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training | Aug 30, 2024 | Image ClassificationMamba | CodeCode Available | 1 |
| PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View | Aug 29, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 1 |
| NAS-BNN: Neural Architecture Search for Binary Neural Networks | Aug 28, 2024 | Neural Architecture Searchobject-detection | CodeCode Available | 1 |
| A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Aug 28, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Aug 27, 2024 | Decoderobject-detection | CodeCode Available | 1 |
| Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection | Aug 27, 2024 | Decoderobject-detection | CodeCode Available | 1 |
| A Lightweight Insulator Defect Detection Model Based on Drone Images | Aug 26, 2024 | Defect DetectionInsulator Defect Detection | CodeCode Available | 1 |
| UMAD: University of Macau Anomaly Detection Benchmark Dataset | Aug 22, 2024 | Anomaly DetectionChange Detection | CodeCode Available | 1 |
| OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion | Aug 22, 2024 | Decoderobject-detection | CodeCode Available | 1 |
| SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Aug 19, 2024 | 3D Hand Pose EstimationAction Recognition | CodeCode Available | 1 |
| PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Aug 17, 2024 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification | Aug 16, 2024 | Fine-Grained Image Classificationobject-detection | CodeCode Available | 1 |
| Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Aug 15, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Unified-IoU: For High-Quality Object Detection | Aug 13, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection | Aug 11, 2024 | Few-Shot Object Detectionobject-detection | CodeCode Available | 1 |
| FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Aug 11, 2024 | Moving Object DetectionObject | CodeCode Available | 1 |
| UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios | Aug 9, 2024 | BenchmarkingHuman Detection | CodeCode Available | 1 |
| SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Aug 8, 2024 | Autonomous VehiclesObject | CodeCode Available | 1 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object DetectionCode Generation | CodeCode Available | 1 |
| Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian | Aug 7, 2024 | Autonomous Drivingobject-detection | CodeCode Available | 1 |