| UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection | Dec 13, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Dec 10, 2024 | Domain AdaptationKnowledge Distillation | CodeCode Available | 1 |
| Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Dec 6, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Dec 3, 2024 | Event-based visionEvent Detection | CodeCode Available | 1 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Nov 29, 2024 | 3D geometry3DGS | CodeCode Available | 1 |
| COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Nov 28, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Deep Fourier-embedded Network for Bi-modal Salient Object Detection | Nov 27, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Nov 27, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation | Nov 26, 2024 | Articlesobject-detection | CodeCode Available | 1 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance SegmentationKnowledge Distillation | CodeCode Available | 1 |
| Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks | Nov 25, 2024 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Towards RAW Object Detection in Diverse Conditions | Nov 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 24, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Highly Efficient and Unsupervised Framework for Moving Object Detection in Satellite Videos | Nov 24, 2024 | Moving Object Detectionobject-detection | CodeCode Available | 1 |
| OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Nov 23, 2024 | Keypoint DetectionObject | CodeCode Available | 1 |
| Physics-Guided Detector for SAR Airplanes | Nov 19, 2024 | Object DetectionSelf-Supervised Learning | CodeCode Available | 1 |
| Vision Eagle Attention: a new lens for advancing image classification | Nov 15, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| RETR: Multi-View Radar Detection Transformer for Indoor Perception | Nov 15, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Nov 14, 2024 | Computational EfficiencyObject | CodeCode Available | 1 |
| Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 12, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction | Nov 11, 2024 | Autonomous VehiclesInstance Segmentation | CodeCode Available | 1 |
| LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Nov 9, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Nov 9, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection | Nov 6, 2024 | Contrastive Learningobject-detection | CodeCode Available | 1 |
| CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Nov 5, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 1 |
| Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Nov 5, 2024 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 1 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Nov 4, 2024 | 4kgeo-localization | CodeCode Available | 1 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels | Oct 29, 2024 | Feature Upsamplingobject-detection | CodeCode Available | 1 |
| PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Oct 29, 2024 | Objectobject-detection | CodeCode Available | 1 |
| IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Oct 28, 2024 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Oct 28, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |
| Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Oct 27, 2024 | Image Super-Resolutionobject-detection | CodeCode Available | 1 |
| Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images | Oct 24, 2024 | Depth EstimationImage Rescaling | CodeCode Available | 1 |
| Optimizing Edge Offloading Decisions for Object Detection | Oct 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Oct 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Oct 23, 2024 | Image RestorationObject | CodeCode Available | 1 |
| PlantCamo: Plant Camouflage Detection | Oct 23, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Oct 23, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Fire and Smoke Detection with Burning Intensity Representation | Oct 22, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool | Oct 20, 2024 | Multiple Object TrackingObject | CodeCode Available | 1 |
| MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection | Oct 19, 2024 | Mambaobject-detection | CodeCode Available | 1 |
| Real-time Stereo-based 3D Object Detection for Streaming Perception | Oct 16, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Oct 15, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Oct 14, 2024 | Instance SegmentationMamba | CodeCode Available | 1 |
| GlobalMamba: Global Image Serialization for Vision Mamba | Oct 14, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Oct 11, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Oct 11, 2024 | General Knowledgeobject-detection | CodeCode Available | 1 |