| On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes | Aug 20, 2024 | Object, object-detection | Unverified | 0 |
| A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training | Aug 20, 2024 | Autonomous Vehicles, Computational Efficiency | Code Available | 0 |
| Vision Calorimeter: Migrating Visual Object Detector to High-energy Particle Images | Aug 20, 2024 | Deep Learning, object-detection | Code Available | 0 |
| Detection of Intracranial Hemorrhage for Trauma Patients | Aug 20, 2024 | 3D Object Detection, Anatomy | Code Available | 0 |
| Just a Hint: Point-Supervised Camouflaged Object Detection | Aug 20, 2024 | Contrastive Learning, Object | Unverified | 0 |
| A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection | Aug 20, 2024 | Data Augmentation, Few-Shot Object Detection | Unverified | 0 |
| SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Aug 20, 2024 | Knowledge Distillation, object-detection | Unverified | 0 |
| IDD-YOLOv5: A Lightweight Insulator Defect Real-time Detection Algorithm | Aug 19, 2024 | Defect Detection, Insulator Defect Detection | Code Available | 0 |
| Leveraging Superfluous Information in Contrastive Representation Learning | Aug 19, 2024 | Contrastive Learning, image-classification | Unverified | 0 |
| Latent Diffusion for Guided Document Table Generation | Aug 19, 2024 | object-detection, Object Detection | Unverified | 0 |
| SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Aug 19, 2024 | 3D Hand Pose Estimation, Action Recognition | Code Available | 1 |
| Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving | Aug 19, 2024 | Adversarial Robustness, Autonomous Driving | Code Available | 0 |
| Boundary-Recovering Network for Temporal Action Detection | Aug 18, 2024 | Action Detection, object-detection | Unverified | 0 |
| Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection | Aug 18, 2024 | object-detection, Object Detection | Unverified | 0 |
| YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems | Aug 18, 2024 | object-detection, Object Detection | Unverified | 0 |
| Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Aug 17, 2024 | Novel Concepts, Object | Code Available | 3 |
| GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Aug 17, 2024 | Multiple Object Tracking, Object | Unverified | 0 |
| PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Aug 17, 2024 | Adversarial Robustness, Benchmarking | Code Available | 1 |
| MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Aug 17, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Depth-guided Texture Diffusion for Image Semantic Segmentation | Aug 17, 2024 | Object, object-detection | Unverified | 0 |
| Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification | Aug 16, 2024 | Fine-Grained Image Classification, object-detection | Code Available | 1 |
| Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques | Aug 16, 2024 | Federated Learning, Object | Unverified | 0 |
| Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs | Aug 16, 2024 | Common Sense Reasoning, image-classification | Unverified | 0 |
| Multimodal Relational Triple Extraction with Query-based Entity Object Transformer | Aug 16, 2024 | Knowledge Graphs, Object | Unverified | 0 |
| 5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Aug 15, 2024 | image-classification, Image Classification | Code Available | 3 |
| Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Aug 15, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Aug 15, 2024 | Object, object-detection | Code Available | 2 |
| SC3D: Label-Efficient Outdoor 3D Object Detection via Single Click Annotation | Aug 15, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection | Aug 15, 2024 | object-detection, Object Detection | Unverified | 0 |
| Learned Multimodal Compression for Autonomous Driving | Aug 15, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Sign language recognition based on deep learning and low-cost handcrafted descriptors | Aug 14, 2024 | object-detection, Object Detection | Code Available | 0 |
| Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Aug 14, 2024 | 3D Object Detection, 3D Object Tracking | Code Available | 3 |
| Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Aug 14, 2024 | Efficient Neural Network, Model Compression | Unverified | 0 |
| See It All: Contextualized Late Aggregation for 3D Dense Captioning | Aug 14, 2024 | 3D dense captioning, All | Unverified | 0 |
| Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces | Aug 13, 2024 | Attribute, Language Modeling | Unverified | 0 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement Learning, Object | Code Available | 1 |
| Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries | Aug 13, 2024 | 3D Object Detection, BEV Segmentation | Unverified | 0 |
| Unified-IoU: For High-Quality Object Detection | Aug 13, 2024 | Object, object-detection | Code Available | 1 |
| Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Aug 13, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers | Aug 13, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| Latent Disentanglement for Low Light Image Enhancement | Aug 12, 2024 | Disentanglement, Image Enhancement | Unverified | 0 |
| Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts | Aug 12, 2024 | Anomaly Detection, Event Detection | Unverified | 0 |
| MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception | Aug 12, 2024 | 3D Object Detection, object-detection | Code Available | 0 |
| DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection | Aug 12, 2024 | Decoder, Object | Code Available | 0 |
| Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes | Aug 12, 2024 | Contrastive Learning, object-detection | Unverified | 0 |
| MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection | Aug 12, 2024 | 3D Object Detection, Autonomous Vehicles | Unverified | 0 |
| Optimizing Vision Transformers with Data-Free Knowledge Transfer | Aug 12, 2024 | Knowledge Distillation, object-detection | Unverified | 0 |
| PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection | Aug 11, 2024 | Few-Shot Object Detection, object-detection | Code Available | 1 |
| FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Aug 11, 2024 | Moving Object Detection, Object | Code Available | 1 |
| U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training | Aug 11, 2024 | Denoising, Object | Code Available | 0 |