| Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Mar 24, 2024 | NeRFobject-detection | —Unverified | 0 |
| Cross-domain Multi-modal Few-shot Object Detection via Rich Text | Mar 24, 2024 | Cross-Domain Few-ShotDomain Adaptation | CodeCode Available | 0 |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Mar 24, 2024 | 2D Object DetectionComputational Efficiency | CodeCode Available | 3 |
| Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions | Mar 23, 2024 | Adversarial Defenseobject-detection | —Unverified | 0 |
| Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection | Mar 22, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| VRSO: Visual-Centric Reconstruction for Static Object Annotation | Mar 22, 2024 | Objectobject-detection | CodeCode Available | 1 |
| IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Mar 22, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 3 |
| ParFormer: A Vision Transformer with Parallel Mixer and Sparse Channel Attention Patch Embedding | Mar 22, 2024 | GPUImage Classification | —Unverified | 0 |
| An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning | Mar 22, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Mar 22, 2024 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 1 |
| Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection | Mar 22, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| SFOD: Spiking Fusion Object Detector | Mar 22, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Deep Active Learning: A Reality Check | Mar 21, 2024 | Active Learningobject-detection | —Unverified | 0 |
| Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering | Mar 21, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | Mar 21, 2024 | DecoderObject | —Unverified | 0 |
| T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Mar 21, 2024 | Contrastive LearningDescriptive | CodeCode Available | 7 |
| 3D Object Detection from Point Cloud via Voting Step Diffusion | Mar 21, 2024 | 3D Object DetectionObject | CodeCode Available | 0 |
| Mask-based Invisible Backdoor Attacks on Object Detection | Mar 20, 2024 | Autonomous DrivingBackdoor Attack | CodeCode Available | 1 |
| EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union | Mar 20, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Mar 20, 2024 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 3 |
| DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception | Mar 20, 2024 | AttributeData Augmentation | —Unverified | 0 |
| Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments | Mar 20, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images | Mar 20, 2024 | Contrastive LearningFew-Shot Object Detection | —Unverified | 0 |
| EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration | Mar 20, 2024 | ClassificationObject | —Unverified | 0 |
| Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization | Mar 20, 2024 | Line Detectionobject-detection | —Unverified | 0 |
| Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments | Mar 20, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition | Mar 20, 2024 | Contrastive LearningFine-Grained Visual Recognition | CodeCode Available | 2 |
| SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Mar 19, 2024 | 3D Object DetectionDecoder | —Unverified | 0 |
| TAPTR: Tracking Any Point with Transformers as Detection | Mar 19, 2024 | object-detectionObject Detection | —Unverified | 0 |
| As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Mar 19, 2024 | Adversarial AttackImage Captioning | —Unverified | 0 |
| Wildfire danger prediction optimization with transfer learning | Mar 19, 2024 | Anomaly Detectionobject-detection | CodeCode Available | 0 |
| Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition | Mar 19, 2024 | Dense CaptioningImage Captioning | —Unverified | 0 |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Mar 19, 2024 | Objectobject-detection | CodeCode Available | 1 |
| TransformMix: Learning Transformation and Mixing Strategies from Data | Mar 19, 2024 | Data AugmentationKnowledge Distillation | —Unverified | 0 |
| EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks | Mar 19, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation | Mar 19, 2024 | Anomaly Detectionobject-detection | CodeCode Available | 1 |
| EffiPerception: an Efficient Framework for Various Perception Tasks | Mar 18, 2024 | 2D Object Detection3D Object Detection | —Unverified | 0 |
| Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D | Mar 18, 2024 | Objectobject-detection | —Unverified | 0 |
| GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Mar 18, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| LSKNet: A Foundation Lightweight Backbone for Remote Sensing | Mar 18, 2024 | Change Detectionobject-detection | CodeCode Available | 4 |
| FlexCap: Describe Anything in Images in Controllable Detail | Mar 18, 2024 | AttributeDense Captioning | —Unverified | 0 |
| Align and Distill: Unifying and Improving Domain Adaptive Object Detection | Mar 18, 2024 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors | Mar 18, 2024 | CPUEvent-based vision | —Unverified | 0 |
| Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mar 18, 2024 | 3D Object DetectionDiversity | —Unverified | 0 |
| TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Mar 18, 2024 | Autonomous DrivingNeural Architecture Search | —Unverified | 0 |
| Continual Forgetting for Pre-trained Vision Models | Mar 18, 2024 | Continual ForgettingFace Recognition | CodeCode Available | 2 |
| GRA: Detecting Oriented Objects through Group-wise Rotating and Attention | Mar 17, 2024 | Objectobject-detection | —Unverified | 0 |
| Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning | Mar 17, 2024 | Edge DetectionLine Detection | —Unverified | 0 |
| YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images | Mar 17, 2024 | Data AugmentationFracture detection | CodeCode Available | 1 |
| Self-supervised co-salient object detection via feature correspondence at multiple scales | Mar 17, 2024 | Co-Salient Object Detectionobject-detection | CodeCode Available | 0 |