| Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments | Mar 20, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition | Mar 20, 2024 | Contrastive Learning, Fine-Grained Visual Recognition | Code Available | 2 |
| SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Mar 19, 2024 | 3D Object Detection, Decoder | Unverified | 0 |
| TAPTR: Tracking Any Point with Transformers as Detection | Mar 19, 2024 | object-detection, Object Detection | Unverified | 0 |
| As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Mar 19, 2024 | Adversarial Attack, Image Captioning | Unverified | 0 |
| Wildfire danger prediction optimization with transfer learning | Mar 19, 2024 | Anomaly Detection, object-detection | Code Available | 0 |
| Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition | Mar 19, 2024 | Dense Captioning, Image Captioning | Unverified | 0 |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Mar 19, 2024 | Object, object-detection | Code Available | 1 |
| TransformMix: Learning Transformation and Mixing Strategies from Data | Mar 19, 2024 | Data Augmentation, Knowledge Distillation | Unverified | 0 |
| EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks | Mar 19, 2024 | object-detection, Object Detection | Code Available | 1 |
| VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation | Mar 19, 2024 | Anomaly Detection, object-detection | Code Available | 1 |
| EffiPerception: an Efficient Framework for Various Perception Tasks | Mar 18, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D (Prototype of an Automatic Bidirectional People Counter Based on 3D Vision Sensors) | Mar 18, 2024 | Object, object-detection | Unverified | 0 |
| GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Mar 18, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| LSKNet: A Foundation Lightweight Backbone for Remote Sensing | Mar 18, 2024 | Change Detection, object-detection | Code Available | 4 |
| FlexCap: Describe Anything in Images in Controllable Detail | Mar 18, 2024 | Attribute, Dense Captioning | Unverified | 0 |
| Align and Distill: Unifying and Improving Domain Adaptive Object Detection | Mar 18, 2024 | Benchmarking, object-detection | Code Available | 1 |
| Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors | Mar 18, 2024 | CPU, Event-based vision | Unverified | 0 |
| Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mar 18, 2024 | 3D Object Detection, Diversity | Unverified | 0 |
| TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Mar 18, 2024 | Autonomous Driving, Neural Architecture Search | Unverified | 0 |
| Continual Forgetting for Pre-trained Vision Models | Mar 18, 2024 | Continual Forgetting, Face Recognition | Code Available | 2 |
| GRA: Detecting Oriented Objects through Group-wise Rotating and Attention | Mar 17, 2024 | Object, object-detection | Unverified | 0 |
| Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning | Mar 17, 2024 | Edge Detection, Line Detection | Unverified | 0 |
| YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images | Mar 17, 2024 | Data Augmentation, Fracture detection | Code Available | 1 |
| Self-supervised co-salient object detection via feature correspondence at multiple scales | Mar 17, 2024 | Co-Salient Object Detection, object-detection | Code Available | 0 |