| ParFormer: A Vision Transformer with Parallel Mixer and Sparse Channel Attention Patch Embedding | Mar 22, 2024 | GPU, Image Classification | Unverified | 0 |
| Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection | Mar 22, 2024 | 3D Object Detection, object-detection | Unverified | 0 |
| An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning | Mar 22, 2024 | object-detection, Object Detection | Code Available | 0 |
| Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection | Mar 22, 2024 | object-detection, Object Detection | Code Available | 0 |
| Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | Mar 21, 2024 | Decoder, Object | Unverified | 0 |
| Deep Active Learning: A Reality Check | Mar 21, 2024 | Active Learning, object-detection | Unverified | 0 |
| 3D Object Detection from Point Cloud via Voting Step Diffusion | Mar 21, 2024 | 3D Object Detection, Object | Code Available | 0 |
| Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images | Mar 20, 2024 | Contrastive Learning, Few-Shot Object Detection | Unverified | 0 |
| EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration | Mar 20, 2024 | Classification, Object | Unverified | 0 |
| EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union | Mar 20, 2024 | Autonomous Driving, Object | Unverified | 0 |
| DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception | Mar 20, 2024 | Attribute, Data Augmentation | Unverified | 0 |
| Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization | Mar 20, 2024 | Line Detection, object-detection | Unverified | 0 |
| As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? | Mar 19, 2024 | Adversarial Attack, Image Captioning | Unverified | 0 |
| Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition | Mar 19, 2024 | Dense Captioning, Image Captioning | Unverified | 0 |
| TransformMix: Learning Transformation and Mixing Strategies from Data | Mar 19, 2024 | Data Augmentation, Knowledge Distillation | Unverified | 0 |
| TAPTR: Tracking Any Point with Transformers as Detection | Mar 19, 2024 | object-detection, Object Detection | Unverified | 0 |
| Wildfire danger prediction optimization with transfer learning | Mar 19, 2024 | Anomaly Detection, object-detection | Code Available | 0 |
| SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Mar 19, 2024 | 3D Object Detection, Decoder | Unverified | 0 |
| EffiPerception: an Efficient Framework for Various Perception Tasks | Mar 18, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D | Mar 18, 2024 | Object, object-detection | Unverified | 0 |
| TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction | Mar 18, 2024 | Autonomous Driving, Neural Architecture Search | Unverified | 0 |
| GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Mar 18, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors | Mar 18, 2024 | CPU, Event-based vision | Unverified | 0 |
| Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mar 18, 2024 | 3D Object Detection, Diversity | Unverified | 0 |
| FlexCap: Describe Anything in Images in Controllable Detail | Mar 18, 2024 | Attribute, Dense Captioning | Unverified | 0 |