| End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting | Sep 19, 2024 | Decoder, Object | Unverified | 0 |
| Enhancing 3D Robotic Vision Robustness by Minimizing Adversarial Mutual Information through a Curriculum Training Approach | Sep 19, 2024 | Adversarial Robustness, object-detection | Code Available | 0 |
| PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash Objects | Sep 19, 2024 | Object, object-detection | Code Available | 0 |
| Applications of Knowledge Distillation in Remote Sensing: A Survey | Sep 18, 2024 | Computational Efficiency, Instance Segmentation | Unverified | 0 |
| RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Sep 18, 2024 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 |
| Agglomerative Token Clustering | Sep 18, 2024 | Clustering, image-classification | Unverified | 0 |
| Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context | Sep 17, 2024 | object-detection, Object Detection | Unverified | 0 |
| VALO: A Versatile Anytime Framework for LiDAR-based Object Detection Deep Neural Networks | Sep 17, 2024 | Object, object-detection | Code Available | 0 |
| TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection | Sep 17, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation | Sep 17, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Sep 17, 2024 | Multiple Object Tracking, Object | Code Available | 1 |
| UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Sep 17, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 4 |
| Online Learning via Memory: Retrieval-Augmented Detector Adaptation | Sep 16, 2024 | Memorization, object-detection | Unverified | 0 |
| CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Sep 16, 2024 | Autonomous Driving, object-detection | Code Available | 0 |
| Towards Physically Realizable Adversarial Attacks in Embodied Vision Navigation | Sep 16, 2024 | Adversarial Robustness, object-detection | Code Available | 1 |
| Self-Updating Vehicle Monitoring Framework Employing Distributed Acoustic Sensing towards Real-World Settings | Sep 16, 2024 | object-detection, Object Detection | Unverified | 0 |
| Performance of Human Annotators in Object Detection and Segmentation of Remotely Sensed Data | Sep 16, 2024 | object-detection, Object Detection | Unverified | 0 |
| DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion | Sep 16, 2024 | Autonomous Driving, Autonomous Navigation | Unverified | 0 |
| Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation | Sep 16, 2024 | 3D Open-Vocabulary Object Detection, Graph Generation | Unverified | 0 |
| LithoHoD: A Litho Simulator-Powered Framework for IC Layout Hotspot Detection | Sep 16, 2024 | Object, object-detection | Unverified | 0 |
| GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection | Sep 15, 2024 | Decoder, object-detection | Code Available | 1 |
| SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks | Sep 15, 2024 | Image Classification, Object Detection | Code Available | 1 |
| Tracking Virtual Meetings in the Wild: Re-identification in Multi-Participant Virtual Meetings | Sep 15, 2024 | object-detection, Object Detection | Unverified | 0 |
| Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection | Sep 15, 2024 | object-detection, Object Detection | Code Available | 0 |
| Enhancing Weakly-Supervised Object Detection on Static Images through (Hallucinated) Motion | Sep 15, 2024 | Object, object-detection | Unverified | 0 |