| BankTweak: Adversarial Attack against Multi-Object Trackers by Manipulating Feature Banks | Aug 22, 2024 | Adversarial AttackMulti-Object Tracking | —Unverified | 0 |
| Class-balanced Open-set Semi-supervised Object Detection for Medical Images | Aug 22, 2024 | Objectobject-detection | —Unverified | 0 |
| Real-Time Incremental Explanations for Object Detectors | Aug 21, 2024 | Object | —Unverified | 0 |
| A Survey of Embodied Learning for Object-Centric Robotic Manipulation | Aug 21, 2024 | Imitation LearningObject | CodeCode Available | 3 |
| Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models | Aug 21, 2024 | DenoisingImage Generation | —Unverified | 0 |
| Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection | Aug 21, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| Low-Light Object Tracking: A Benchmark | Aug 21, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Aug 21, 2024 | 2D Object Detectionimage-classification | —Unverified | 0 |
| On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes | Aug 20, 2024 | Objectobject-detection | —Unverified | 0 |
| Target-Oriented Object Grasping via Multimodal Human Guidance | Aug 20, 2024 | Motion PlanningObject | —Unverified | 0 |
| OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding | Aug 20, 2024 | ObjectScene Understanding | CodeCode Available | 1 |
| Aligning Object Detector Bounding Boxes with Human Preference | Aug 20, 2024 | Object | CodeCode Available | 0 |
| Just a Hint: Point-Supervised Camouflaged Object Detection | Aug 20, 2024 | Contrastive LearningObject | —Unverified | 0 |
| LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS | Aug 20, 2024 | Instance SegmentationObject | —Unverified | 0 |
| A Review of Human-Object Interaction Detection | Aug 20, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields | Aug 19, 2024 | Contrastive LearningObject | —Unverified | 0 |
| RUMI: Rummaging Using Mutual Information | Aug 19, 2024 | Model Predictive ControlObject | CodeCode Available | 4 |
| Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action Masking | Aug 19, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 0 |
| 3D-Aware Instance Segmentation and Tracking in Egocentric Videos | Aug 19, 2024 | 3D Object ReconstructionInstance Segmentation | —Unverified | 0 |
| Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | Aug 19, 2024 | ObjectSegmentation | —Unverified | 0 |
| Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Aug 19, 2024 | Inverse RenderingObject | —Unverified | 0 |
| Retina-Inspired Object Motion Segmentation for Event-Cameras | Aug 18, 2024 | Decision MakingMotion Compensation | —Unverified | 0 |
| Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Aug 17, 2024 | Novel ConceptsObject | CodeCode Available | 3 |
| MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation | Aug 17, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Aug 17, 2024 | Multiple Object TrackingObject | —Unverified | 0 |
| Zero-Shot Object-Centric Representation Learning | Aug 17, 2024 | ObjectObject Discovery | —Unverified | 0 |
| PADetBench: Towards Benchmarking Physical Attacks against Object Detection | Aug 17, 2024 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| Depth-guided Texture Diffusion for Image Semantic Segmentation | Aug 17, 2024 | Objectobject-detection | —Unverified | 0 |
| Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques | Aug 16, 2024 | Federated LearningObject | —Unverified | 0 |
| TEXTOC: Text-driven Object-Centric Style Transfer | Aug 16, 2024 | ObjectStyle Transfer | —Unverified | 0 |
| Multimodal Relational Triple Extraction with Query-based Entity Object Transformer | Aug 16, 2024 | Knowledge GraphsObject | —Unverified | 0 |
| FunEditor: Achieving Complex Image Edits via Function Aggregation with Diffusion Models | Aug 16, 2024 | Image Quality AssessmentObject | —Unverified | 0 |
| Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation | Aug 15, 2024 | 3D ReconstructionObject | CodeCode Available | 1 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Aug 15, 2024 | Objectobject-detection | CodeCode Available | 2 |
| Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Aug 14, 2024 | 3D Object Detection3D Object Tracking | CodeCode Available | 3 |
| See It All: Contextualized Late Aggregation for 3D Dense Captioning | Aug 14, 2024 | 3D dense captioningAll | —Unverified | 0 |
| Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Aug 14, 2024 | Efficient Neural NetworkModel Compression | —Unverified | 0 |
| Bi-directional Contextual Attention for 3D Dense Captioning | Aug 13, 2024 | 3D dense captioningAttribute | —Unverified | 0 |
| Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions | Aug 13, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries | Aug 13, 2024 | 3D Object DetectionBEV Segmentation | —Unverified | 0 |
| Unified-IoU: For High-Quality Object Detection | Aug 13, 2024 | Objectobject-detection | CodeCode Available | 1 |
| SceneGPT: A Language Model for 3D Scene Understanding | Aug 13, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields | Aug 13, 2024 | Novel View SynthesisObject | —Unverified | 0 |
| DC3DO: Diffusion Classifier for 3D Objects | Aug 13, 2024 | 3D Object ClassificationClassification | CodeCode Available | 1 |
| MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection | Aug 12, 2024 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |
| DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection | Aug 12, 2024 | DecoderObject | CodeCode Available | 0 |
| Robust Domain Generalization for Multi-modal Object Recognition | Aug 11, 2024 | Domain GeneralizationMulti-Label Classification | —Unverified | 0 |
| U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training | Aug 11, 2024 | DenoisingObject | CodeCode Available | 0 |
| MacFormer: Semantic Segmentation with Fine Object Boundaries | Aug 11, 2024 | DecoderObject | —Unverified | 0 |