| Visual Grounding with Attention-Driven Constraint Balancing | Jul 3, 2024 | Objectobject-detection | —Unverified | 0 |
| EgoFlowNet: Non-Rigid Scene Flow from Point Clouds with Ego-Motion Support | Jul 3, 2024 | ClusteringObject | —Unverified | 0 |
| Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking | Jul 3, 2024 | ObjectRepresentation Learning | —Unverified | 0 |
| Magic Insert: Style-Aware Drag-and-Drop | Jul 2, 2024 | Domain AdaptationObject | —Unverified | 0 |
| HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes | Jul 2, 2024 | DecoderHuman-Object Interaction Detection | —Unverified | 0 |
| Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Jul 1, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Formal Verification of Deep Neural Networks for Object Detection | Jul 1, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Grouped Discrete Representation Guides Object-Centric Learning | Jul 1, 2024 | AttributeObject | —Unverified | 0 |
| DroBoost: An Intelligent Score and Model Boosting Method for Drone Detection | Jun 30, 2024 | Objectobject-detection | —Unverified | 0 |
| Object Space is Embodied | Jun 28, 2024 | Object | —Unverified | 0 |
| EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Jun 28, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators | Jun 28, 2024 | DecoderObject | —Unverified | 0 |
| Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking | Jun 28, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results | Jun 27, 2024 | Objectobject-detection | CodeCode Available | 0 |
| ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data | Jun 27, 2024 | Contact-rich ManipulationObject | —Unverified | 0 |
| HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection | Jun 27, 2024 | Objectobject-detection | CodeCode Available | 0 |
| CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection | Jun 26, 2024 | 3D Object DetectionDomain Adaptation | —Unverified | 0 |
| 3D Feature Distillation with Object-Centric Priors | Jun 26, 2024 | 3D Instance SegmentationInstance Segmentation | —Unverified | 0 |
| Geometric Features Enhanced Human-Object Interaction Detection | Jun 26, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| SpY: A Context-Based Approach to Spacecraft Component Detection | Jun 26, 2024 | Objectobject-detection | —Unverified | 0 |
| Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Jun 26, 2024 | Collision AvoidanceHuman-Object Interaction Detection | —Unverified | 0 |
| Pixel-weighted Multi-pose Fusion for Metal Artifact Reduction in X-ray Computed Tomography | Jun 25, 2024 | Computed Tomography (CT)Metal Artifact Reduction | —Unverified | 0 |
| Human-Object Interaction from Human-Level Instructions | Jun 25, 2024 | Common Sense ReasoningHuman-Object Interaction Detection | —Unverified | 0 |
| ET tu, CLIP? Addressing Common Object Errors for Unseen Environments | Jun 25, 2024 | Objectobject-detection | —Unverified | 0 |
| Towards Open-set Camera 3D Object Detection | Jun 25, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Vision Controlled Sensorized Prosthetic Hand | Jun 25, 2024 | Object | CodeCode Available | 0 |
| OCALM: Object-Centric Assessment with Language Models | Jun 24, 2024 | ObjectReinforcement Learning (RL) | —Unverified | 0 |
| Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments | Jun 24, 2024 | Contrastive LearningObject | —Unverified | 0 |
| High-resolution open-vocabulary object 6D pose estimation | Jun 24, 2024 | 6D Pose EstimationObject | —Unverified | 0 |
| LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control | Jun 23, 2024 | Novel View SynthesisObject | —Unverified | 0 |
| Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot Learning | Jun 21, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 0 |
| GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation | Jun 21, 2024 | Object | —Unverified | 0 |
| Unseen Object Reasoning with Shared Appearance Cues | Jun 21, 2024 | DiversityObject | CodeCode Available | 0 |
| Image Conductor: Precision Control for Interactive Video Synthesis | Jun 21, 2024 | Object | —Unverified | 0 |
| CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics | Jun 20, 2024 | Human-Object Interaction DetectionHumanoid Control | —Unverified | 0 |
| Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation | Jun 20, 2024 | NavigateObject | —Unverified | 0 |
| Semantic Enhanced Few-shot Object Detection | Jun 19, 2024 | Few-Shot Object DetectionObject | —Unverified | 0 |
| 3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data | Jun 19, 2024 | 3D Instance Segmentation3D Object Recognition | —Unverified | 0 |
| SMORE: Simultaneous Map and Object REconstruction | Jun 19, 2024 | Depth CompletionDynamic Reconstruction | —Unverified | 0 |
| On rough mereology and VC-dimension in treatment of decision prediction for open world decision systems | Jun 19, 2024 | Object | —Unverified | 0 |
| Certified ML Object Detection for Surveillance Missions | Jun 18, 2024 | Objectobject-detection | —Unverified | 0 |
| GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation | Jun 18, 2024 | Contrastive LearningObject | —Unverified | 0 |
| Beyond Visual Appearances: Privacy-sensitive Objects Identification via Hybrid Graph Reasoning | Jun 18, 2024 | Data AugmentationGraph Generation | —Unverified | 0 |
| Online Multi-camera People Tracking with Spatial-temporal Mechanism and Anchor-feature Hierarchical Clustering | Jun 17, 2024 | Multi-Object TrackingObject | CodeCode Available | 0 |
| Overlap Suppression Clustering for Offline Multi-Camera People Tracking | Jun 17, 2024 | ClusteringMulti-Object Tracking | —Unverified | 0 |
| Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection | Jun 17, 2024 | 3D Object DetectionDomain Adaptation | CodeCode Available | 0 |
| YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection | Jun 17, 2024 | Objectobject-detection | —Unverified | 0 |
| V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results | Jun 17, 2024 | Objectobject-detection | —Unverified | 0 |
| Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags | Jun 16, 2024 | Image to textInstruction Following | —Unverified | 0 |
| SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection | Jun 16, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |