| TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding | Jan 16, 2024 | Action RecognitionBenchmarking | —Unverified | 0 |
| Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion | Jan 16, 2024 | Objectobject-detection | —Unverified | 0 |
| Multi-task real-robot data with gaze attention for dual-arm fine manipulation | Jan 15, 2024 | Imitation LearningObject | —Unverified | 0 |
| Machine Learning Based Object Tracking | Jan 15, 2024 | Objectobject-detection | —Unverified | 0 |
| CascadeV-Det: Cascade Point Voting for 3D Object Detection | Jan 15, 2024 | 3D Object DetectionObject | CodeCode Available | 0 |
| Discriminative Consensus Mining with A Thousand Groups for More Accurate Co-Salient Object Detection | Jan 15, 2024 | Co-Salient Object DetectionObject | CodeCode Available | 0 |
| Seeing the Unseen: Visual Common Sense for Semantic Placement | Jan 15, 2024 | Common Sense ReasoningImage Description | —Unverified | 0 |
| Domain Adaptation for Large-Vocabulary Object Detectors | Jan 13, 2024 | Domain AdaptationKnowledge Graphs | —Unverified | 0 |
| AffordanceLLM: Grounding Affordance from Vision Language Models | Jan 12, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Embedded Planogram Compliance Control System | Jan 12, 2024 | ManagementNVIDIA Jetson Orin Nano | —Unverified | 0 |
| Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook | Jan 12, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| GO-NeRF: Generating Objects in Neural Radiance Fields for Virtual Reality Content Creation | Jan 11, 2024 | 3D GenerationMixed Reality | —Unverified | 0 |
| Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection | Jan 11, 2024 | Human-Object Interaction DetectionKnowledge Distillation | —Unverified | 0 |
| Object-Centric Diffusion for Efficient Video Editing | Jan 11, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| YOLO-Former: YOLO Shakes Hand With ViT | Jan 11, 2024 | Objectobject-detection | —Unverified | 0 |
| Content-Aware Depth-Adaptive Image Restoration | Jan 10, 2024 | Image RestorationObject | —Unverified | 0 |
| Consensus Focus for Object Detection and minority classes | Jan 10, 2024 | Domain AdaptationLong-tailed Object Detection | CodeCode Available | 0 |
| InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Jan 10, 2024 | 3D scene EditingDepth Estimation | —Unverified | 0 |
| Meta-forests: Domain generalization on random forests with meta-learning | Jan 9, 2024 | Domain GeneralizationMeta-Learning | —Unverified | 0 |
| UFO: Unidentified Foreground Object Detection in 3D Point Cloud | Jan 8, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| UAV-enabled Integrated Sensing and Communication: Tracking Design and Optimization | Jan 8, 2024 | Integrated sensing and communicationISAC | —Unverified | 0 |
| Integrity Assessment of Maritime Object Detection Impacted by Partial Camera Obstruction | Jan 8, 2024 | Decision MakingObject | —Unverified | 0 |
| A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking | Jan 8, 2024 | 2kObject | —Unverified | 0 |
| SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling | Jan 8, 2024 | 3D Object DetectionDomain Adaptation | —Unverified | 0 |
| RHOBIN Challenge: Reconstruction of Human Object Interaction | Jan 7, 2024 | 3D ReconstructionHuman-Object Interaction Detection | —Unverified | 0 |
| LLMs for Robotic Object Disambiguation | Jan 7, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Real Time Human Detection by Unmanned Aerial Vehicles | Jan 6, 2024 | Human DetectionObject | —Unverified | 0 |
| DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation | Jan 6, 2024 | Autonomous DrivingDecoder | CodeCode Available | 0 |
| Object-Centric Instruction Augmentation for Robotic Manipulation | Jan 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Une ontologie pour les systèmes multi-agents ambiants dans les villes intelligentes | Jan 5, 2024 | Object | —Unverified | 0 |
| PAHD: Perception-Action based Human Decision Making using Explainable Graph Neural Networks on SAR Images | Jan 5, 2024 | Decision MakingObject | —Unverified | 0 |
| Clustering-Guided Class Activation for Weakly Supervised Semantic Segmentation | Jan 5, 2024 | ClusteringObject | CodeCode Available | 0 |
| Multimodal Data Curation via Object Detection and Filter Ensembles | Jan 5, 2024 | Objectobject-detection | —Unverified | 0 |
| Object-oriented backdoor attack against image captioning | Jan 5, 2024 | Backdoor AttackImage Captioning | —Unverified | 0 |
| VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection | Jan 5, 2024 | 3D Object DetectionFeature Importance | —Unverified | 0 |
| Fit-NGP: Fitting Object Models to Neural Graphics Primitives | Jan 4, 2024 | ObjectPose Estimation | —Unverified | 0 |
| Towards Efficient Object Re-Identification with A Novel Cloud-Edge Collaborative Framework | Jan 4, 2024 | Collaborative InferenceObject | —Unverified | 0 |
| ShapeAug: Occlusion Augmentation for Event Camera Data | Jan 4, 2024 | Data AugmentationObject | —Unverified | 0 |
| Slot-guided Volumetric Object Radiance Fields | Jan 4, 2024 | ObjectRepresentation Learning | —Unverified | 0 |
| Unsupervised Object-Centric Learning from Multiple Unspecified Viewpoints | Jan 3, 2024 | Object | —Unverified | 0 |
| Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition | Jan 3, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object Detection | Jan 2, 2024 | Objectobject-detection | —Unverified | 0 |
| Depth-discriminative Metric Learning for Monocular 3D Object Detection | Jan 2, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Image Sculpting: Precise Object Editing with 3D Geometry Control | Jan 2, 2024 | 3D geometryObject | —Unverified | 0 |
| Cyclic Learning for Binaural Audio Generation and Localization | Jan 1, 2024 | Audio GenerationObject | —Unverified | 0 |
| CORE-MPI: Consistency Object Removal with Embedding MultiPlane Image | Jan 1, 2024 | Novel View SynthesisObject | —Unverified | 0 |
| Projecting Trackable Thermal Patterns for Dynamic Computer Vision | Jan 1, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Learning to Segment Referred Objects from Narrated Egocentric Videos | Jan 1, 2024 | ObjectSegmentation | —Unverified | 0 |
| Contextual Associated Triplet Queries for Panoptic Scene Graph Generation | Jan 1, 2024 | Graph GenerationObject | —Unverified | 0 |
| Few-Shot Object Detection with Foundation Models | Jan 1, 2024 | Few-Shot LearningFew-Shot Object Detection | —Unverified | 0 |