| Robust Domain Generalization for Multi-modal Object Recognition | Aug 11, 2024 | Domain Generalization, Multi-Label Classification | Unverified | 0 |
| SABER-6D: Shape Representation Based Implicit Object Pose Estimation | Aug 11, 2024 | Decoder, Object | Unverified | 0 |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Aug 9, 2024 | Image to text, Object | Code Available | 2 |
| Embodied Uncertainty-Aware Object Segmentation | Aug 8, 2024 | Instance Segmentation, Interactive Segmentation | Unverified | 0 |
| SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Aug 8, 2024 | Autonomous Vehicles, Object | Code Available | 1 |
| Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models | Aug 8, 2024 | Contrastive Learning, Fine-Grained Image Recognition | Unverified | 0 |
| Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Aug 7, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling | Aug 7, 2024 | Attribute, Language Modeling | Unverified | 0 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object Detection, Code Generation | Code Available | 1 |
| Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD) | Aug 6, 2024 | Object | Unverified | 0 |
| An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion | Aug 6, 2024 | 3D Shape Generation, Image Generation | Unverified | 0 |
| Understanding How Blind Users Handle Object Recognition Errors: Strategies and Challenges | Aug 6, 2024 | Object, Object Recognition | Unverified | 0 |
| LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion | Aug 6, 2024 | Object, Robotic Grasping | Unverified | 0 |
| Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera | Aug 6, 2024 | Object, Pose Estimation | Code Available | 1 |
| HQOD: Harmonious Quantization for Object Detection | Aug 5, 2024 | Object, object-detection | Code Available | 0 |
| View-consistent Object Removal in Radiance Fields | Aug 4, 2024 | Image Inpainting, Object | Unverified | 0 |
| KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Aug 4, 2024 | 3D Object Detection, Attribute | Code Available | 0 |
| Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation | Aug 4, 2024 | Domain Adaptation, Object | Code Available | 0 |
| Visual Grounding for Object-Level Generalization in Reinforcement Learning | Aug 4, 2024 | Language Modelling, Object | Code Available | 1 |
| A Survey and Evaluation of Adversarial Attacks for Object Detection | Aug 4, 2024 | Adversarial Robustness, Autonomous Vehicles | Unverified | 0 |
| Do You Remember . . . the Future? Weak-to-Strong generalization in 3D Object Detection | Aug 3, 2024 | 3D Object Detection, Knowledge Distillation | Code Available | 0 |
| Supervised Image Translation from Visible to Infrared Domain for Object Detection | Aug 3, 2024 | Generative Adversarial Network, Object | Unverified | 0 |
| Domain penalisation for improved Out-of-Distribution Generalisation | Aug 3, 2024 | Object, object-detection | Unverified | 0 |
| LAM3D: Leveraging Attention for Monocular 3D Object Detection | Aug 3, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| SiamMo: Siamese Motion-Centric 3D Object Tracking | Aug 3, 2024 | 3D Object Tracking, 3D Single Object Tracking | Code Available | 0 |
| Stimulating Imagination: Towards General-purpose Object Rearrangement | Aug 3, 2024 | Object, Object Localization | Unverified | 0 |
| THOR2: Topological Analysis for 3D Shape and Color-Based Human-Inspired Object Recognition in Unseen Environments | Aug 2, 2024 | Object, Object Recognition | Code Available | 0 |
| An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design | Aug 2, 2024 | Model Compression, Neural Network Compression | Unverified | 0 |
| A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes | Aug 2, 2024 | Foveation, Object | Code Available | 0 |
| Underwater Object Detection Enhancement via Channel Stabilization | Aug 2, 2024 | Image Enhancement, Object | Code Available | 0 |
| PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | Aug 2, 2024 | 4k, 8k | Unverified | 0 |
| Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Aug 2, 2024 | Object, object-detection | Code Available | 0 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignment, Multiple Object Tracking | Code Available | 2 |
| Extracting Object Heights From LiDAR & Aerial Imagery | Aug 2, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| SOCIAL MEDIA MANAGEMENT SYSTEM PROJECT REPORT. | Aug 1, 2024 | Management, Object | Unverified | 0 |
| MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Aug 1, 2024 | Autonomous Driving, Object | Unverified | 0 |
| Diff3DETR:Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Aug 1, 2024 | 3D Object Detection, Decoder | Unverified | 0 |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | Aug 1, 2024 | Data Augmentation, Diversity | Unverified | 0 |
| RoCo:Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Aug 1, 2024 | Autonomous Driving, Object | Code Available | 1 |
| PEAR: Phrase-Based Hand-Object Interaction Anticipation | Jul 31, 2024 | Object | Unverified | 0 |
| MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Jul 31, 2024 | Language Modelling, Object | Code Available | 1 |
| A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap | Jul 31, 2024 | Human-Object Interaction Detection, Image Reconstruction | Code Available | 0 |
| Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation | Jul 31, 2024 | Object, Segmentation | Code Available | 0 |
| Dynamic Object Queries for Transformer-based Incremental Object Detection | Jul 31, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| EZSR: Event-based Zero-Shot Recognition | Jul 31, 2024 | Object, Object Recognition | Unverified | 0 |
| Spatial Transformer Network YOLO Model for Agricultural Object Detection | Jul 31, 2024 | Object, object-detection | Code Available | 1 |
| What is YOLOv5: A deep look into the internal features of the popular object detector | Jul 30, 2024 | Object, object-detection | Unverified | 0 |
| Monocular Human-Object Reconstruction in the Wild | Jul 30, 2024 | Diversity, Human-Object Interaction Detection | Code Available | 1 |
| 3D-GRES: Generalized 3D Referring Expression Segmentation | Jul 30, 2024 | Object, Referring Expression | Code Available | 1 |
| StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset | Jul 30, 2024 | Human-Object Interaction Detection, Object | Code Available | 1 |