| A Modern Take on Visual Relationship Reasoning for Grasp Planning | Sep 3, 2024 | Graph Generation, Object | Unverified | 0 |
| Improving Apple Object Detection with Occlusion-Enhanced Distillation | Sep 3, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Segmenting Object Affordances: Reproducibility and Sensitivity to Scale | Sep 3, 2024 | Object, Segmentation | Code Available | 0 |
| Understanding Multimodal Hallucination with Parameter-Free Representation Alignment | Sep 2, 2024 | Hallucination, Object | Code Available | 0 |
| DS MYOLO: A Reliable Object Detector Based on SSMs for Driving Scenarios | Sep 2, 2024 | Mamba, Object | Unverified | 0 |
| From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation | Sep 2, 2024 | Computational Efficiency, Image Segmentation | Unverified | 0 |
| ReMOVE: A Reference-free Metric for Object Erasure | Sep 1, 2024 | Image Generation, Object | Unverified | 0 |
| Detection, Recognition and Pose Estimation of Tabletop Objects | Sep 1, 2024 | Object, object-detection | Unverified | 0 |
| COMOGen: A Controllable Text-to-3D Multi-object Generation Framework | Sep 1, 2024 | Object, Text to 3D | Unverified | 0 |
| TrackSSM: A General Motion Predictor by State-Space Model | Aug 31, 2024 | Decoder, Mamba | Code Available | 1 |
| Toward a More Complete OMR Solution | Aug 31, 2024 | Object, object-detection | Code Available | 0 |
| EraseDraw: Learning to Draw Step-by-Step via Erasing Objects from Images | Aug 31, 2024 | Object | Unverified | 0 |
| UTrack: Multi-Object Tracking with Uncertain Detections | Aug 30, 2024 | Autonomous Driving, Multi-Object Tracking | Code Available | 2 |
| BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities | Aug 30, 2024 | 6D Pose Estimation, Object | Unverified | 0 |
| Analyzing Errors in Controlled Turret System Given Target Location Input from Artificial Intelligence Methods in Automatic Target Recognition | Aug 29, 2024 | Object, object-detection | Unverified | 0 |
| PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View | Aug 29, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Aug 29, 2024 | Object, Pose Estimation | Code Available | 1 |
| PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification | Aug 29, 2024 | Diversity, Object | Unverified | 0 |
| Anno-incomplete Multi-dataset Detection | Aug 29, 2024 | Multi-Task Learning, Object | Unverified | 0 |
| Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS | Aug 29, 2024 | Object, Object Recognition | Code Available | 0 |
| Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients | Aug 28, 2024 | Object, object-detection | Unverified | 0 |
| A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Aug 28, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| microYOLO: Towards Single-Shot Object Detection on Microcontrollers | Aug 28, 2024 | GPU, Object | Unverified | 0 |
| Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation | Aug 28, 2024 | Object, Semantic Segmentation | Code Available | 2 |
| Network transferability of adversarial patches in real-time object detection | Aug 28, 2024 | Adversarial Attack, Object | Code Available | 0 |
| What is YOLOv8: An In-Depth Exploration of the Internal Features of the Next-Generation Object Detector | Aug 28, 2024 | Object, object-detection | Unverified | 0 |
| TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning | Aug 28, 2024 | Object, Out-of-Distribution Detection | Code Available | 0 |
| Object Detection for Vehicle Dashcams using Transformers | Aug 28, 2024 | Management, Object | Unverified | 0 |
| Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation | Aug 27, 2024 | Image Generation, Object | Unverified | 0 |
| An Investigation on The Position Encoding in Vision-Based Dynamics Prediction | Aug 27, 2024 | Object, Position | Unverified | 0 |
| 3D-Aware Manipulation with Object-Centric Gaussian Splatting | Aug 26, 2024 | Object, Simulated Gaussian Manipulation | Unverified | 0 |
| A Survey of Camouflaged Object Detection and Beyond | Aug 26, 2024 | Instance Segmentation, Object | Code Available | 3 |
| PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection | Aug 26, 2024 | 3D Object Detection, Object | Unverified | 0 |
| Dense Center-Direction Regression for Object Counting and Localization with Point Supervision | Aug 26, 2024 | Object, Object Counting | Code Available | 0 |
| Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes | Aug 26, 2024 | Object, Point cloud reconstruction | Code Available | 0 |
| Beyond Few-shot Object Detection: A Detailed Survey | Aug 26, 2024 | Few-Shot Learning, Few-Shot Object Detection | Unverified | 0 |
| CV-MOS: A Cross-View Model for Motion Segmentation | Aug 25, 2024 | Autonomous Driving, Motion Segmentation | Code Available | 0 |
| Camouflaged Object Tracking: A Benchmark | Aug 25, 2024 | Object, Object Tracking | Code Available | 0 |
| OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation | Aug 25, 2024 | 3D Object Detection, Navigate | Code Available | 0 |
| InterTrack: Tracking Human Object Interaction without Object Templates | Aug 25, 2024 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Decentralised Variational Inference Frameworks for Multi-object Tracking on Sensor Networks: Additional Notes | Aug 24, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| CSS-Segment: 2nd Place Report of LSVOS Challenge VOS Track | Aug 24, 2024 | Autonomous Driving, Object | Unverified | 0 |
| Towards learning digital twin: case study on an anisotropic non-ideal rotor system | Aug 23, 2024 | Lifelong learning, Object | Unverified | 0 |
| SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Aug 23, 2024 | 3D Object Editing, NeRF | Code Available | 0 |
| Identifying Crucial Objects in Blind and Low-Vision Individuals' Navigation | Aug 23, 2024 | Object | Unverified | 0 |
| Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding | Aug 23, 2024 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Context-Aware Temporal Embedding of Objects in Video Data | Aug 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ShapeICP: Iterative Category-level Object Pose and Shape Estimation from Depth | Aug 23, 2024 | Object | Unverified | 0 |
| MCTR: Multi Camera Tracking Transformer | Aug 23, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| CatFree3D: Category-agnostic 3D Object Detection with Diffusion | Aug 22, 2024 | 3D Object Detection, Autonomous Vehicles | Unverified | 0 |