| Open-Vocabulary Octree-Graph for 3D Scene Understanding | Nov 25, 2024 | ObjectScene Understanding | —Unverified | 0 |
| Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Nov 25, 2024 | ObjectPose Estimation | CodeCode Available | 0 |
| Leverage Task Context for Object Affordance Ranking | Nov 25, 2024 | Objectobject-detection | —Unverified | 0 |
| VideoOrion: Tokenizing Object Dynamics in Videos | Nov 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Boosting 3D Object Generation through PBR Materials | Nov 25, 2024 | Object | —Unverified | 0 |
| CIA: Controllable Image Augmentation Framework Based on Stable Diffusion | Nov 25, 2024 | Image AugmentationObject | CodeCode Available | 0 |
| LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 24, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Towards RAW Object Detection in Diverse Conditions | Nov 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Nov 24, 2024 | ObjectPose Estimation | CodeCode Available | 1 |
| FastTrackTr:Towards Fast Multi-Object Tracking with Transformers | Nov 24, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation | Nov 23, 2024 | Objectobject-detection | —Unverified | 0 |
| Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Nov 23, 2024 | image-classificationImage Classification | —Unverified | 0 |
| OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Nov 23, 2024 | Keypoint DetectionObject | CodeCode Available | 1 |
| ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models | Nov 22, 2024 | HallucinationObject | —Unverified | 0 |
| Instance-Aware Generalized Referring Expression Segmentation | Nov 22, 2024 | Generalized Referring Expression SegmentationObject | —Unverified | 0 |
| A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles | Nov 22, 2024 | Autonomous VehiclesObject | —Unverified | 0 |
| SEMPose: A Single End-to-end Network for Multi-object Pose Estimation | Nov 21, 2024 | ObjectPose Estimation | —Unverified | 0 |
| DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Nov 21, 2024 | Long-tailed Object DetectionObject | CodeCode Available | 5 |
| EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Nov 21, 2024 | 3D ReconstructionObject | CodeCode Available | 2 |
| Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction | Nov 21, 2024 | 3D GenerationGPU | —Unverified | 0 |
| Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity | Nov 20, 2024 | Multiple Object TrackingObject | CodeCode Available | 0 |
| YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Teaching VLMs to Localize Specific Objects from In-context Examples | Nov 20, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| ClickTrack: Towards Real-time Interactive Single Object Tracking | Nov 20, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Find Any Part in 3D | Nov 20, 2024 | 3D Part SegmentationDiversity | CodeCode Available | 2 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| Text-guided Zero-Shot Object Localization | Nov 18, 2024 | ObjectObject Localization | —Unverified | 0 |
| PickScan: Object discovery and reconstruction from handheld interactions | Nov 17, 2024 | ObjectObject Discovery | CodeCode Available | 1 |
| Radio Frequency Ray Tracing with Neural Object Representation | Nov 16, 2024 | Object | —Unverified | 0 |
| Generating Compositional Scenes via Text-to-image RGBA Instance Generation | Nov 16, 2024 | ObjectPrompt Engineering | —Unverified | 0 |
| Structure Tensor Representation for Robust Oriented Object Detection | Nov 15, 2024 | Objectobject-detection | —Unverified | 0 |
| ColorEdit: Training-free Image-Guided Color editing with diffusion model | Nov 15, 2024 | AttributeDenoising | —Unverified | 0 |
| Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Nov 15, 2024 | DescriptiveObject | —Unverified | 0 |
| Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Nov 15, 2024 | energy managementManagement | —Unverified | 0 |
| LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection | Nov 14, 2024 | Objectobject-detection | —Unverified | 0 |
| Image Processing for Motion Magnification | Nov 14, 2024 | Motion MagnificationObject | —Unverified | 0 |
| Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation | Nov 14, 2024 | Dynamic ReconstructionObject | —Unverified | 0 |
| Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Nov 14, 2024 | Contrastive LearningLong-tailed Object Detection | —Unverified | 0 |
| Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Nov 14, 2024 | Computational EfficiencyObject | CodeCode Available | 1 |
| Multimodal Object Detection using Depth and Image Data for Manufacturing Parts | Nov 13, 2024 | Objectobject-detection | —Unverified | 0 |
| Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Nov 13, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| 3D Multi-Object Tracking with Semi-Supervised GRU-Kalman Filter | Nov 13, 2024 | 3D Multi-Object TrackingAutonomous Driving | —Unverified | 0 |
| SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes | Nov 12, 2024 | DenoisingObject | CodeCode Available | 0 |
| 3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Nov 12, 2024 | ObjectPoint Cloud Registration | CodeCode Available | 1 |
| Zero-shot Object-Centric Instruction Following: Integrating Foundation Models with Traditional Navigation | Nov 12, 2024 | Instruction FollowingObject | —Unverified | 0 |
| MureObjectStitch: Multi-reference Image Composition | Nov 12, 2024 | Object | CodeCode Available | 3 |
| Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 12, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking | Nov 11, 2024 | 3D Multi-Object TrackingContrastive Learning | —Unverified | 0 |
| Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Nov 11, 2024 | Object | CodeCode Available | 1 |
| Edify 3D: Scalable High-Quality 3D Asset Generation | Nov 11, 2024 | Object | —Unverified | 0 |