| Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection | Apr 26, 2024 | Autonomous DrivingObject | CodeCode Available | 0 |
| Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System | Apr 25, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images | Apr 25, 2024 | Gaussian ProcessesObject | —Unverified | 0 |
| Generating Minimalist Adversarial Perturbations to Test Object-Detection Models: An Adaptive Multi-Metric Evolutionary Search Approach | Apr 25, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images | Apr 25, 2024 | Object | —Unverified | 0 |
| Single-View Scene Point Cloud Human Grasp Generation | Apr 24, 2024 | Grasp GenerationObject | CodeCode Available | 0 |
| OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation | Apr 24, 2024 | 3D ReconstructionObject | CodeCode Available | 0 |
| Learning to Detect Attended Objects in Cultural Sites with Gaze Signals and Weak Object Supervision | Apr 23, 2024 | Objectobject-detection | CodeCode Available | 0 |
| GLoD: Composing Global Contexts and Local Details in Image Generation | Apr 23, 2024 | DenoisingImage Generation | —Unverified | 0 |
| Other Tokens Matter: Exploring Global and Local Features of Vision Transformers for Object Re-Identification | Apr 23, 2024 | Object | —Unverified | 0 |
| Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion | Apr 23, 2024 | AttributeObject | —Unverified | 0 |
| Source-free Domain Adaptation for Video Object Detection Under Adverse Image Conditions | Apr 23, 2024 | Domain AdaptationObject | —Unverified | 0 |
| Deep Models for Multi-View 3D Object Recognition: A Review | Apr 23, 2024 | 3D Classification3D Object Recognition | —Unverified | 0 |
| GeoDiffuser: Geometry-Based Image Editing with Diffusion Models | Apr 22, 2024 | Object | —Unverified | 0 |
| 360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos | Apr 22, 2024 | ObjectObject Tracking | —Unverified | 0 |
| LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions | Apr 21, 2024 | Image GenerationLayout-to-Image Generation | —Unverified | 0 |
| The Framework of a Design Process Language | Apr 21, 2024 | Object | —Unverified | 0 |
| FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Apr 20, 2024 | ARCAutonomous Driving | —Unverified | 0 |
| Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models | Apr 20, 2024 | ObjectSystematic Generalization | —Unverified | 0 |
| Efficient and Concise Explanations for Object Detection with Gaussian-Class Activation Mapping Explainer | Apr 20, 2024 | Objectobject-detection | CodeCode Available | 0 |
| On-board classification of underwater images using hybrid classical-quantum CNN based method | Apr 19, 2024 | Autonomous VehiclesGPU | —Unverified | 0 |
| PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation | Apr 19, 2024 | motion predictionObject | —Unverified | 0 |
| Learning Object Semantic Similarity with Self-Supervision | Apr 19, 2024 | ObjectSemantic Similarity | —Unverified | 0 |
| Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model | Apr 19, 2024 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |
| ECOR: Explainable CLIP for Object Recognition | Apr 19, 2024 | Objectobject-detection | —Unverified | 0 |
| Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Apr 19, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 0 |
| G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Apr 18, 2024 | DenoisingObject | —Unverified | 0 |
| Customizing Text-to-Image Diffusion with Object Viewpoint Control | Apr 18, 2024 | ObjectPrompt Engineering | —Unverified | 0 |
| Food Portion Estimation via 3D Object Scaling | Apr 18, 2024 | Object | CodeCode Available | 0 |
| Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition | Apr 18, 2024 | Action RecognitionFew-Shot action recognition | —Unverified | 0 |
| Inverse Neural Rendering for Explainable Multi-Object Tracking | Apr 18, 2024 | 3D Multi-Object TrackingInverse Rendering | —Unverified | 0 |
| Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Apr 17, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Apr 17, 2024 | Inverse RenderingObject | —Unverified | 0 |
| Object Remover Performance Evaluation Methods using Class-wise Object Removal Images | Apr 17, 2024 | Image InpaintingObject | —Unverified | 0 |
| How to deal with glare for improved perception of Autonomous Vehicles | Apr 17, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Multimodal 3D Object Detection on Unseen Domains | Apr 17, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Detector Collapse: Physical-World Backdooring Object Detection to Catastrophic Overload or Blindness in Autonomous Driving | Apr 17, 2024 | Autonomous DrivingBackdoor Attack | —Unverified | 0 |
| GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement | Apr 17, 2024 | ObjectPose Estimation | —Unverified | 0 |
| OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery | Apr 16, 2024 | Objectobject-detection | —Unverified | 0 |
| Generating Human Interaction Motions in Scenes with Text Control | Apr 16, 2024 | DenoisingHuman-Object Interaction Detection | —Unverified | 0 |
| A Realistic Protocol for Evaluation of Weakly Supervised Object Localization | Apr 15, 2024 | Model SelectionObject | CodeCode Available | 0 |
| Improved Object-Based Style Transfer with Single Deep Network | Apr 15, 2024 | ObjectSegmentation | —Unverified | 0 |
| VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Apr 15, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision | Apr 15, 2024 | ObjectQuestion Answering | —Unverified | 0 |
| Improving Weakly-Supervised Object Localization Using Adversarial Erasing and Pseudo Label | Apr 15, 2024 | ObjectObject Localization | —Unverified | 0 |
| Coreset Selection for Object Detection | Apr 14, 2024 | Diversityimage-classification | —Unverified | 0 |
| Fusion-Mamba for Cross-modality Object Detection | Apr 14, 2024 | MambaObject | —Unverified | 0 |
| LoopAnimate: Loopable Salient Object Animation | Apr 14, 2024 | GPUObject | —Unverified | 0 |
| DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | Apr 14, 2024 | Dense CaptioningLanguage Modelling | —Unverified | 0 |
| BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection | Apr 13, 2024 | Image EnhancementObject | —Unverified | 0 |