| LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions | Apr 21, 2024 | Image GenerationLayout-to-Image Generation | —Unverified | 0 |
| FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving | Apr 20, 2024 | ARCAutonomous Driving | —Unverified | 0 |
| Augmented Object Intelligence with XR-Objects | Apr 20, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| Efficient and Concise Explanations for Object Detection with Gaussian-Class Activation Mapping Explainer | Apr 20, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models | Apr 20, 2024 | ObjectSystematic Generalization | —Unverified | 0 |
| Learning Object Semantic Similarity with Self-Supervision | Apr 19, 2024 | ObjectSemantic Similarity | —Unverified | 0 |
| On-board classification of underwater images using hybrid classical-quantum CNN based method | Apr 19, 2024 | Autonomous VehiclesGPU | —Unverified | 0 |
| MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model | Apr 19, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| ECOR: Explainable CLIP for Object Recognition | Apr 19, 2024 | Objectobject-detection | —Unverified | 0 |
| Language-Driven Active Learning for Diverse Open-Set 3D Object Detection | Apr 19, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 0 |
| Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model | Apr 19, 2024 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |
| PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation | Apr 19, 2024 | motion predictionObject | —Unverified | 0 |
| Food Portion Estimation via 3D Object Scaling | Apr 18, 2024 | Object | CodeCode Available | 0 |
| Moving Object Segmentation: All You Need Is SAM (and Flow) | Apr 18, 2024 | AllMotion Segmentation | CodeCode Available | 3 |
| The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models | Apr 18, 2024 | Instance SegmentationObject | CodeCode Available | 1 |
| Customizing Text-to-Image Diffusion with Object Viewpoint Control | Apr 18, 2024 | ObjectPrompt Engineering | —Unverified | 0 |
| G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis | Apr 18, 2024 | DenoisingObject | —Unverified | 0 |
| Simultaneous Detection and Interaction Reasoning for Object-Centric Action Recognition | Apr 18, 2024 | Action RecognitionFew-Shot action recognition | —Unverified | 0 |
| Inverse Neural Rendering for Explainable Multi-Object Tracking | Apr 18, 2024 | 3D Multi-Object TrackingInverse Rendering | —Unverified | 0 |
| Multimodal 3D Object Detection on Unseen Domains | Apr 17, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Apr 17, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Apr 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Object Remover Performance Evaluation Methods using Class-wise Object Removal Images | Apr 17, 2024 | Image InpaintingObject | —Unverified | 0 |
| GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement | Apr 17, 2024 | ObjectPose Estimation | —Unverified | 0 |
| IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Apr 17, 2024 | Inverse RenderingObject | —Unverified | 0 |
| Detector Collapse: Physical-World Backdooring Object Detection to Catastrophic Overload or Blindness in Autonomous Driving | Apr 17, 2024 | Autonomous DrivingBackdoor Attack | —Unverified | 0 |
| How to deal with glare for improved perception of Autonomous Vehicles | Apr 17, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Generating Human Interaction Motions in Scenes with Text Control | Apr 16, 2024 | DenoisingHuman-Object Interaction Detection | —Unverified | 0 |
| OSR-ViT: A Simple and Modular Framework for Open-Set Object Detection and Discovery | Apr 16, 2024 | Objectobject-detection | —Unverified | 0 |
| Salient Object-Aware Background Generation using Text-Guided Diffusion Models | Apr 15, 2024 | Object | CodeCode Available | 2 |
| A Realistic Protocol for Evaluation of Weakly Supervised Object Localization | Apr 15, 2024 | Model SelectionObject | CodeCode Available | 0 |
| HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision | Apr 15, 2024 | ObjectQuestion Answering | —Unverified | 0 |
| Improved Object-Based Style Transfer with Single Deep Network | Apr 15, 2024 | ObjectSegmentation | —Unverified | 0 |
| Improving Weakly-Supervised Object Localization Using Adversarial Erasing and Pseudo Label | Apr 15, 2024 | ObjectObject Localization | —Unverified | 0 |
| VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection | Apr 15, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| LoopAnimate: Loopable Salient Object Animation | Apr 14, 2024 | GPUObject | —Unverified | 0 |
| Fusion-Mamba for Cross-modality Object Detection | Apr 14, 2024 | MambaObject | —Unverified | 0 |
| Coreset Selection for Object Detection | Apr 14, 2024 | Diversityimage-classification | —Unverified | 0 |
| DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | Apr 14, 2024 | Dense CaptioningLanguage Modelling | —Unverified | 0 |
| BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection | Apr 13, 2024 | Image EnhancementObject | —Unverified | 0 |
| Into the Fog: Evaluating Robustness of Multiple Object Tracking | Apr 12, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 0 |
| Overcoming Scene Context Constraints for Object Detection in wild using Defilters | Apr 12, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding | Apr 12, 2024 | DecoderImage Segmentation | CodeCode Available | 0 |
| Adapting the Segment Anything Model During Usage in Novel Situations | Apr 12, 2024 | Interactive SegmentationObject | —Unverified | 0 |
| Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation | Apr 12, 2024 | Objectobject-detection | CodeCode Available | 1 |
| TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability | Apr 12, 2024 | Deep Reinforcement LearningObject | —Unverified | 0 |
| IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic | Apr 12, 2024 | ObjectObject Localization | CodeCode Available | 0 |
| Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models | Apr 11, 2024 | AttributeObject | CodeCode Available | 1 |
| SFSORT: Scene Features-based Simple Online Real-Time Tracker | Apr 11, 2024 | CPUMulti-Object Tracking | CodeCode Available | 2 |
| Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing | Apr 11, 2024 | Model CompressionObject | —Unverified | 0 |