| AnyPlace: Learning Generalized Object Placement for Robot Manipulation | Feb 6, 2025 | ObjectPose Prediction | —Unverified | 0 |
| UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection | Feb 6, 2025 | Objectobject-detection | —Unverified | 0 |
| PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models | Feb 6, 2025 | ObjectText-based Image Editing | —Unverified | 0 |
| HD-EPIC: A Highly-Detailed Egocentric Video Dataset | Feb 6, 2025 | Action RecognitionNutrition | —Unverified | 0 |
| Enhancing people localisation in drone imagery for better crowd management by utilising every pixel in high-resolution images | Feb 6, 2025 | Crowd CountingManagement | —Unverified | 0 |
| Disentangling CLIP for Multi-Object Perception | Feb 5, 2025 | DisentanglementImage Classification | —Unverified | 0 |
| ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models | Feb 5, 2025 | Instance SegmentationObject | CodeCode Available | 0 |
| Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration | Feb 4, 2025 | AttributeHallucination | —Unverified | 0 |
| Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling | Feb 4, 2025 | ObjectVisual Prompting | —Unverified | 0 |
| Rethinking Vision Transformer for Object Centric Foundation Models | Feb 4, 2025 | ObjectObject Tracking | CodeCode Available | 0 |
| Can You Move These Over There? An LLM-based VR Mover for Supporting Object Manipulation | Feb 4, 2025 | Object | —Unverified | 0 |
| Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Feb 4, 2025 | Adversarial RobustnessAutonomous Driving | —Unverified | 0 |
| Dynamic object goal pushing with mobile manipulators through model-free constrained reinforcement learning | Feb 3, 2025 | FrictionObject | —Unverified | 0 |
| Mitigating Hallucinations in Large Vision-Language Models with Internal Fact-based Contrastive Decoding | Feb 3, 2025 | AttributeMME | —Unverified | 0 |
| Neural Cellular Automata for Decentralized Sensing using a Soft Inductive Sensor Array for Distributed Manipulator Systems | Feb 3, 2025 | Object | —Unverified | 0 |
| RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning | Feb 2, 2025 | Contrastive LearningImage Generation | —Unverified | 0 |
| Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches | Jan 31, 2025 | Image SegmentationInteractive Segmentation | —Unverified | 0 |
| SpikingRTNH: Spiking Neural Network for 4D Radar Object Detection | Jan 31, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms | Jan 30, 2025 | Objectobject-detection | —Unverified | 0 |
| RUN: Reversible Unfolding Network for Concealed Object Segmentation | Jan 30, 2025 | ObjectSegmentation | —Unverified | 0 |
| Efficient Interactive 3D Multi-Object Removal | Jan 29, 2025 | ObjectScene Understanding | —Unverified | 0 |
| Efficient Feature Fusion for UAV Object Detection | Jan 29, 2025 | Objectobject-detection | CodeCode Available | 0 |
| DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications | Jan 28, 2025 | Objectobject-detection | —Unverified | 0 |
| Objects matter: object-centric world models improve reinforcement learning in visually complex environments | Jan 27, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| 3D Reconstruction of non-visible surfaces of objects from a Single Depth View -- Comparative Study | Jan 27, 2025 | 3D ReconstructionObject | —Unverified | 0 |
| Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification | Jan 26, 2025 | Domain AdaptationObject | CodeCode Available | 0 |
| Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities | Jan 25, 2025 | HallucinationObject | —Unverified | 0 |
| ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | Jan 24, 2025 | DecoderObject | —Unverified | 0 |
| Estimation-theoretic analysis of lensless imaging | Jan 24, 2025 | Object | —Unverified | 0 |
| CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph | Jan 23, 2025 | Object | —Unverified | 0 |
| CSAOT: Cooperative Multi-Agent System for Active Object Tracking | Jan 23, 2025 | Autonomous NavigationDeep Reinforcement Learning | —Unverified | 0 |
| MONA: Moving Object Detection from Videos Shot by Dynamic Camera | Jan 22, 2025 | Moving Object DetectionObject | —Unverified | 0 |
| TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking | Jan 21, 2025 | Autonomous NavigationGPU | —Unverified | 0 |
| Slot-BERT: Self-supervised Object Discovery in Surgical Video | Jan 21, 2025 | DisentanglementDomain Adaptation | —Unverified | 0 |
| Green Video Camouflaged Object Detection | Jan 19, 2025 | Objectobject-detection | —Unverified | 0 |
| FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | Jan 17, 2025 | Bayesian InferenceLanguage Modeling | —Unverified | 0 |
| Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation | Jan 17, 2025 | NeRFObject | CodeCode Available | 0 |
| RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jan 16, 2025 | Autonomous DrivingObject | —Unverified | 0 |
| MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan 16, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages | Jan 14, 2025 | Object | CodeCode Available | 0 |
| Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models | Jan 14, 2025 | Object | —Unverified | 0 |
| SmartEraser: Remove Anything from Images using Masked-Region Guidance | Jan 14, 2025 | Instance SegmentationObject | —Unverified | 0 |
| Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests | Jan 14, 2025 | Defect DetectionObject | CodeCode Available | 0 |
| Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying | Jan 14, 2025 | Objectobject-detection | —Unverified | 0 |
| Detecting Contextual Anomalies by Discovering Consistent Spatial Regions | Jan 14, 2025 | Anomaly DetectionClustering | —Unverified | 0 |
| DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | Jan 14, 2025 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Guided SAM: Label-Efficient Part Segmentation | Jan 13, 2025 | ObjectObject Recognition | —Unverified | 0 |
| VDOR: A Video-based Dataset for Object Removal via Sequence Consistency | Jan 13, 2025 | Image InpaintingObject | —Unverified | 0 |
| SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Jan 13, 2025 | Objectobject-detection | CodeCode Available | 0 |
| Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics | Jan 13, 2025 | Action Recognitionhand-object pose | —Unverified | 0 |