| Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds | May 22, 2025 | ObjectSemantic Compression | —Unverified | 0 |
| TextureSAM: Towards a Texture Aware Foundation Model for Segmentation | May 22, 2025 | Material ClassificationObject | —Unverified | 0 |
| MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection | May 22, 2025 | Objectobject-detection | —Unverified | 0 |
| gen2seg: Generative Models Enable Generalizable Instance Segmentation | May 21, 2025 | DecoderInstance Segmentation | —Unverified | 0 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPUNatural Language Queries | —Unverified | 0 |
| Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation | May 21, 2025 | ObjectPose Estimation | —Unverified | 0 |
| Multispectral Detection Transformer with Infrared-Centric Sensor Fusion | May 21, 2025 | Multispectral Object DetectionObject | CodeCode Available | 0 |
| Expanding Zero-Shot Object Counting with Rich Prompts | May 21, 2025 | ObjectObject Counting | —Unverified | 0 |
| OPA-Pack: Object-Property-Aware Robotic Bin Packing | May 19, 2025 | ObjectQ-Learning | —Unverified | 0 |
| LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking | May 19, 2025 | Multi-Object TrackingMultiple Object Tracking | —Unverified | 0 |
| Optimizing Retrieval Augmented Generation for Object Constraint Language | May 19, 2025 | Large Language ModelObject | —Unverified | 0 |
| Emergent Active Perception and Dexterity of Simulated Humanoids from Visual Reinforcement Learning | May 18, 2025 | Object | —Unverified | 0 |
| GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity | May 17, 2025 | 3D ReconstructionObject | —Unverified | 0 |
| PARSEC: Preference Adaptation for Robotic Object Rearrangement from Scene Context | May 16, 2025 | ObjectObject Rearrangement | CodeCode Available | 0 |
| Feasibility with Language Models for Open-World Compositional Zero-Shot Learning | May 16, 2025 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| AW-GATCN: Adaptive Weighted Graph Attention Convolutional Network for Event Camera Data Joint Denoising and Object Recognition | May 16, 2025 | DenoisingEvent Segmentation | —Unverified | 0 |
| RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects | May 16, 2025 | 6D Pose EstimationObject | —Unverified | 0 |
| A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation | May 16, 2025 | Objectobject-detection | —Unverified | 0 |
| MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence | May 15, 2025 | AttributeObject | —Unverified | 0 |
| MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | May 14, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object Shapes | May 14, 2025 | 3D Semantic Scene CompletionAutonomous Driving | CodeCode Available | 0 |
| ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation | May 14, 2025 | BenchmarkingDeformable Object Manipulation | —Unverified | 0 |
| Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | May 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | May 13, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective | May 13, 2025 | Computational EfficiencyObject | CodeCode Available | 0 |
| Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position Predictivity | May 13, 2025 | Contrastive LearningObject | CodeCode Available | 0 |
| Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores | May 13, 2025 | Objectobject-detection | —Unverified | 0 |
| Leveraging Multi-Modal Information to Enhance Dataset Distillation | May 13, 2025 | Dataset DistillationObject | —Unverified | 0 |
| Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology | May 13, 2025 | DenoisingObject | —Unverified | 0 |
| Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | May 12, 2025 | Event DetectionObject | —Unverified | 0 |
| Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking | May 12, 2025 | 3D Multi-Object TrackingMulti-Object Tracking | —Unverified | 0 |
| Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search | May 10, 2025 | Neural Architecture SearchObject | —Unverified | 0 |
| METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection | May 10, 2025 | Objectobject-detection | CodeCode Available | 0 |
| PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting | May 8, 2025 | Autonomous VehiclesFlare Removal | —Unverified | 0 |
| Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling | May 8, 2025 | feature selectionObject | CodeCode Available | 0 |
| MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models | May 8, 2025 | AttributeImage Manipulation | —Unverified | 0 |
| Visual Affordances: Enabling Robots to Understand Object Functionality | May 8, 2025 | ObjectPrediction | —Unverified | 0 |
| An Edge AI Solution for Space Object Detection | May 8, 2025 | Deep LearningObject | —Unverified | 0 |
| CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion | May 7, 2025 | DenoisingImage Generation | —Unverified | 0 |
| AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding | May 7, 2025 | 3D visual groundingGraph Attention | CodeCode Available | 0 |
| Low Resolution Next Best View for Robot Packing | May 7, 2025 | 3D ReconstructionObject | —Unverified | 0 |
| One2Any: One-Reference 6D Pose Estimation for Any Object | May 7, 2025 | 6D Pose Estimation6D Pose Estimation using RGB | —Unverified | 0 |
| Web2Grasp: Learning Functional Grasps from Web Images of Hand-Object Interactions | May 7, 2025 | Object | —Unverified | 0 |
| Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models | May 6, 2025 | ObjectPosition | —Unverified | 0 |
| Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning | May 6, 2025 | counterfactualObject | —Unverified | 0 |
| EOPose : Exemplar-based object reposing using Generalized Pose Correspondences | May 6, 2025 | ObjectSSIM | —Unverified | 0 |
| Sim2Real Transfer for Vision-Based Grasp Verification | May 5, 2025 | Objectobject-detection | CodeCode Available | 0 |
| Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning | May 4, 2025 | ClusteringDecoder | —Unverified | 0 |
| Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes | May 3, 2025 | ObjectSegmentation | —Unverified | 0 |
| RESAnything: Attribute Prompting for Arbitrary Referring Segmentation | May 3, 2025 | AttributeImage Segmentation | —Unverified | 0 |