| RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion | Jun 5, 2025 | Novel View Synthesis, Object | Unverified | 0 |
| Gen-n-Val: Agentic Image Data Generation and Validation | Jun 5, 2025 | Image Harmonization, Instance Segmentation | Unverified | 0 |
| Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Musée d'Archéologie Nationale | Jun 5, 2025 | Diversity, Object | Unverified | 0 |
| Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations | Jun 5, 2025 | 3D Object Reconstruction, Novel View Synthesis | Unverified | 0 |
| CIVET: Systematic Evaluation of Understanding in VLMs | Jun 5, 2025 | Object | Unverified | 0 |
| Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning | Jun 5, 2025 | In-Context Learning, Indoor Scene Synthesis | Unverified | 0 |
| Feature-Based Lie Group Transformer for Real-World Applications | Jun 5, 2025 | Object, Object Recognition | Unverified | 0 |
| From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes | Jun 5, 2025 | 3D visual grounding, Object | Unverified | 0 |
| EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? | Jun 5, 2025 | Object | Unverified | 0 |
| Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning | Jun 4, 2025 | Object, Referring Expression | Unverified | 0 |
| MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection | Jun 4, 2025 | Mamba, Novel Object Detection | Unverified | 0 |
| SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models | Jun 4, 2025 | Object | Unverified | 0 |
| Sounding that Object: Interactive Object-Aware Image to Audio Generation | Jun 4, 2025 | Audio Generation, Image Segmentation | Unverified | 0 |
| Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs | Jun 3, 2025 | Object, Object Rearrangement | Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8k, Object | Unverified | 0 |
| ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment | Jun 3, 2025 | Indoor Scene Synthesis, Object | Unverified | 0 |
| WoMAP: World Models For Embodied Open-Vocabulary Object Localization | Jun 2, 2025 | Active Object Localization, Efficient Exploration | Unverified | 0 |
| unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary Reasoning | Jun 2, 2025 | Image Reconstruction, Object | Code Available | 0 |
| SORCE: Small Object Retrieval in Complex Environments | May 30, 2025 | Benchmarking, Image Retrieval | Code Available | 0 |
| ComposeAnything: Composite Object Priors for Text-to-Image Generation | May 30, 2025 | Denoising, Image Generation | Unverified | 0 |
| InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing | May 30, 2025 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Object Centric Concept Bottlenecks | May 30, 2025 | Decision Making, Object | Unverified | 0 |
| Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames | May 30, 2025 | Object, Spatial Reasoning | Unverified | 0 |
| DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation | May 30, 2025 | Object | Unverified | 0 |
| Conformal Object Detection by Sequential Risk Control | May 29, 2025 | Conformal Prediction, Object | Unverified | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | May 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images | May 29, 2025 | Novel Object Detection, Object | Unverified | 0 |
| Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping | May 29, 2025 | 3D Object Detection, Object | Unverified | 0 |
| MOVi: Training-free Text-conditioned Multi-Object Video Generation | May 29, 2025 | Object, Video Generation | Unverified | 0 |
| FMG-Det: Foundation Model Guided Robust Object Detection | May 29, 2025 | Multiple Instance Learning, Object | Unverified | 0 |
| The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector | May 28, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks | May 27, 2025 | 3D Scene Reconstruction, Diagnostic | Unverified | 0 |
| CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects | May 27, 2025 | Object | Unverified | 0 |
| PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation | May 27, 2025 | Instruction Following, Object | Unverified | 0 |
| Progressive Scaling Visual Object Tracking | May 26, 2025 | Object, Object Tracking | Unverified | 0 |
| Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models | May 26, 2025 | Disentanglement, Hallucination | Code Available | 0 |
| Category-Agnostic Neural Object Rigging | May 26, 2025 | Object | Unverified | 0 |
| NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID | May 26, 2025 | Attribute, Caption Generation | Unverified | 0 |
| MaskedManipulator: Versatile Whole-Body Control for Loco-Manipulation | May 25, 2025 | Human-Object Interaction Detection, Object | Unverified | 0 |
| FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment | May 24, 2025 | Management, Multi-Object Tracking | Unverified | 0 |
| ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | May 24, 2025 | Image Segmentation, Instance Segmentation | Code Available | 0 |
| EOTNet: Deep Memory Aided Bayesian Filter for Extended Object Tracking | May 24, 2025 | Object, Object Tracking | Code Available | 0 |
| SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes | May 24, 2025 | Object | Unverified | 0 |
| Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms | May 23, 2025 | Deep Learning, Object | Unverified | 0 |
| Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | May 23, 2025 | Object, Object Tracking | Unverified | 0 |
| RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | May 23, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds | May 22, 2025 | Object, Semantic Compression | Unverified | 0 |
| TextureSAM: Towards a Texture Aware Foundation Model for Segmentation | May 22, 2025 | Material Classification, Object | Unverified | 0 |
| MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection | May 22, 2025 | Object, object-detection | Unverified | 0 |
| Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance | May 22, 2025 | Object, Object Rearrangement | Unverified | 0 |