LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction | Jun 16, 2025 | Instruction Following, Vision-Language-Action
Unverified (0) | Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation | Jun 4, 2025 | Collision Avoidance, Visual Navigation
Unverified (0) | Visual Planning: Let's Think Only with Images | May 16, 2025 | reinforcement-learning, Reinforcement Learning
Code Available (3) | Learning to Drive Anywhere with Model-Based Reannotation | May 8, 2025 | Navigate, Visual Navigation
Unverified (0) | Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy | Apr 25, 2025 | Visual Navigation
Code Available (1) | Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | Apr 19, 2025 | Benchmarking, Dataset Generation
Unverified (0) | Decision-based AI Visual Navigation for Cardiac Ultrasounds | Apr 16, 2025 | Binary Classification, Visual Navigation
Unverified (0) | Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models | Apr 14, 2025 | Action Generation, Denoising
Code Available (2) | NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation | Apr 14, 2025 | Visual Navigation
Code Available (2) | The Composite Visual-Laser Navigation Method Applied in Indoor Poultry Farming Environments | Apr 11, 2025 | Visual Navigation
Unverified (0) | UAS Visual Navigation in Large and Unseen Environments via a Meta Agent | Mar 20, 2025 | Incremental Learning, Meta Reinforcement Learning
Unverified (0) | Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better | Mar 19, 2025 | Attribute, Reinforcement Learning (RL)
Unverified (0) | ViVa-SAFELAND: a New Freeware for Safe Validation of Vision-based Navigation in Aerial Vehicles | Mar 18, 2025 | Navigate, Visual Navigation
Unverified (0) | Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | Navigate, Sequential Decision Making
Unverified (0) | A Map-free Deep Learning-based Framework for Gate-to-Gate Monocular Visual Navigation aboard Miniaturized Aerial Vehicles | Mar 7, 2025 | Navigate, Visual Navigation
Unverified (0) | EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training | Feb 26, 2025 | Mamba, Representation Learning
Code Available (1) | High-precision visual navigation device calibration method based on collimator | Feb 25, 2025 | Camera Calibration, Visual Navigation
Unverified (0) | Improving Collision-Free Success Rate For Object Goal Visual Navigation Via Two-Stage Training With Collision Prediction | Feb 19, 2025 | Collision Avoidance, Deep Reinforcement Learning
Unverified (0) | RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation | Feb 4, 2025 | Drone navigation, Reinforcement Learning (RL)
Unverified (0) | Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter | Feb 3, 2025 | Pose Estimation, Simultaneous Localization and Mapping
Unverified (0) | VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion | Feb 3, 2025 | 3DGS, reinforcement-learning
Unverified (0) | Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation | Jan 12, 2025 | 3D Scene Reconstruction, Visual Navigation
Unverified (0) | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Dec 30, 2024 | Benchmarking, Reinforcement Learning (RL)
Unverified (0) | FloNa: Floor Plan Guided Embodied Visual Navigation | Dec 24, 2024 | Navigate, Visual Navigation
Unverified (0) | Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset | Dec 18, 2024 | Pedestrian Detection, Scene Understanding
Unverified (0) | Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation | Dec 9, 2024 | Object Localization, Vision and Language Navigation
Code Available (1) | SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts | Dec 7, 2024 | General Knowledge, Mixture-of-Experts
Code Available (1) | Navigation World Models | Dec 4, 2024 | Robot Navigation, Video Generation
Code Available (4) | LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences | Dec 2, 2024 | Embodied Question Answering, Question Answering
Code Available (2) | CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos | Nov 26, 2024 | Common Sense Reasoning, Imitation Learning
Code Available (3) | MetaCropFollow: Few-Shot Adaptation with Meta-Learning for Under-Canopy Navigation | Nov 21, 2024 | Meta-Learning, Visual Navigation
Unverified (0) | Memory Proxy Maps for Visual Navigation | Nov 15, 2024 | Navigate, Visual Navigation
Unverified (0) | Grounding Video Models to Actions through Goal Conditioned Exploration | Nov 11, 2024 | Action Generation, Visual Navigation
Unverified (0) | Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Oct 23, 2024 | Object, Visual Navigation
Code Available (1) | Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection | Oct 19, 2024 | Classification, image-classification
Code Available (0) | Flex: End-to-End Text-Instructed Visual Navigation from Foundation Model Features | Oct 16, 2024 | Visual Navigation
Unverified (0) | RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | Oct 8, 2024 | Visual Localization, Visual Navigation
Unverified (0) | Fast Object Detection with a Machine Learning Edge Device | Oct 5, 2024 | Autonomous Navigation, CPU
Unverified (0) | Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion | Sep 24, 2024 | Motion Estimation, Simultaneous Localization and Mapping
Unverified (0) | HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation | Sep 22, 2024 | Navigate, Visual Navigation
Unverified (0) | Causality-Aware Transformer Networks for Robotic Navigation | Sep 4, 2024 | Visual Navigation
Unverified (0) | Addressing the challenges of loop detection in agricultural environments | Aug 28, 2024 | Pose Estimation, Visual Navigation
Code Available (0) | NOLO: Navigate Only Look Once | Aug 2, 2024 | In-Context Learning, Navigate
Unverified (0) | IN-Sight: Interactive Navigation through Sight | Aug 1, 2024 | Benchmarking, Navigate
Unverified (0) | Visuospatial navigation without distance, prediction, integration, or maps | Jul 18, 2024 | Decision Making, Navigate
Unverified (0) | CAMON: Cooperative Agents for Multi-Object Navigation with LLM-based Conversations | Jun 30, 2024 | Visual Navigation
Unverified (0) | Solving Vision Tasks with Simple Photoreceptors Instead of Cameras | Jun 17, 2024 | continuous-control, Continuous Control
Unverified (0) | SPIN: Spacecraft Imagery for Navigation | Jun 11, 2024 | Data Augmentation, Image Generation
Code Available (1) | RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | May 9, 2024 | Natural Language Queries, Robot Navigation
Unverified (0) | Sim2Real Transfer for Audio-Visual Navigation with Frequency-Adaptive Acoustic Field Prediction | May 5, 2024 | Data Augmentation, Navigate
Unverified (0)