SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning

Papers

Showing 101150 of 316 papers

TitleStatusHype
LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction0
Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation0
Learning to Drive Anywhere with Model-Based Reannotation0
Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering0
Decision-based AI Visual Navigation for Cardiac Ultrasounds0
The Composite Visual-Laser Navigation Method Applied in Indoor Poultry Farming Environments0
UAS Visual Navigation in Large and Unseen Environments via a Meta Agent0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better0
ViVa-SAFELAND: a New Freeware for Safe Validation of Vision-based Navigation in Aerial Vehicles0
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach0
A Map-free Deep Learning-based Framework for Gate-to-Gate Monocular Visual Navigation aboard Miniaturized Aerial Vehicles0
High-precision visual navigation device calibration method based on collimator0
Improving Collision-Free Success Rate For Object Goal Visual Navigation Via Two-Stage Training With Collision Prediction0
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation0
VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion0
Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter0
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation0
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI0
FloNa: Floor Plan Guided Embodied Visual Navigation0
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset0
MetaCropFollow: Few-Shot Adaptation with Meta-Learning for Under-Canopy Navigation0
Memory Proxy Maps for Visual Navigation0
Grounding Video Models to Actions through Goal Conditioned Exploration0
Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book CollectionCode0
Flex: End-to-End Text-Instructed Visual Navigation from Foundation Model Features0
RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps0
Fast Object Detection with a Machine Learning Edge Device0
Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion0
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation0
Causality-Aware Transformer Networks for Robotic Navigation0
Addressing the challenges of loop detection in agricultural environmentsCode0
NOLO: Navigate Only Look Once0
IN-Sight: Interactive Navigation through Sight0
Visuospatial navigation without distance, prediction, integration, or maps0
CAMON: Cooperative Agents for Multi-Object Navigation with LLM-based Conversations0
Solving Vision Tasks with Simple Photoreceptors Instead of Cameras0
RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation0
Sim2Real Transfer for Audio-Visual Navigation with Frequency-Adaptive Acoustic Field Prediction0
Motor Focus: Fast Ego-Motion Prediction for Assistive Visual NavigationCode0
TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability0
Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method0
Wild Visual Navigation: Fast Traversability Learning via Pre-Trained Models and Online Self-Supervision0
3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matchingCode0
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy RepresentationCode0
TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual CorruptionsCode0
A Landmark-Aware Visual Navigation Dataset0
Interpretable Brain-Inspired Representations Improve RL Performance on Visual Navigation Tasks0
Feudal Networks for Visual Navigation0
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation0
World-Map Misalignment Detection for Visual Navigation SystemsCode0
Show:102550
← PrevPage 3 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NaviLLMdist_to_end_reduction7.9Unverified
2VLN-PETLdist_to_end_reduction6.13Unverified
3early to beddist_to_end_reduction6.03Unverified
4HAMTdist_to_end_reduction5.58Unverified
5s-agent (NDH-Full)dist_to_end_reduction5.27Unverified
6BabyWalk (r2r-pretrain)dist_to_end_reduction4.46Unverified
7Environment-agnostic Multitask Learningdist_to_end_reduction3.91Unverified
8BabyWalkdist_to_end_reduction3.65Unverified
9Test2-NDHdist_to_end_reduction3.44Unverified
10SCoAdist_to_end_reduction3.37Unverified
#ModelMetricClaimedVerifiedStatus
1SUSAspl0.64Unverified
2Meta-Explorespl0.61Unverified
3NaviLLMspl0.6Unverified
4BEV-BERTspl0.6Unverified
5HOPspl0.59Unverified
6DUETspl0.58Unverified
7VLN-PETLspl0.58Unverified
8VLN-BERTspl0.57Unverified
9Prevalentspl0.51Unverified
10RCM+SIL(no early exploration)spl0.38Unverified
#ModelMetricClaimedVerifiedStatus
1AutoVLNNav-SPL27.83Unverified
2NaviLLMNav-SPL26.26Unverified
3Meta-ExploreNav-SPL25.8Unverified
4SUSANav-SPL25.47Unverified
5DUETNav-SPL21.42Unverified
6GBENav-SPL13.3Unverified
#ModelMetricClaimedVerifiedStatus
1MVV-INSPL (All)17.27Unverified
2SAVNSPL (All)16.15Unverified
#ModelMetricClaimedVerifiedStatus
1PopArt-IMPALAMedium Human-Normalized Score72.8Unverified
#ModelMetricClaimedVerifiedStatus
1Prevalentspl28.72Unverified