SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning

Papers

Showing 101150 of 316 papers

TitleStatusHype
Drone Path-Following in GPS-Denied Environments using Convolutional NetworksCode0
Addressing the challenges of loop detection in agricultural environmentsCode0
Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship NetworkCode0
World-Map Misalignment Detection for Visual Navigation SystemsCode0
What you see is what you get: Experience ranking with deep neural dataset-to-dataset similarity for topological localisationCode0
3D Visual Perception for Self-Driving Cars using a Multi-Camera System: Calibration, Mapping, Localization, and Obstacle DetectionCode0
Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book CollectionCode0
Visual Pre-training for Navigation: What Can We Learn from Noise?Code0
NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected ObservationsCode0
Visual Representations for Semantic Target Driven NavigationCode0
Contrastive Learning for Image Registration in Visual Teach and Repeat NavigationCode0
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language NavigationCode0
Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation LearningCode0
Vision-based Navigation Using Deep Reinforcement LearningCode0
Good Time to Ask: A Learning Framework for Asking for Help in Embodied Visual NavigationCode0
Imitation Learning with Human Eye Gaze via Multi-Objective PredictionCode0
A Hybrid Compact Neural Architecture for Visual Place RecognitionCode0
Towards Disturbance-Free Visual Mobile ManipulationCode0
The Regretful Agent: Heuristic-Aided Navigation through Progress EstimationCode0
The Regretful Navigation Agent for Vision-and-Language NavigationCode0
Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement LearningCode0
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual NavigationCode0
Bayesian Relational Memory for Semantic Visual NavigationCode0
SeanNet: Semantic Understanding Network for Localization Under Object DynamicsCode0
Scaling and Benchmarking Self-Supervised Visual Representation LearningCode0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual NavigationCode0
On the Performance of ConvNet Features for Place RecognitionCode0
See What the Robot Can't See: Learning Cooperative Perception for Visual NavigationCode0
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy RepresentationCode0
On Embodied Visual Navigation in Real Environments Through HabitatCode0
3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matchingCode0
Motor Focus: Fast Ego-Motion Prediction for Assistive Visual NavigationCode0
RARA: Zero-shot Sim2Real Visual Navigation with Following Foreground CuesCode0
TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual CorruptionsCode0
Learning Efficient Multi-Agent Cooperative Visual Exploration0
ELBA: Learning by Asking for Embodied Visual Navigation and Task Completion0
Learning a State Representation and Navigation in Cluttered and Dynamic Environments0
Learning and Planning with a Semantic Model0
DRISHTI: Visual Navigation Assistant for Visually Impaired0
Learned Visual Navigation for Under-Canopy Agricultural Robots0
Learned Camera Gain and Exposure Control for Improved Visual Feature Detection and Matching0
Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation0
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation0
Invariance is Key to Generalization: Examining the Role of Representation in Sim-to-Real Transfer for Visual Navigation0
Interpretable Brain-Inspired Representations Improve RL Performance on Visual Navigation Tasks0
Integrating Symmetry into Differentiable Planning with Steerable Convolutions0
DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction0
Instance-Specific Image Goal Navigation: Training Embodied Agents to Find Object Instances0
Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion0
Deep Learning for Visual Navigation of Underwater Robots0
Show:102550
← PrevPage 3 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NaviLLMdist_to_end_reduction7.9Unverified
2VLN-PETLdist_to_end_reduction6.13Unverified
3early to beddist_to_end_reduction6.03Unverified
4HAMTdist_to_end_reduction5.58Unverified
5s-agent (NDH-Full)dist_to_end_reduction5.27Unverified
6BabyWalk (r2r-pretrain)dist_to_end_reduction4.46Unverified
7Environment-agnostic Multitask Learningdist_to_end_reduction3.91Unverified
8BabyWalkdist_to_end_reduction3.65Unverified
9Test2-NDHdist_to_end_reduction3.44Unverified
10SCoAdist_to_end_reduction3.37Unverified
#ModelMetricClaimedVerifiedStatus
1SUSAspl0.64Unverified
2Meta-Explorespl0.61Unverified
3NaviLLMspl0.6Unverified
4BEV-BERTspl0.6Unverified
5HOPspl0.59Unverified
6DUETspl0.58Unverified
7VLN-PETLspl0.58Unverified
8VLN-BERTspl0.57Unverified
9Prevalentspl0.51Unverified
10RCM+SIL(no early exploration)spl0.38Unverified
#ModelMetricClaimedVerifiedStatus
1AutoVLNNav-SPL27.83Unverified
2NaviLLMNav-SPL26.26Unverified
3Meta-ExploreNav-SPL25.8Unverified
4SUSANav-SPL25.47Unverified
5DUETNav-SPL21.42Unverified
6GBENav-SPL13.3Unverified
#ModelMetricClaimedVerifiedStatus
1MVV-INSPL (All)17.27Unverified
2SAVNSPL (All)16.15Unverified
#ModelMetricClaimedVerifiedStatus
1PopArt-IMPALAMedium Human-Normalized Score72.8Unverified
#ModelMetricClaimedVerifiedStatus
1Prevalentspl28.72Unverified