SOTAVerified

Visual Navigation

Visual Navigation is the problem of steering an agent, such as a mobile robot, through an environment using camera input only. The agent is given a target image (the view it would see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions chosen from its camera observations alone.

Source: Vision-based Navigation Using Deep Reinforcement Learning
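
The loop described above (observe, compare with the target image, act, repeat) can be illustrated with a minimal sketch. Everything in it is a hypothetical stand-in rather than anything from the cited paper: `GridEnv` is a toy grid world whose "image" is simply the agent's pose, the three-action set is assumed, and `greedy_policy` is a hand-written placeholder for what would normally be a learned policy operating on pixels.

```python
from dataclasses import dataclass

# Hypothetical discrete action set; real agents may use different actions.
ACTIONS = ("forward", "turn_left", "turn_right")
# Unit steps for headings 0=east, 1=north, 2=west, 3=south.
MOVES = [(1, 0), (0, 1), (-1, 0), (0, -1)]


@dataclass
class GridEnv:
    """Toy stand-in for a simulator: the 'image' is just (position, heading)."""
    size: int = 5
    pos: tuple = (0, 0)
    heading: int = 0

    def observe(self):
        # A real agent would receive camera pixels here.
        return (self.pos, self.heading)

    def step(self, action):
        if action == "turn_left":
            self.heading = (self.heading + 1) % 4
        elif action == "turn_right":
            self.heading = (self.heading - 1) % 4
        elif action == "forward":
            dx, dy = MOVES[self.heading]
            # Clamp to the grid so the agent cannot leave the environment.
            x = min(max(self.pos[0] + dx, 0), self.size - 1)
            y = min(max(self.pos[1] + dy, 0), self.size - 1)
            self.pos = (x, y)
        return self.observe()


def greedy_policy(obs, target):
    """Placeholder policy: align x first, then y, then rotate to the target view."""
    (x, y), heading = obs
    (tx, ty), _ = target
    if (x, y) == (tx, ty):
        return "turn_left"  # right place, wrong view: keep rotating
    if x != tx:
        want = 0 if tx > x else 2
    else:
        want = 1 if ty > y else 3
    return "forward" if heading == want else "turn_left"


def navigate(env, target, policy, max_steps=100):
    """Act until the current observation matches the target observation."""
    obs = env.observe()
    for t in range(max_steps):
        if obs == target:
            return True, t
        obs = env.step(policy(obs, target))
    return obs == target, max_steps


# Target "image": the observation the agent would get at (3, 2) facing north.
print(navigate(GridEnv(), ((3, 2), 1), greedy_policy))  # (True, 6)
```

In a realistic setting the equality test `obs == target` would be replaced by a learned stopping criterion or a similarity measure between images, since two camera frames are rarely pixel-identical.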

Papers

Showing 251-300 of 316 papers

Title | Status | Hype
Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning | - | 0
Exploiting Scene-specific Features for Object Goal Navigation | - | 0
Exploring the Impacts from Datasets to Monocular Depth Estimation (MDE) Models with MineNavi | - | 0
Point Cloud Based Reinforcement Learning for Sim-to-Real and Partial Observability in Visual Navigation | - | 0
Virtual Testbed for Monocular Visual Navigation of Small Unmanned Aircraft Systems | - | 0
Explore then Execute: Adapting without Rewards via Factorized Meta-Reinforcement Learning | - | 0
DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction | - | 0
Neural Topological SLAM for Visual Navigation | - | 0
Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | - | 0
VisualEchoes: Spatial Image Representation Learning through Echolocation | - | 0
Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships | - | 0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning | - | 0
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation | - | 0
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN) | - | 0
Perception and Navigation in Autonomous Systems in the Era of Learning: A Survey | - | 0
Improving the Generalization of Visual Navigation Policies using Invariance Regularization | - | 0
Generating Robust Supervision for Learning-Based Visual Navigation Using Hamilton-Jacobi Reachability | - | 0
Meta Adaptation using Importance Weighted Demonstrations | - | 0
Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation | - | 0
A Hybrid Compact Neural Architecture for Visual Place Recognition | Code | 0
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory | - | 0
Zero-shot Imitation Learning from Demonstrations for Legged Robot Visual Navigation | - | 0
Learning Your Way Without Map or Compass: Panoramic Target Driven Visual Navigation | - | 0
Bayesian Relational Memory for Semantic Visual Navigation | Code | 0
Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning | Code | 0
Improving Visual Feature Extraction in Glacial Environments | - | 0
Situational Fusion of Visual Representation for Visual Navigation | - | 0
Vision-based Navigation Using Deep Reinforcement Learning | Code | 0
NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations | Code | 0
Universal Successor Features Based Deep Reinforcement Learning for Navigation | - | 0
Adaptive Navigation Scheme for Optimal Deep-Sea Localization Using Multimodal Perception Cues | - | 0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation | Code | 0
Shifting the Baseline: Single Modality Performance on Visual Navigation & QA | - | 0
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation | Code | 0
Graph Attention Memory for Visual Navigation | - | 0
Drone Path-Following in GPS-Denied Environments using Convolutional Networks | Code | 0
Scaling and Benchmarking Self-Supervised Visual Representation Learning | Code | 0
Cross-Task Knowledge Transfer for Visually-Grounded Navigation | - | 0
Perceptual Attention-based Predictive Control | - | 0
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks | - | 0
Combining Optimal Control and Learning for Visual Navigation in Novel Environments | - | 0
The Regretful Navigation Agent for Vision-and-Language Navigation | Code | 0
The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation | Code | 0
A Behavioral Approach to Visual Navigation with Graph Localization Networks | - | 0
MRS-VPR: a multi-resolution sampling based global visual place recognition method | - | 0
Embodied Multimodal Multitask Learning | - | 0
Learning On-Road Visual Control for Self-Driving Vehicles with Auxiliary Tasks | - | 0
Target Driven Visual Navigation with Hybrid Asynchronous Universal Successor Representations | - | 0
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation | - | 0
Object-oriented Targets for Visual Navigation using Rich Semantic Representations | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NaviLLM | dist_to_end_reduction | 7.9 | - | Unverified
2 | VLN-PETL | dist_to_end_reduction | 6.13 | - | Unverified
3 | early to bed | dist_to_end_reduction | 6.03 | - | Unverified
4 | HAMT | dist_to_end_reduction | 5.58 | - | Unverified
5 | s-agent (NDH-Full) | dist_to_end_reduction | 5.27 | - | Unverified
6 | BabyWalk (r2r-pretrain) | dist_to_end_reduction | 4.46 | - | Unverified
7 | Environment-agnostic Multitask Learning | dist_to_end_reduction | 3.91 | - | Unverified
8 | BabyWalk | dist_to_end_reduction | 3.65 | - | Unverified
9 | Test2-NDH | dist_to_end_reduction | 3.44 | - | Unverified
10 | SCoA | dist_to_end_reduction | 3.37 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUSA | spl | 0.64 | - | Unverified
2 | Meta-Explore | spl | 0.61 | - | Unverified
3 | NaviLLM | spl | 0.6 | - | Unverified
4 | BEV-BERT | spl | 0.6 | - | Unverified
5 | HOP | spl | 0.59 | - | Unverified
6 | DUET | spl | 0.58 | - | Unverified
7 | VLN-PETL | spl | 0.58 | - | Unverified
8 | VLN-BERT | spl | 0.57 | - | Unverified
9 | Prevalent | spl | 0.51 | - | Unverified
10 | RCM+SIL(no early exploration) | spl | 0.38 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AutoVLN | Nav-SPL | 27.83 | - | Unverified
2 | NaviLLM | Nav-SPL | 26.26 | - | Unverified
3 | Meta-Explore | Nav-SPL | 25.8 | - | Unverified
4 | SUSA | Nav-SPL | 25.47 | - | Unverified
5 | DUET | Nav-SPL | 21.42 | - | Unverified
6 | GBE | Nav-SPL | 13.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MVV-IN | SPL (All) | 17.27 | - | Unverified
2 | SAVN | SPL (All) | 16.15 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PopArt-IMPALA | Medium Human-Normalized Score | 72.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Prevalent | spl | 28.72 | - | Unverified
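
Several of the metrics above (spl, Nav-SPL, SPL (All)) appear to be variants of Success weighted by Path Length, commonly defined as the mean over episodes of S_i * l_i / max(p_i, l_i), where S_i is a binary success flag, l_i the shortest-path length to the goal, and p_i the length of the path the agent actually took. The page does not say how each benchmark computes its score, so the following is only a sketch of that common definition; the function name and argument layout are hypothetical. The mix of values near 0.6 and near 27 suggests that some leaderboards report a fraction and others a percentage.

```python
def spl(successes, shortest_lengths, path_lengths):
    """Success weighted by Path Length, averaged over episodes.

    successes:        iterable of 0/1 (or bool) success flags
    shortest_lengths: shortest-path length from start to goal, per episode
    path_lengths:     length of the path the agent actually travelled
    """
    episodes = list(zip(successes, shortest_lengths, path_lengths))
    if not episodes:
        raise ValueError("at least one episode is required")
    total = 0.0
    for success, shortest, taken in episodes:
        denom = max(taken, shortest)
        # If start and goal coincide and the agent did not move, count the
        # path as perfectly efficient instead of dividing by zero.
        efficiency = 1.0 if denom == 0 else shortest / denom
        total += float(success) * efficiency
    return total / len(episodes)


# Three episodes: an optimal success, a failure, and a success that took
# twice the shortest path.
score = spl([1, 0, 1], [10.0, 5.0, 4.0], [10.0, 7.0, 8.0])
print(score)         # 0.5 as a fraction
print(100 * score)   # 50.0 as a percentage
```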