SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning
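
The task definition above can be sketched as an observe-compare-act loop. This is a minimal illustration, not the method of any listed paper. Everything in it is a hypothetical stand-in: `ToyGridEnv` replaces a real simulator, its two-value "image" is a fake camera frame, and the hand-written `policy` takes the place of a learned one. The agent never reads its own coordinates; it only compares its current observation with the target image.

```python
from dataclasses import dataclass

# Hypothetical discrete action set (real agents often use forward/turn actions).
ACTIONS = {"left": (-1, 0), "right": (1, 0), "up": (0, -1), "down": (0, 1)}


@dataclass
class ToyGridEnv:
    """Toy stand-in for a simulator; the 'camera image' is a 2-value tuple."""
    width: int
    height: int
    pos: tuple

    def render(self, pos=None):
        # Fake camera: encode a position as two pixel intensities.
        x, y = self.pos if pos is None else pos
        return (x * 10, y * 10)

    def step(self, action):
        # Apply the action, clamp to the grid, return the new observation.
        dx, dy = ACTIONS[action]
        x = min(max(self.pos[0] + dx, 0), self.width - 1)
        y = min(max(self.pos[1] + dy, 0), self.height - 1)
        self.pos = (x, y)
        return self.render()


def policy(observation, target_image):
    """Hand-written placeholder for a learned policy; it sees images only."""
    if observation[0] < target_image[0]:
        return "right"
    if observation[0] > target_image[0]:
        return "left"
    if observation[1] < target_image[1]:
        return "down"
    return "up"


def navigate(env, target_image, max_steps=50):
    """Act until the current observation matches the target image."""
    observation = env.render()
    actions = []
    for _ in range(max_steps):
        if observation == target_image:
            return True, actions
        action = policy(observation, target_image)
        observation = env.step(action)
        actions.append(action)
    return observation == target_image, actions


env = ToyGridEnv(width=5, height=5, pos=(0, 0))
target_image = env.render(pos=(3, 2))  # image seen from the target position
success, actions = navigate(env, target_image)
print(success, actions)
```

In a real system the exact-match test would typically be replaced by a learned similarity or stop action, since two camera frames from the same place rarely match pixel for pixel.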

Papers

Showing 201-250 of 316 papers

Title | Status | Hype
Simultaneous Navigation and Construction Benchmarking Environments | Code | 1
MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation | - | 0
A Survey of Embodied AI: From Simulators to Research Tasks | - | 0
Learning a State Representation and Navigation in Cluttered and Dynamic Environments | - | 0
A Pose-only Solution to Visual Reconstruction and Navigation | Code | 1
Sequential Place Learning: Heuristic-Free High-Performance Long-Term Place Recognition | Code | 1
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph | Code | 1
Learning for Visual Navigation by Imagining the Success | - | 0
Scene Retrieval for Contextual Visual Mapping | - | 0
Imitation Learning with Human Eye Gaze via Multi-Objective Prediction | Code | 0
End-to-End Egospheric Spatial Memory | Code | 1
Learned Camera Gain and Exposure Control for Improved Visual Feature Detection and Matching | - | 0
A Pipeline for Vision-Based On-Orbit Proximity Operations Using Deep Learning and Synthetic Imagery | - | 0
Visual Graph Memory With Unsupervised Representation for Visual Navigation | Code | 1
Pose Invariant Topological Memory for Visual Navigation | - | 0
Semantic Audio-Visual Navigation | - | 0
A Recurrent Vision-and-Language BERT for Navigation | Code | 1
DeepSeqSLAM: A Trainable CNN+RNN for Joint Global Description and Sequence-based Place Recognition | Code | 1
A Few Shot Adaptation of Visual Navigation Skills to New Observations using Meta-Learning | - | 0
Unsupervised Domain Adaptation for Visual Navigation | - | 0
On Embodied Visual Navigation in Real Environments Through Habitat | Code | 0
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning | Code | 1
SHREC 2020 track: 6D Object Pose Estimation | - | 0
Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments | - | 0
Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning | - | 0
Learning to Set Waypoints for Audio-Visual Navigation | Code | 1
Exploiting Scene-specific Features for Object Goal Navigation | - | 0
Exploring the Impacts from Datasets to Monocular Depth Estimation (MDE) Models with MineNavi | - | 0
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices | Code | 1
Point Cloud Based Reinforcement Learning for Sim-to-Real and Partial Observability in Visual Navigation | - | 0
Learning Object Relation Graph and Tentative Policy for Visual Navigation | Code | 1
Virtual Testbed for Monocular Visual Navigation of Small Unmanned Aircraft Systems | - | 0
Semantic Visual Navigation by Watching YouTube Videos | Code | 1
Explore then Execute: Adapting without Rewards via Factorized Meta-Reinforcement Learning | - | 0
DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction | - | 0
Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | - | 0
Neural Topological SLAM for Visual Navigation | - | 0
VisualEchoes: Spatial Image Representation Learning through Echolocation | - | 0
Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships | - | 0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning | - | 0
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation | - | 0
One-Shot Informed Robotic Visual Search in the Wild | Code | 1
Visual Navigation Among Humans with Optimal Control as a Supervisor | Code | 1
Learning hierarchical relationships for object-goal navigation | Code | 1
Extending Maps with Semantic and Contextual Object Information for Robot Navigation: a Learning-Based Framework using Visual and Depth Cues | Code | 1
Sparse Graphical Memory for Robust Planning | Code | 1
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation | Code | 1
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN) | - | 0
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training | Code | 1
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NaviLLM | dist_to_end_reduction | 7.9 | - | Unverified
2 | VLN-PETL | dist_to_end_reduction | 6.13 | - | Unverified
3 | early to bed | dist_to_end_reduction | 6.03 | - | Unverified
4 | HAMT | dist_to_end_reduction | 5.58 | - | Unverified
5 | s-agent (NDH-Full) | dist_to_end_reduction | 5.27 | - | Unverified
6 | BabyWalk (r2r-pretrain) | dist_to_end_reduction | 4.46 | - | Unverified
7 | Environment-agnostic Multitask Learning | dist_to_end_reduction | 3.91 | - | Unverified
8 | BabyWalk | dist_to_end_reduction | 3.65 | - | Unverified
9 | Test2-NDH | dist_to_end_reduction | 3.44 | - | Unverified
10 | SCoA | dist_to_end_reduction | 3.37 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUSA | spl | 0.64 | - | Unverified
2 | Meta-Explore | spl | 0.61 | - | Unverified
3 | NaviLLM | spl | 0.6 | - | Unverified
4 | BEV-BERT | spl | 0.6 | - | Unverified
5 | HOP | spl | 0.59 | - | Unverified
6 | DUET | spl | 0.58 | - | Unverified
7 | VLN-PETL | spl | 0.58 | - | Unverified
8 | VLN-BERT | spl | 0.57 | - | Unverified
9 | Prevalent | spl | 0.51 | - | Unverified
10 | RCM+SIL (no early exploration) | spl | 0.38 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AutoVLN | Nav-SPL | 27.83 | - | Unverified
2 | NaviLLM | Nav-SPL | 26.26 | - | Unverified
3 | Meta-Explore | Nav-SPL | 25.8 | - | Unverified
4 | SUSA | Nav-SPL | 25.47 | - | Unverified
5 | DUET | Nav-SPL | 21.42 | - | Unverified
6 | GBE | Nav-SPL | 13.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MVV-IN | SPL (All) | 17.27 | - | Unverified
2 | SAVN | SPL (All) | 16.15 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PopArt-IMPALA | Medium Human-Normalized Score | 72.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Prevalent | spl | 28.72 | - | Unverified