SOTAVerified

Visual Navigation

Visual navigation is the problem of steering an agent, e.g. a mobile robot, through an environment using camera input only. The agent is given a target image (the view it would see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions chosen from its camera observations alone.

Source: Vision-based Navigation Using Deep Reinforcement Learning
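The task definition above can be sketched as a short control loop. This is only a toy illustration under loud assumptions: `ToyGridEnv`, its one-pixel "camera image", the action names, and the hand-written `policy` are all hypothetical stand-ins, not taken from the cited paper, where the policy would typically be learned from real camera frames.

```python
import numpy as np

# Hypothetical discrete action set: name -> (dx, dy) on a grid.
ACTIONS = {"forward": (0, 1), "back": (0, -1), "left": (-1, 0), "right": (1, 0)}


class ToyGridEnv:
    """Hypothetical grid world whose 'camera image' marks the agent's cell."""

    def __init__(self, size=5, start=(0, 0)):
        self.size = size
        self.pos = start  # (x, y)

    def render(self, pos=None):
        # Return the image seen from `pos` (default: the current position).
        x, y = self.pos if pos is None else pos
        img = np.zeros((self.size, self.size), dtype=np.float32)
        img[y, x] = 1.0
        return img

    def step(self, action):
        # Apply an action, clamp to the grid, and return the new observation.
        dx, dy = ACTIONS[action]
        x = min(max(self.pos[0] + dx, 0), self.size - 1)
        y = min(max(self.pos[1] + dy, 0), self.size - 1)
        self.pos = (x, y)
        return self.render()


def policy(observation, target_image):
    """Stand-in for a learned policy: sees only the two images, never env.pos."""
    y, x = np.unravel_index(np.argmax(observation), observation.shape)
    ty, tx = np.unravel_index(np.argmax(target_image), target_image.shape)
    if x < tx:
        return "right"
    if x > tx:
        return "left"
    if y < ty:
        return "forward"
    if y > ty:
        return "back"
    return "stop"


def navigate(env, target_image, max_steps=50):
    """Roll out the policy until it signals 'stop' or the step budget runs out."""
    observation = env.render()
    actions = []
    for _ in range(max_steps):
        action = policy(observation, target_image)
        if action == "stop":
            return actions, True
        observation = env.step(action)
        actions.append(action)
    return actions, False
```

The point of the sketch is the interface: the policy receives the current observation and the target image and returns an action, and the environment's internal position is never passed to it.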

Papers

Showing 51-100 of 316 papers

Title | Status | Hype
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents | Code | 1
SOON: Scenario Oriented Object Navigation with Graph-based Exploration | Code | 1
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training | Code | 1
Sound Adversarial Audio-Visual Navigation | Code | 1
Learning hierarchical relationships for object-goal navigation | Code | 1
Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy | Code | 1
A Recurrent Vision-and-Language BERT for Navigation | Code | 1
DeepSeqSLAM: A Trainable CNN+RNN for Joint Global Description and Sequence-based Place Recognition | Code | 1
Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? | Code | 1
Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations | Code | 1
Benchmarking Visual Localization for Autonomous Navigation | Code | 1
Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation | Code | 1
Towards Autonomous Crop-Agnostic Visual Navigation in Arable Fields | Code | 1
Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation | Code | 1
MemoNav: Working Memory Model for Visual Navigation | Code | 1
A Visual Navigation Perspective for Category-Level Object Pose Estimation | Code | 1
A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones | Code | 1
Learning Object Relation Graph and Tentative Policy for Visual Navigation | Code | 1
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation | Code | 1
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training | Code | 1
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon | Code | 1
Learning Navigational Visual Representations with Semantic Map Supervision | Code | 1
End-to-End Egospheric Spatial Memory | Code | 1
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning | Code | 1
Learning Exploration Policies for Navigation | Code | 1
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices | Code | 1
Multi3DRefer: Grounding Text Description to Multiple 3D Objects | Code | 1
Extending Maps with Semantic and Contextual Object Information for Robot Navigation: a Learning-Based Framework using Visual and Depth Cues | Code | 1
EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training | Code | 1
Learning to Set Waypoints for Audio-Visual Navigation | Code | 1
Last-Mile Embodied Visual Navigation | Code | 1
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph | Code | 1
SoundSpaces: Audio-Visual Navigation in 3D Environments | Code | 1
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Code | 1
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation | Code | 1
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Code | 1
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language | Code | 1
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes | Code | 1
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Code | 1
Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds | Code | 1
RobustNav: Towards Benchmarking Robustness in Embodied Navigation | Code | 1
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning | Code | 1
Think Locally, Act Globally: Federated Learning with Local and Global Representations | Code | 1
Towards real-world navigation with deep differentiable planners | Code | 1
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation | Code | 1
Cognitive Mapping and Planning for Visual Navigation | Code | 1
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation | Code | 1
Collaborative Visual Navigation | Code | 1
Zero-shot object goal visual navigation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NaviLLM | dist_to_end_reduction | 7.9 | | Unverified
2 | VLN-PETL | dist_to_end_reduction | 6.13 | | Unverified
3 | early to bed | dist_to_end_reduction | 6.03 | | Unverified
4 | HAMT | dist_to_end_reduction | 5.58 | | Unverified
5 | s-agent (NDH-Full) | dist_to_end_reduction | 5.27 | | Unverified
6 | BabyWalk (r2r-pretrain) | dist_to_end_reduction | 4.46 | | Unverified
7 | Environment-agnostic Multitask Learning | dist_to_end_reduction | 3.91 | | Unverified
8 | BabyWalk | dist_to_end_reduction | 3.65 | | Unverified
9 | Test2-NDH | dist_to_end_reduction | 3.44 | | Unverified
10 | SCoA | dist_to_end_reduction | 3.37 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SUSA | spl | 0.64 | | Unverified
2 | Meta-Explore | spl | 0.61 | | Unverified
3 | NaviLLM | spl | 0.6 | | Unverified
4 | BEV-BERT | spl | 0.6 | | Unverified
5 | HOP | spl | 0.59 | | Unverified
6 | DUET | spl | 0.58 | | Unverified
7 | VLN-PETL | spl | 0.58 | | Unverified
8 | VLN-BERT | spl | 0.57 | | Unverified
9 | Prevalent | spl | 0.51 | | Unverified
10 | RCM+SIL(no early exploration) | spl | 0.38 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AutoVLN | Nav-SPL | 27.83 | | Unverified
2 | NaviLLM | Nav-SPL | 26.26 | | Unverified
3 | Meta-Explore | Nav-SPL | 25.8 | | Unverified
4 | SUSA | Nav-SPL | 25.47 | | Unverified
5 | DUET | Nav-SPL | 21.42 | | Unverified
6 | GBE | Nav-SPL | 13.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MVV-IN | SPL (All) | 17.27 | | Unverified
2 | SAVN | SPL (All) | 16.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PopArt-IMPALA | Medium Human-Normalized Score | 72.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Prevalent | spl | 28.72 | | Unverified
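
Several of the tables above report SPL-style metrics (spl, Nav-SPL, SPL (All)). The page does not define them; assuming they follow the usual Success weighted by Path Length definition, where each episode contributes its success flag times the ratio of shortest-path length to the larger of the shortest-path length and the path actually taken, a minimal sketch looks like this. The function name and argument names are illustrative, not part of any benchmark toolkit.

```python
def spl(successes, shortest_lengths, path_lengths):
    """Success weighted by Path Length, averaged over episodes.

    successes:        0/1 (or bool) flag per episode
    shortest_lengths: shortest-path length from start to goal per episode
    path_lengths:     length of the path the agent actually took per episode
    """
    if not (len(successes) == len(shortest_lengths) == len(path_lengths)):
        raise ValueError("all inputs must have the same number of episodes")
    if len(successes) == 0:
        raise ValueError("at least one episode is required")

    total = 0.0
    for success, shortest, taken in zip(successes, shortest_lengths, path_lengths):
        if shortest <= 0:
            raise ValueError("shortest-path lengths must be positive")
        # A failed episode contributes 0; a successful one is penalised
        # in proportion to how much longer the path was than optimal.
        total += float(success) * shortest / max(taken, shortest)
    return total / len(successes)
```

For example, three episodes with one optimal success, one success along a path twice the optimal length, and one failure give (1 + 0.5 + 0) / 3 = 0.5. The result lies in [0, 1]; the tables that list values such as 27.83 presumably report the same kind of quantity as a percentage, though the page does not say so.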