SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning

Papers

Showing 76100 of 316 papers

TitleStatusHype
Towards Learning a Generalist Model for Embodied NavigationCode2
Deep Learning for Visual Navigation of Underwater Robots0
Bird's Eye View Based Pretrained World model for Visual Navigation0
Invariance is Key to Generalization: Examining the Role of Representation in Sim-to-Real Transfer for Visual Navigation0
What you see is what you get: Experience ranking with deep neural dataset-to-dataset similarity for topological localisationCode0
Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship NetworkCode0
Multimodal Large Language Model for Visual Navigation0
A Decentralized Cooperative Navigation Approach for Visual Homing Networks0
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent PhenomenonCode1
STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience0
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room ScenesCode1
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room ScenesCode1
Wait, That Feels Familiar: Learning to Extrapolate Human Preferences for Preference Aligned Path Planning0
Multi3DRefer: Grounding Text Description to Multiple 3D ObjectsCode1
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation0
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language NavigationCode0
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural languageCode1
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied AgentsCode1
Multi-goal Audio-visual Navigation using Sound Direction Map0
Scaling Data Generation in Vision-and-Language NavigationCode2
Learning Navigational Visual Representations with Semantic Map SupervisionCode1
Online Self-Supervised Thermal Water Segmentation for Aerial VehiclesCode1
The Drunkard's Odometry: Estimating Camera Motion in Deforming ScenesCode1
ViNT: A Foundation Model for Visual NavigationCode3
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual NavigationCode1
Show:102550
← PrevPage 4 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NaviLLMdist_to_end_reduction7.9Unverified
2VLN-PETLdist_to_end_reduction6.13Unverified
3early to beddist_to_end_reduction6.03Unverified
4HAMTdist_to_end_reduction5.58Unverified
5s-agent (NDH-Full)dist_to_end_reduction5.27Unverified
6BabyWalk (r2r-pretrain)dist_to_end_reduction4.46Unverified
7Environment-agnostic Multitask Learningdist_to_end_reduction3.91Unverified
8BabyWalkdist_to_end_reduction3.65Unverified
9Test2-NDHdist_to_end_reduction3.44Unverified
10SCoAdist_to_end_reduction3.37Unverified
#ModelMetricClaimedVerifiedStatus
1SUSAspl0.64Unverified
2Meta-Explorespl0.61Unverified
3NaviLLMspl0.6Unverified
4BEV-BERTspl0.6Unverified
5HOPspl0.59Unverified
6DUETspl0.58Unverified
7VLN-PETLspl0.58Unverified
8VLN-BERTspl0.57Unverified
9Prevalentspl0.51Unverified
10RCM+SIL(no early exploration)spl0.38Unverified
#ModelMetricClaimedVerifiedStatus
1AutoVLNNav-SPL27.83Unverified
2NaviLLMNav-SPL26.26Unverified
3Meta-ExploreNav-SPL25.8Unverified
4SUSANav-SPL25.47Unverified
5DUETNav-SPL21.42Unverified
6GBENav-SPL13.3Unverified
#ModelMetricClaimedVerifiedStatus
1MVV-INSPL (All)17.27Unverified
2SAVNSPL (All)16.15Unverified
#ModelMetricClaimedVerifiedStatus
1PopArt-IMPALAMedium Human-Normalized Score72.8Unverified
#ModelMetricClaimedVerifiedStatus
1Prevalentspl28.72Unverified