SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning

Papers

Showing 151200 of 316 papers

TitleStatusHype
MetaCropFollow: Few-Shot Adaptation with Meta-Learning for Under-Canopy Navigation0
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding0
Exploring the Impacts from Datasets to Monocular Depth Estimation (MDE) Models with MineNavi0
MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents0
Adaptive Navigation Scheme for Optimal Deep-Sea Localization Using Multimodal Perception Cues0
Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics0
MRS-VPR: a multi-resolution sampling based global visual place recognition method0
Gram-SLD: Automatic Self-labeling and Detection for Instance Objects0
Multi-goal Audio-visual Navigation using Sound Direction Map0
Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning0
Multimodal Large Language Model for Visual Navigation0
Multi-View Pedestrian Occupancy Prediction with a Novel Synthetic Dataset0
Google Map Aided Visual Navigation for UAVs in GPS-denied Environment0
NaRPA: Navigation and Rendering Pipeline for Astronautics0
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better0
Generating Robust Supervision for Learning-Based Visual Navigation Using Hamilton-Jacobi Reachability0
Navigating to Objects in the Real World0
GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment0
Neural Topological SLAM for Visual Navigation0
Newton-PnP: Real-time Visual Navigation for Autonomous Toy-Drones0
Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation States0
Object-oriented Targets for Visual Navigation using Rich Semantic Representations0
Visuospatial navigation without distance, prediction, integration, or maps0
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation0
Bird's Eye View Based Pretrained World model for Visual Navigation0
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN)0
FloNa: Floor Plan Guided Embodied Visual Navigation0
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning0
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments0
NOLO: Navigate Only Look Once0
Optimizing Gaze Direction in a Visual Navigation Task0
Flex: End-to-End Text-Instructed Visual Navigation from Foundation Model Features0
OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav0
Pay Self-Attention to Audio-Visual Navigation0
Perception and Navigation in Autonomous Systems in the Era of Learning: A Survey0
Perceptual Attention-based Predictive Control0
Filter-Aware Model-Predictive Control0
Point Cloud Based Reinforcement Learning for Sim-to-Real and Partial Observability in Visual Navigation0
Polyline Generative Navigable Space Segmentation for Autonomous Visual Navigation0
Few-Shot Goal Inference for Visuomotor Learning and Planning0
Pose Invariant Topological Memory for Visual Navigation0
Predicting Topological Maps for Visual Navigation in Unexplored Environments0
Predictive Control Using Learned State Space Models via Rolling Horizon Evolution0
Feudal Networks for Visual Navigation0
Fast Traversability Estimation for Wild Visual Navigation0
RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation0
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation0
RCA: Ride Comfort-Aware Visual Navigation via Self-Supervised Learning0
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach0
ReCoRe: Regularized Contrastive Representation Learning of World Model0
Show:102550
← PrevPage 4 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1NaviLLMdist_to_end_reduction7.9Unverified
2VLN-PETLdist_to_end_reduction6.13Unverified
3early to beddist_to_end_reduction6.03Unverified
4HAMTdist_to_end_reduction5.58Unverified
5s-agent (NDH-Full)dist_to_end_reduction5.27Unverified
6BabyWalk (r2r-pretrain)dist_to_end_reduction4.46Unverified
7Environment-agnostic Multitask Learningdist_to_end_reduction3.91Unverified
8BabyWalkdist_to_end_reduction3.65Unverified
9Test2-NDHdist_to_end_reduction3.44Unverified
10SCoAdist_to_end_reduction3.37Unverified
#ModelMetricClaimedVerifiedStatus
1SUSAspl0.64Unverified
2Meta-Explorespl0.61Unverified
3NaviLLMspl0.6Unverified
4BEV-BERTspl0.6Unverified
5HOPspl0.59Unverified
6DUETspl0.58Unverified
7VLN-PETLspl0.58Unverified
8VLN-BERTspl0.57Unverified
9Prevalentspl0.51Unverified
10RCM+SIL(no early exploration)spl0.38Unverified
#ModelMetricClaimedVerifiedStatus
1AutoVLNNav-SPL27.83Unverified
2NaviLLMNav-SPL26.26Unverified
3Meta-ExploreNav-SPL25.8Unverified
4SUSANav-SPL25.47Unverified
5DUETNav-SPL21.42Unverified
6GBENav-SPL13.3Unverified
#ModelMetricClaimedVerifiedStatus
1MVV-INSPL (All)17.27Unverified
2SAVNSPL (All)16.15Unverified
#ModelMetricClaimedVerifiedStatus
1PopArt-IMPALAMedium Human-Normalized Score72.8Unverified
#ModelMetricClaimedVerifiedStatus
1Prevalentspl28.72Unverified