SOTAVerified

Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, through an environment using only camera input. The agent is given a target image (the view it would see from the target position) and must move from its current position to the target by applying a sequence of actions chosen from its camera observations alone.

Source: Vision-based Navigation Using Deep Reinforcement Learning
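The task definition above can be illustrated with a toy observe-act loop. This is only a sketch under strong assumptions, not the method of any listed paper: `ToyGridEnv`, `greedy_policy`, `navigate`, the action names, and the two-pixel "image" are all hypothetical stand-ins. A real system would use camera frames and a learned policy. The point it shows is that the agent only ever sees its current observation and the target image, never its position.

```python
from typing import Dict, Tuple

Image = Tuple[int, int]  # toy "camera frame": two pixel intensities

# Hypothetical discrete action set: name -> (dx, dy) on the grid.
ACTIONS: Dict[str, Tuple[int, int]] = {
    "right": (1, 0), "left": (-1, 0), "up": (0, 1), "down": (0, -1),
}


class ToyGridEnv:
    """Toy stand-in for a simulator; the agent never reads `_pos` directly."""

    def __init__(self, size: int, start: Tuple[int, int]) -> None:
        self.size = size
        self._pos = start

    def observe(self) -> Image:
        # Assumption: each cell yields a unique, smoothly varying "image".
        x, y = self._pos
        return (10 * x, 10 * y)

    def step(self, action: str) -> Image:
        # Apply the action, clamp to the grid, and return the new observation.
        dx, dy = ACTIONS[action]
        x = min(max(self._pos[0] + dx, 0), self.size - 1)
        y = min(max(self._pos[1] + dy, 0), self.size - 1)
        self._pos = (x, y)
        return self.observe()


def greedy_policy(obs: Image, target: Image) -> str:
    """Hand-written stand-in for a learned policy: shrink the larger pixel gap."""
    gx, gy = target[0] - obs[0], target[1] - obs[1]
    if abs(gx) >= abs(gy):
        return "right" if gx > 0 else "left"
    return "up" if gy > 0 else "down"


def navigate(env: ToyGridEnv, target: Image, max_steps: int = 50) -> Tuple[bool, int]:
    """Run the observe-act loop until the observation matches the target image."""
    obs = env.observe()
    for t in range(max_steps):
        if obs == target:
            return True, t
        obs = env.step(greedy_policy(obs, target))
    return obs == target, max_steps


if __name__ == "__main__":
    env = ToyGridEnv(size=5, start=(0, 0))
    print(navigate(env, target=(30, 20)))
```

In this toy setting the target image happens to encode the goal location directly, which is what makes a hand-written policy sufficient; with real camera images that mapping has to be learned.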

Papers

Showing 201–250 of 316 papers

| Title | Status | Hype |
|---|---|---|
| Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation | | 0 |
| Fast Object Detection with a Machine Learning Edge Device | | 0 |
| Retrospectives on the Embodied AI Workshop | | 0 |
| RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation | | 0 |
| RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | | 0 |
| RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | | 0 |
| Robot in a China Shop: Using Reinforcement Learning for Location-Specific Navigation Behaviour | | 0 |
| Explore then Execute: Adapting without Rewards via Factorized Meta-Reinforcement Learning | | 0 |
| Robustness of Utilizing Feedback in Embodied Visual Navigation | | 0 |
| SACSoN: Scalable Autonomous Control for Social Navigation | | 0 |
| Exploiting Scene-specific Features for Object Goal Navigation | | 0 |
| SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment | | 0 |
| ViVa-SAFELAND: a New Freeware for Safe Validation of Vision-based Navigation in Aerial Vehicles | | 0 |
| Environment Predictive Coding for Visual Navigation | | 0 |
| Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation | | 0 |
| Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks | | 0 |
| Scene Retrieval for Contextual Visual Mapping | | 0 |
| Zero-shot Imitation Learning from Demonstrations for Legged Robot Visual Navigation | | 0 |
| Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation | | 0 |
| Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency | | 0 |
| STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience | | 0 |
| Semantic Audio-Visual Navigation | | 0 |
| Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter | | 0 |
| Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method | | 0 |
| Deep Visual Navigation under Partial Observability | | 0 |
| Embodied Visual Navigation with Automatic Curriculum Learning in Real Environments | | 0 |
| Shifting the Baseline: Single Modality Performance on Visual Navigation & QA | | 0 |
| Shifting the Baseline: Single Modality Performance on Visual Navigation & QA | | 0 |
| SHREC 2020 track: 6D Object Pose Estimation | | 0 |
| Embodied Multimodal Multitask Learning | | 0 |
| Sim2Real Transfer for Audio-Visual Navigation with Frequency-Adaptive Acoustic Field Prediction | | 0 |
| Embodied Agents for Efficient Exploration and Smart Scene Description | | 0 |
| Single Object Tracking Research: A Survey | | 0 |
| Situational Fusion of Visual Representation for Visual Navigation | | 0 |
| SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation | | 0 |
| Solving Vision Tasks with Simple Photoreceptors Instead of Cameras | | 0 |
| Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | | 0 |
| Dynamical Audio-Visual Navigation: Catching Unheard Moving Sound Sources in Unmapped 3D Environments | | 0 |
| DRISHTI: Visual Navigation Assistant for Visually Impaired | | 0 |
| Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation | | 0 |
| DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction | | 0 |
| Deep Learning for Visual Navigation of Underwater Robots | | 0 |
| VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion | | 0 |
| Symmetry-aware Neural Architecture for Embodied Visual Navigation | | 0 |
| Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory | | 0 |
| Deep Learning for Embodied Vision Navigation: A Survey | | 0 |
| Active Object Perceiver: Recognition-guided Policy Learning for Object Searching on Mobile Robots | | 0 |
| Target Driven Visual Navigation with Hybrid Asynchronous Universal Successor Representations | | 0 |
| Decision-based AI Visual Navigation for Cardiac Ultrasounds | | 0 |
| TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability | | 0 |

Benchmark Results

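| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | NaviLLM | dist_to_end_reduction | 7.9 | | Unverified |
| 2 | VLN-PETL | dist_to_end_reduction | 6.13 | | Unverified |
| 3 | early to bed | dist_to_end_reduction | 6.03 | | Unverified |
| 4 | HAMT | dist_to_end_reduction | 5.58 | | Unverified |
| 5 | s-agent (NDH-Full) | dist_to_end_reduction | 5.27 | | Unverified |
| 6 | BabyWalk (r2r-pretrain) | dist_to_end_reduction | 4.46 | | Unverified |
| 7 | Environment-agnostic Multitask Learning | dist_to_end_reduction | 3.91 | | Unverified |
| 8 | BabyWalk | dist_to_end_reduction | 3.65 | | Unverified |
| 9 | Test2-NDH | dist_to_end_reduction | 3.44 | | Unverified |
| 10 | SCoA | dist_to_end_reduction | 3.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SUSA | spl | 0.64 | | Unverified |
| 2 | Meta-Explore | spl | 0.61 | | Unverified |
| 3 | NaviLLM | spl | 0.6 | | Unverified |
| 4 | BEV-BERT | spl | 0.6 | | Unverified |
| 5 | HOP | spl | 0.59 | | Unverified |
| 6 | DUET | spl | 0.58 | | Unverified |
| 7 | VLN-PETL | spl | 0.58 | | Unverified |
| 8 | VLN-BERT | spl | 0.57 | | Unverified |
| 9 | Prevalent | spl | 0.51 | | Unverified |
| 10 | RCM+SIL(no early exploration) | spl | 0.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AutoVLN | Nav-SPL | 27.83 | | Unverified |
| 2 | NaviLLM | Nav-SPL | 26.26 | | Unverified |
| 3 | Meta-Explore | Nav-SPL | 25.8 | | Unverified |
| 4 | SUSA | Nav-SPL | 25.47 | | Unverified |
| 5 | DUET | Nav-SPL | 21.42 | | Unverified |
| 6 | GBE | Nav-SPL | 13.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MVV-IN | SPL (All) | 17.27 | | Unverified |
| 2 | SAVN | SPL (All) | 16.15 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PopArt-IMPALA | Medium Human-Normalized Score | 72.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Prevalent | spl | 28.72 | | Unverified |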