Visual Navigation

Visual Navigation is the problem of navigating an agent, e.g. a mobile robot, in an environment using camera input only. The agent is given a target image (an image it will see from the target position), and its goal is to move from its current position to the target by applying a sequence of actions, based on the camera observations only.

Source: Vision-based Navigation Using Deep Reinforcement Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 316 papers

Title	Date	Tasks	Status	Hype
Towards Learning a Generalist Model for Embodied Navigation	Dec 4, 2023	3D Question Answering (3D-QA)Embodied Question Answering	CodeCode Available	2
Deep Learning for Visual Navigation of Underwater Robots	Oct 30, 2023	Deep LearningImitation Learning	—Unverified	0
Bird's Eye View Based Pretrained World model for Visual Navigation	Oct 28, 2023	NavigateVisual Navigation	—Unverified	0
Invariance is Key to Generalization: Examining the Role of Representation in Sim-to-Real Transfer for Visual Navigation	Oct 23, 2023	Visual Navigation	—Unverified	0
What you see is what you get: Experience ranking with deep neural dataset-to-dataset similarity for topological localisation	Oct 20, 2023	Visual Navigation	CodeCode Available	0
Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network	Oct 15, 2023	ObjectSemantic Similarity	CodeCode Available	0
Multimodal Large Language Model for Visual Navigation	Oct 12, 2023	Language ModelingLanguage Modelling	—Unverified	0
A Decentralized Cooperative Navigation Approach for Visual Homing Networks	Oct 2, 2023	Visual Navigation	—Unverified	0
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon	Sep 28, 2023	Pose EstimationVisual Navigation	CodeCode Available	1
STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience	Sep 26, 2023	Representation LearningVisual Navigation	—Unverified	0
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes	Sep 21, 2023	counterfactualVisual Navigation	CodeCode Available	1
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes	Sep 21, 2023	counterfactualVisual Navigation	CodeCode Available	1
Wait, That Feels Familiar: Learning to Extrapolate Human Preferences for Preference Aligned Path Planning	Sep 18, 2023	NavigateRobot Navigation	—Unverified	0
Multi3DRefer: Grounding Text Description to Multiple 3D Objects	Sep 11, 2023	3D visual groundingContrastive Learning	CodeCode Available	1
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation	Aug 20, 2023	Decision MakingTransfer Learning	—Unverified	0
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation	Aug 20, 2023	Transfer LearningVision and Language Navigation	CodeCode Available	0
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language	Aug 17, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents	Aug 14, 2023	Instruction FollowingVisual Navigation	CodeCode Available	1
Multi-goal Audio-visual Navigation using Sound Direction Map	Aug 1, 2023	Deep Reinforcement LearningNavigate	—Unverified	0
Scaling Data Generation in Vision-and-Language Navigation	Jul 28, 2023	Imitation LearningVision and Language Navigation	CodeCode Available	2
Learning Navigational Visual Representations with Semantic Map Supervision	Jul 23, 2023	Representation LearningSelf-Supervised Learning	CodeCode Available	1
Online Self-Supervised Thermal Water Segmentation for Aerial Vehicles	Jul 18, 2023	SegmentationVisual Navigation	CodeCode Available	1
The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes	Jun 29, 2023	6D Pose Estimation using RGBDOptical Flow Estimation	CodeCode Available	1
ViNT: A Foundation Model for Visual Navigation	Jun 26, 2023	modelVisual Navigation	CodeCode Available	3
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation	Jun 20, 2023	Collision AvoidanceComputational Efficiency	CodeCode Available	1

Show:10 25 50

← PrevPage 4 of 13Next →

All datasets Cooperative Vision-and-Dialogue Navigation R2R SOON Test AI2-THOR Dmlab-30 Help, Anna! (HANNA)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NaviLLM	dist_to_end_reduction	7.9	—	Unverified
2	VLN-PETL	dist_to_end_reduction	6.13	—	Unverified
3	early to bed	dist_to_end_reduction	6.03	—	Unverified
4	HAMT	dist_to_end_reduction	5.58	—	Unverified
5	s-agent (NDH-Full)	dist_to_end_reduction	5.27	—	Unverified
6	BabyWalk (r2r-pretrain)	dist_to_end_reduction	4.46	—	Unverified
7	Environment-agnostic Multitask Learning	dist_to_end_reduction	3.91	—	Unverified
8	BabyWalk	dist_to_end_reduction	3.65	—	Unverified
9	Test2-NDH	dist_to_end_reduction	3.44	—	Unverified
10	SCoA	dist_to_end_reduction	3.37	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SUSA	spl	0.64	—	Unverified
2	Meta-Explore	spl	0.61	—	Unverified
3	NaviLLM	spl	0.6	—	Unverified
4	BEV-BERT	spl	0.6	—	Unverified
5	HOP	spl	0.59	—	Unverified
6	DUET	spl	0.58	—	Unverified
7	VLN-PETL	spl	0.58	—	Unverified
8	VLN-BERT	spl	0.57	—	Unverified
9	Prevalent	spl	0.51	—	Unverified
10	RCM+SIL(no early exploration)	spl	0.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AutoVLN	Nav-SPL	27.83	—	Unverified
2	NaviLLM	Nav-SPL	26.26	—	Unverified
3	Meta-Explore	Nav-SPL	25.8	—	Unverified
4	SUSA	Nav-SPL	25.47	—	Unverified
5	DUET	Nav-SPL	21.42	—	Unverified
6	GBE	Nav-SPL	13.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVV-IN	SPL (All)	17.27	—	Unverified
2	SAVN	SPL (All)	16.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PopArt-IMPALA	Medium Human-Normalized Score	72.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Prevalent	spl	28.72	—	Unverified