SOTAVerified|Agents Browse Leaderboard About Blog

Vision and Language Navigation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 223 papers

Title	Date	Tasks	Status	Hype	Score
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue	Feb 8, 2024	Conversational Web NavigationText Generation	CodeCode Available	5	5
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models	Jul 9, 2024	Vision and Language Navigation	CodeCode Available	3	5
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models	Jul 17, 2024	Instruction FollowingVision and Language Navigation	CodeCode Available	3	5
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions	Jun 27, 2024	NavigateVision and Language Navigation	CodeCode Available	2	5
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models	May 19, 2025	Disaster ResponseVision and Language Navigation	CodeCode Available	2	5
General Scene Adaptation for Vision-and-Language Navigation	Jan 29, 2025	DiversityVision and Language Navigation	CodeCode Available	2	5
AerialVLN: Vision-and-Language Navigation for UAVs	Aug 13, 2023	cross-modal alignmentNavigate	CodeCode Available	2	5
BEVBert: Multimodal Map Pre-training for Language-guided Navigation	Dec 8, 2022	Vision and Language NavigationVisual Navigation	CodeCode Available	2	5
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)	Jun 23, 2022	Data AugmentationVision and Language Navigation	CodeCode Available	2	5
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation	May 16, 2025	3D geometryNavigate	CodeCode Available	2	5

Show:10 25 50

← PrevPage 1 of 23Next →

All datasets VLN Challenge Touchdown Dataset RxR map2seq Room2Room robo-vln

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	human	success	0.86	—	Unverified
2	Lily	success	0.79	—	Unverified
3	Airbert	success	0.78	—	Unverified
4	Global Normalization	success	0.74	—	Unverified
5	explore@40 beam-search	success	0.74	—	Unverified
6	BEVBert	success	0.73	—	Unverified
7	GMap	success	0.73	—	Unverified
8	VLN-Bert	success	0.73	—	Unverified
9	Gloabl Normalization pre-explore	success	0.73	—	Unverified
10	FOAM-Beam Search	success	0.72	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FLAME	Task Completion (TC)	40.2	—	Unverified
2	ORAR + junction type + heading delta	Task Completion (TC)	29.1	—	Unverified
3	ORAR	Task Completion (TC)	24.2	—	Unverified
4	ARC + L2STOP	Task Completion (TC)	16.68	—	Unverified
5	VLN Transformer +M-50 +style	Task Completion (TC)	16.2	—	Unverified
6	VLN Transformer	Task Completion (TC)	14.9	—	Unverified
7	ARC	Task Completion (TC)	14.13	—	Unverified
8	Retouch-RConcat	Task Completion (TC)	12.8	—	Unverified
9	Gated Attention (GA)	Task Completion (TC)	11.9	—	Unverified
10	RConcat	Task Completion (TC)	11.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MARVAL	ndtw	66.76	—	Unverified
2	EnvEdit-PT	ndtw	64.61	—	Unverified
3	HAMT	ndtw	59.94	—	Unverified
4	CLEAR-CLIP	ndtw	53.69	—	Unverified
5	Monolingual Baseline	ndtw	41.05	—	Unverified
6	Multilingual Baseline	ndtw	36.81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FLAME	Task Completion (TC)	52.44	—	Unverified
2	ORAR + junction type + heading delta	Task Completion (TC)	46.7	—	Unverified
3	ORAR	Task Completion (TC)	45.1	—	Unverified
4	Gated Attention	Task Completion (TC)	17	—	Unverified
5	Rconcat	Task Completion (TC)	14.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	R2R+EnvDrop	spl	0.61	—	Unverified
2	RCM + SIL	spl	0.59	—	Unverified
3	Tactical Rewind - short	spl	0.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hierarchical Cross-Modal Agent	SPL (Sucess Weighted by Path Length)	0.4	—	Unverified