XFeat: Accelerated Features for Lightweight Image Matching Apr 30, 2024 CPU Keypoint detection and image matching
Code Code Available 55 Navigation World Models Dec 4, 2024 Robot Navigation Video Generation
Code Code Available 45 Visual Planning: Let's Think Only with Images May 16, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 35 ViNT: A Foundation Model for Visual Navigation Jun 26, 2023 model Visual Navigation
Code Code Available 35 CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Nov 26, 2024 Common Sense Reasoning Imitation Learning
Code Code Available 35 LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Dec 2, 2024 Embodied Question Answering Question Answering
Code Code Available 25 NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation Apr 14, 2025 Visual Navigation
Code Code Available 25 GaussNav: Gaussian Splatting for Visual Navigation Mar 18, 2024 3DGS Visual Navigation
Code Code Available 25 Scaling Data Generation in Vision-and-Language Navigation Jul 28, 2023 Imitation Learning Vision and Language Navigation
Code Code Available 25 SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning Jun 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 BEVBert: Multimodal Map Pre-training for Language-guided Navigation Dec 8, 2022 Vision and Language Navigation Visual Navigation
Code Code Available 25 Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models Apr 14, 2025 Action Generation Denoising
Code Code Available 25 Towards Learning a Generalist Model for Embodied Navigation Dec 4, 2023 3D Question Answering (3D-QA) Embodied Question Answering
Code Code Available 25 POPGym: Benchmarking Partially Observable Reinforcement Learning Mar 3, 2023 Benchmarking GPU
Code Code Available 25 NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models May 26, 2023 Instruction Following Vision and Language Navigation
Code Code Available 25 Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation Feb 23, 2022 Efficient Exploration Navigate
Code Code Available 25 Vision-and-Language Navigation via Causal Learning Apr 16, 2024 Causal Inference Contrastive Learning
Code Code Available 25 GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation Apr 9, 2024 Go to AnyThing Navigate
Code Code Available 25 Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? Dec 13, 2019 PointGoal Navigation Visual Navigation
Code Code Available 15 Last-Mile Embodied Visual Navigation Nov 21, 2022 Visual Navigation
Code Code Available 15 Learning Exploration Policies for Navigation Mar 5, 2019 Efficient Exploration General Reinforcement Learning
Code Code Available 15 HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation Mar 22, 2022 Decision Making Language Modeling
Code Code Available 15 HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation Jun 20, 2023 Collision Avoidance Computational Efficiency
Code Code Available 15 An Open Source and Open Hardware Deep Learning-powered Visual Navigation Engine for Autonomous Nano-UAVs May 10, 2019 Autonomous Navigation Visual Navigation
Code Code Available 15 Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph Mar 1, 2021 Hierarchical Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 15 A Pose-only Solution to Visual Reconstruction and Navigation Mar 2, 2021 3D Scene Reconstruction Computational Efficiency
Code Code Available 15 Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language Aug 17, 2023 Language Modeling Language Modelling
Code Code Available 15 A Recurrent Vision-and-Language BERT for Navigation Nov 26, 2020 Decision Making Decoder
Code Code Available 15 Learning from Unlabeled 3D Environments for Vision-and-Language Navigation Aug 24, 2022 Language Modeling Language Modelling
Code Code Available 15 Benchmarking Visual Localization for Autonomous Navigation Mar 24, 2022 Autonomous Navigation Benchmarking
Code Code Available 15 Offline Reinforcement Learning for Visual Navigation Dec 16, 2022 Navigate Offline RL
Code Code Available 15 Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds Nov 29, 2021 Navigate Visual Navigation
Code Code Available 15 Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning Oct 25, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 15 Multi3DRefer: Grounding Text Description to Multiple 3D Objects Sep 11, 2023 3D visual grounding Contrastive Learning
Code Code Available 15 CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning Oct 10, 2019 Autonomous Driving Decision Making
Code Code Available 15 End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon Sep 28, 2023 Pose Estimation Visual Navigation
Code Code Available 15 MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation Mar 2, 2020 Autonomous Driving Autonomous Navigation
Code Code Available 15 EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training Feb 26, 2025 Mamba Representation Learning
Code Code Available 15 Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation Dec 9, 2024 Object Localization Vision and Language Navigation
Code Code Available 15 Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning Dec 3, 2018 Meta-Learning Meta Reinforcement Learning
Code Code Available 15 Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices Aug 6, 2020 Meta Reinforcement Learning reinforcement-learning
Code Code Available 15 Extending Maps with Semantic and Contextual Object Information for Robot Navigation: a Learning-Based Framework using Visual and Depth Cues Mar 13, 2020 Object Robot Navigation
Code Code Available 15 CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes Sep 21, 2023 counterfactual Visual Navigation
Code Code Available 15 CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes Sep 21, 2023 counterfactual Visual Navigation
Code Code Available 15 A Visual Navigation Perspective for Category-Level Object Pose Estimation Mar 25, 2022 Imitation Learning Pose Estimation
Code Code Available 15 An Interactive Navigation Method with Effect-oriented Affordance Jan 1, 2024 Navigate Visual Navigation
Code Code Available 15 A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones May 4, 2018 Autonomous Navigation Visual Navigation
Code Code Available 15 Cognitive Mapping and Planning for Visual Navigation Feb 13, 2017 Visual Navigation
Code Code Available 15 Collaborative Visual Navigation Jul 2, 2021 Multi-agent Reinforcement Learning Navigate
Code Code Available 15 Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations Feb 23, 2020 Atari Games Decision Making
Code Code Available 15