XFeat: Accelerated Features for Lightweight Image Matching Apr 30, 2024 CPU Keypoint detection and image matching
Code Code Available 5Navigation World Models Dec 4, 2024 Robot Navigation Video Generation
Code Code Available 4Visual Planning: Let's Think Only with Images May 16, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 3CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Nov 26, 2024 Common Sense Reasoning Imitation Learning
Code Code Available 3ViNT: A Foundation Model for Visual Navigation Jun 26, 2023 model Visual Navigation
Code Code Available 3SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning Jun 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models May 26, 2023 Instruction Following Vision and Language Navigation
Code Code Available 2LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Dec 2, 2024 Embodied Question Answering Question Answering
Code Code Available 2Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation Feb 23, 2022 Efficient Exploration Navigate
Code Code Available 2Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models Apr 14, 2025 Action Generation Denoising
Code Code Available 2GaussNav: Gaussian Splatting for Visual Navigation Mar 18, 2024 3DGS Visual Navigation
Code Code Available 2Scaling Data Generation in Vision-and-Language Navigation Jul 28, 2023 Imitation Learning Vision and Language Navigation
Code Code Available 2GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation Apr 9, 2024 Go to AnyThing Navigate
Code Code Available 2NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation Apr 14, 2025 Visual Navigation
Code Code Available 2POPGym: Benchmarking Partially Observable Reinforcement Learning Mar 3, 2023 Benchmarking GPU
Code Code Available 2Towards Learning a Generalist Model for Embodied Navigation Dec 4, 2023 3D Question Answering (3D-QA) Embodied Question Answering
Code Code Available 2Vision-and-Language Navigation via Causal Learning Apr 16, 2024 Causal Inference Contrastive Learning
Code Code Available 2BEVBert: Multimodal Map Pre-training for Language-guided Navigation Dec 8, 2022 Vision and Language Navigation Visual Navigation
Code Code Available 2Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? Dec 13, 2019 PointGoal Navigation Visual Navigation
Code Code Available 1A Recurrent Vision-and-Language BERT for Navigation Nov 26, 2020 Decision Making Decoder
Code Code Available 1MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation Mar 2, 2020 Autonomous Driving Autonomous Navigation
Code Code Available 1MemoNav: Working Memory Model for Visual Navigation Feb 29, 2024 Decision Making Graph Attention
Code Code Available 1Learning Object Relation Graph and Tentative Policy for Visual Navigation Jul 21, 2020 Imitation Learning Relation
Code Code Available 1An Open Source and Open Hardware Deep Learning-powered Visual Navigation Engine for Autonomous Nano-UAVs May 10, 2019 Autonomous Navigation Visual Navigation
Code Code Available 1Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning Dec 3, 2018 Meta-Learning Meta Reinforcement Learning
Code Code Available 1A Pose-only Solution to Visual Reconstruction and Navigation Mar 2, 2021 3D Scene Reconstruction Computational Efficiency
Code Code Available 1Multi3DRefer: Grounding Text Description to Multiple 3D Objects Sep 11, 2023 3D visual grounding Contrastive Learning
Code Code Available 1Offline Reinforcement Learning for Visual Navigation Dec 16, 2022 Navigate Offline RL
Code Code Available 1Last-Mile Embodied Visual Navigation Nov 21, 2022 Visual Navigation
Code Code Available 1Benchmarking Visual Localization for Autonomous Navigation Mar 24, 2022 Autonomous Navigation Benchmarking
Code Code Available 1Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language Aug 17, 2023 Language Modeling Language Modelling
Code Code Available 1Learning Exploration Policies for Navigation Mar 5, 2019 Efficient Exploration General Reinforcement Learning
Code Code Available 1Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph Mar 1, 2021 Hierarchical Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents Aug 14, 2023 Instruction Following Visual Navigation
Code Code Available 1HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation Mar 22, 2022 Decision Making Language Modeling
Code Code Available 1Learning from Unlabeled 3D Environments for Vision-and-Language Navigation Aug 24, 2022 Language Modeling Language Modelling
Code Code Available 1Extending Maps with Semantic and Contextual Object Information for Robot Navigation: a Learning-Based Framework using Visual and Depth Cues Mar 13, 2020 Object Robot Navigation
Code Code Available 1Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation Dec 9, 2024 Object Localization Vision and Language Navigation
Code Code Available 1CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning Oct 10, 2019 Autonomous Driving Decision Making
Code Code Available 1A Visual Navigation Perspective for Category-Level Object Pose Estimation Mar 25, 2022 Imitation Learning Pose Estimation
Code Code Available 1A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones May 4, 2018 Autonomous Navigation Visual Navigation
Code Code Available 1HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation Jun 20, 2023 Collision Avoidance Computational Efficiency
Code Code Available 1CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes Sep 21, 2023 counterfactual Visual Navigation
Code Code Available 1CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes Sep 21, 2023 counterfactual Visual Navigation
Code Code Available 1Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds Nov 29, 2021 Navigate Visual Navigation
Code Code Available 1An Interactive Navigation Method with Effect-oriented Affordance Jan 1, 2024 Navigate Visual Navigation
Code Code Available 1Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices Aug 6, 2020 Meta Reinforcement Learning reinforcement-learning
Code Code Available 1Cognitive Mapping and Planning for Visual Navigation Feb 13, 2017 Visual Navigation
Code Code Available 1Collaborative Visual Navigation Jul 2, 2021 Multi-agent Reinforcement Learning Navigate
Code Code Available 1EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training Feb 26, 2025 Mamba Representation Learning
Code Code Available 1