XFeat: Accelerated Features for Lightweight Image Matching · Apr 30, 2024 · CPU, Keypoint detection and image matching · Code Available 5
Navigation World Models · Dec 4, 2024 · Robot Navigation, Video Generation · Code Available 4
Visual Planning: Let's Think Only with Images · May 16, 2025 · reinforcement-learning, Reinforcement Learning · Code Available 3
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos · Nov 26, 2024 · Common Sense Reasoning, Imitation Learning · Code Available 3
ViNT: A Foundation Model for Visual Navigation · Jun 26, 2023 · model, Visual Navigation · Code Available 3
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models · Apr 14, 2025 · Action Generation, Denoising · Code Available 2
NaviDiffusor: Cost-Guided Diffusion Model for Visual Navigation · Apr 14, 2025 · Visual Navigation · Code Available 2
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences · Dec 2, 2024 · Embodied Question Answering, Question Answering · Code Available 2
Vision-and-Language Navigation via Causal Learning · Apr 16, 2024 · Causal Inference, Contrastive Learning · Code Available 2
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation · Apr 9, 2024 · Go to AnyThing, Navigate · Code Available 2
GaussNav: Gaussian Splatting for Visual Navigation · Mar 18, 2024 · 3DGS, Visual Navigation · Code Available 2
Towards Learning a Generalist Model for Embodied Navigation · Dec 4, 2023 · 3D Question Answering (3D-QA), Embodied Question Answering · Code Available 2
Scaling Data Generation in Vision-and-Language Navigation · Jul 28, 2023 · Imitation Learning, Vision and Language Navigation · Code Available 2
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models · May 26, 2023 · Instruction Following, Vision and Language Navigation · Code Available 2
POPGym: Benchmarking Partially Observable Reinforcement Learning · Mar 3, 2023 · Benchmarking, GPU · Code Available 2
BEVBert: Multimodal Map Pre-training for Language-guided Navigation · Dec 8, 2022 · Vision and Language Navigation, Visual Navigation · Code Available 2
SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning · Jun 16, 2022 · Automatic Speech Recognition, Automatic Speech Recognition (ASR) · Code Available 2
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation · Feb 23, 2022 · Efficient Exploration, Navigate · Code Available 2
Task-Oriented Communications for Visual Navigation with Edge-Aerial Collaboration in Low Altitude Economy · Apr 25, 2025 · Visual Navigation · Code Available 1
EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training · Feb 26, 2025 · Mamba, Representation Learning · Code Available 1
Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation · Dec 9, 2024 · Object Localization, Vision and Language Navigation · Code Available 1
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts · Dec 7, 2024 · General Knowledge, Mixture-of-Experts · Code Available 1
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments · Oct 23, 2024 · Object, Visual Navigation · Code Available 1
SPIN: Spacecraft Imagery for Navigation · Jun 11, 2024 · Data Augmentation, Image Generation · Code Available 1
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models · Apr 4, 2024 · Spatial Reasoning, Visual Navigation · Code Available 1
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation · Mar 19, 2024 · Anomaly Detection, object-detection · Code Available 1
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training · Mar 12, 2024 · Self-Supervised Learning, Visual Navigation · Code Available 1
MemoNav: Working Memory Model for Visual Navigation · Feb 29, 2024 · Decision Making, Graph Attention · Code Available 1
An Interactive Navigation Method with Effect-oriented Affordance · Jan 1, 2024 · Navigate, Visual Navigation · Code Available 1
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon · Sep 28, 2023 · Pose Estimation, Visual Navigation · Code Available 1
CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes · Sep 21, 2023 · counterfactual, Visual Navigation · Code Available 1
Multi3DRefer: Grounding Text Description to Multiple 3D Objects · Sep 11, 2023 · 3D visual grounding, Contrastive Learning · Code Available 1
Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language · Aug 17, 2023 · Language Modeling, Language Modelling · Code Available 1
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents · Aug 14, 2023 · Instruction Following, Visual Navigation · Code Available 1
Learning Navigational Visual Representations with Semantic Map Supervision · Jul 23, 2023 · Representation Learning, Self-Supervised Learning · Code Available 1
Online Self-Supervised Thermal Water Segmentation for Aerial Vehicles · Jul 18, 2023 · Segmentation, Visual Navigation · Code Available 1
The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes · Jun 29, 2023 · 6D Pose Estimation using RGBD, Optical Flow Estimation · Code Available 1
HabiCrowd: A High Performance Simulator for Crowd-Aware Visual Navigation · Jun 20, 2023 · Collision Avoidance, Computational Efficiency · Code Available 1
Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear · Jun 1, 2023 · Multi-Task Learning, Visual Navigation · Code Available 1
Renderable Neural Radiance Map for Visual Navigation · Mar 1, 2023 · Descriptive, Visual Localization · Code Available 1
Offline Reinforcement Learning for Visual Navigation · Dec 16, 2022 · Navigate, Offline RL · Code Available 1
Last-Mile Embodied Visual Navigation · Nov 21, 2022 · Visual Navigation · Code Available 1
Towards Versatile Embodied Navigation · Oct 30, 2022 · Decision Making, Vision-Language Navigation · Code Available 1
ViNL: Visual Navigation and Locomotion Over Obstacles · Oct 26, 2022 · Navigate, Visual Navigation · Code Available 1
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation · Aug 24, 2022 · Language Modeling, Language Modelling · Code Available 1
What do navigation agents learn about their environment? · Jun 17, 2022 · Visual Navigation · Code Available 1
Zero-shot object goal visual navigation · Jun 15, 2022 · Knowledge Graphs, Object · Code Available 1
A Visual Navigation Perspective for Category-Level Object Pose Estimation · Mar 25, 2022 · Imitation Learning, Pose Estimation · Code Available 1
Benchmarking Visual Localization for Autonomous Navigation · Mar 24, 2022 · Autonomous Navigation, Benchmarking · Code Available 1