| MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Feb 1, 2024 | Embodied Question AnsweringLanguage Modeling | CodeCode Available | 5 |
| Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding | Jan 14, 2025 | Embodied Question AnsweringHallucination | CodeCode Available | 4 |
| LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences | Dec 2, 2024 | Embodied Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Towards Learning a Generalist Model for Embodied Navigation | Dec 4, 2023 | 3D Question Answering (3D-QA)Embodied Question Answering | CodeCode Available | 2 |
| CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space | Feb 18, 2025 | Embodied Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Map-based Modular Approach for Zero-shot Embodied Question Answering | May 26, 2024 | Embodied Question AnsweringNavigate | CodeCode Available | 1 |
| Synthesizing Event-centric Knowledge Graphs of Daily Activities Using Virtual Space | Jul 30, 2023 | Decision MakingEmbodied Question Answering | CodeCode Available | 1 |
| AllenAct: A Framework for Embodied AI Research | Aug 28, 2020 | Deep Reinforcement LearningEmbodied Question Answering | CodeCode Available | 1 |
| VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering | Aug 14, 2019 | Embodied Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering | Jul 17, 2025 | Embodied Question AnsweringQuestion Answering | —Unverified | 0 |