SOTAVerified

Embodied Question Answering

Papers

Showing 140 of 40 papers

TitleStatusHype
MEIA: Multimodal Embodied Perception and Interaction in Unknown EnvironmentsCode5
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video UnderstandingCode4
Towards Learning a Generalist Model for Embodied NavigationCode2
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual PreferencesCode2
VideoNavQA: Bridging the Gap between Visual and Embodied Question AnsweringCode1
CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City SpaceCode1
Map-based Modular Approach for Zero-shot Embodied Question AnsweringCode1
Synthesizing Event-centric Knowledge Graphs of Daily Activities Using Virtual SpaceCode1
AllenAct: A Framework for Embodied AI ResearchCode1
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task PlanningCode0
EQA-RM: A Generative Embodied Reward Model with Test-time ScalingCode0
Blindfold Baselines for Embodied QACode0
ToSA: Token Merging with Spatial AwarenessCode0
Multi-Target Embodied Question AnsweringCode0
Neural Modular Control for Embodied Question AnsweringCode0
Embodied Question AnsweringCode0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering0
"Is This It?": Towards Ecologically Valid Benchmarks for Situated Collaboration0
LLM as A Robotic Brain: Unifying Egocentric Memory and Control0
Counterfactual Vision-and-Language Navigation: Unravelling the Unseen0
A Survey of Embodied AI: From Simulators to Research Tasks0
Memory-Centric Embodied Question Answer0
Multi-Agent Embodied Question Answering in Interactive Environments0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries0
OpenEQA: Embodied Question Answering in the Era of Foundation Models0
Revisiting EmbodiedQA: A Simple Baseline and Beyond0
SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering0
Is the House Ready For Sleeptime? Generating and Evaluating Situational Queries for Embodied Question Answering0
Vector Quantized Feature Fields for Fast 3D Semantic Lifting0
TANGO: Training-free Embodied AI Agents for Open-world Tasks0
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering0
Cross-Task Knowledge Transfer for Visually-Grounded Navigation0
EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering0
Embodied Multimodal Multitask Learning0
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception0
Multi-LLM QA with Embodied Exploration0
Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering0
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation0
Explore until Confident: Efficient Exploration for Embodied Question Answering0
General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting0
Show:102550

No leaderboard results yet.