| Social Conjuring: Multi-User Runtime Collaboration with AI in Building Virtual 3D Worlds | Sep 30, 2024 | Spatial Reasoning | —Unverified | 0 |
| Spatial Reasoning and Planning for Deep Embodied Agents | Sep 28, 2024 | Autonomous DrivingMinecraft | —Unverified | 0 |
| DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Sep 25, 2024 | In-Context LearningNovel Concepts | CodeCode Available | 0 |
| Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models | Sep 23, 2024 | Common Sense ReasoningSpatial Reasoning | —Unverified | 0 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Sep 15, 2024 | Spatial Reasoning | —Unverified | 0 |
| ActionFlow: Equivariant, Accurate, and Efficient Policies with Spatially Symmetric Flow Matching | Sep 6, 2024 | Action GenerationSpatial Reasoning | —Unverified | 0 |
| Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Sep 4, 2024 | Continual LearningNavigate | —Unverified | 0 |
| Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games | Aug 28, 2024 | Atari GamesBenchmarking | —Unverified | 0 |