| SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space | Jul 5, 2023 | Natural Language InferenceNegation | CodeCode Available | 0 |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Jun 7, 2024 | Spatial Reasoning | CodeCode Available | 0 |
| SPaRC: A Spatial Pathfinding Reasoning Challenge | May 22, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 |
| Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Nov 27, 2024 | Autonomous NavigationObject Recognition | CodeCode Available | 0 |
| MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation | Jan 7, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models | Dec 31, 2024 | Multiple-choiceQuestion Answering | CodeCode Available | 0 |
| EgoHumans: An Egocentric 3D Multi-Human Benchmark | May 25, 2023 | 3D Pose EstimationHuman Detection | CodeCode Available | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial ReasoningVision and Language Navigation | CodeCode Available | 0 |