| MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation | Jan 7, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Jan 7, 2025 | 3D Object DetectionComputational Efficiency | —Unverified | 0 |
| SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language | Jan 1, 2025 | Spatial Reasoning | —Unverified | 0 |
| SKE-Layout: Spatial Knowledge Enhanced Layout Generation with LLMs | Jan 1, 2025 | Contrastive LearningImage Generation | —Unverified | 0 |
| Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding | Jan 1, 2025 | 3DGSLarge Language Model | —Unverified | 0 |
| R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner | Jan 1, 2025 | Action GenerationGame of Chess | —Unverified | 0 |
| Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Mutimodal Models | Jan 1, 2025 | AttributeDiagnostic | —Unverified | 0 |
| MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models | Dec 31, 2024 | Multiple-choiceQuestion Answering | CodeCode Available | 0 |
| CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | Dec 27, 2024 | Spatial Reasoning | —Unverified | 0 |
| Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Dec 24, 2024 | MMESensitivity | CodeCode Available | 0 |