| Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models | Dec 23, 2024 | Relational Reasoning, Spatial Reasoning | Unverified | 0 |
| Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning | Dec 21, 2024 | Spatial Reasoning | Unverified | 0 |
| Investigating Relational State Abstraction in Collaborative MARL | Dec 19, 2024 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 0 |
| Mathematical Definition and Systematization of Puzzle Rules | Dec 18, 2024 | Game Design, Spatial Reasoning | Unverified | 0 |
| SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models | Dec 17, 2024 | Logical Reasoning, Spatial Reasoning | Code Available | 0 |
| A dual contrastive framework | Dec 13, 2024 | Contrastive Learning, Decoder | Unverified | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | Dec 12, 2024 | Geometry Problem Solving, In-Context Learning | Unverified | 0 |
| VisionArena: 230K Real World User-VLM Conversations with Preference Labels | Dec 11, 2024 | Chatbot, Spatial Reasoning | Unverified | 0 |
| 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark | Dec 10, 2024 | Autonomous Navigation, Spatial Reasoning | Unverified | 0 |
| VideoSAVi: Self-Aligned Video Language Models without Human Supervision | Dec 1, 2024 | EgoSchema, MVBench | Unverified | 0 |