| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 | 0 |
| Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model | Aug 1, 2024 | EgoSchemaLanguage Modeling | —Unverified | 0 | 0 |
| PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly | Jun 10, 2025 | Question AnsweringScene Understanding | —Unverified | 0 | 0 |
| PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Feb 12, 2024 | Instruction FollowingLogical Reasoning | —Unverified | 0 | 0 |
| Pix2Scene: Learning Implicit 3D Representations from Images | May 1, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications | Aug 27, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| Preliminary Explorations with GPT-4o(mni) Native Image Generation | May 6, 2025 | Image Generationmultimodal generation | —Unverified | 0 | 0 |
| Proceedings of the 2nd Symposium on Problem-solving, Creativity and Spatial Reasoning in Cognitive Systems, ProSocrates 2017 | Jan 14, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image SegmentationLanguage Modeling | —Unverified | 0 | 0 |
| Quantifying Geospatial in the Common Crawl Corpus | Jun 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |