| Beyond Segmentation: Road Network Generation with Multi-Modal LLMs | Oct 15, 2023 | Autonomous NavigationLanguage Modeling | —Unverified | 0 | 0 |
| Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts | Mar 10, 2025 | Reasoning SegmentationSegmentation | —Unverified | 0 | 0 |
| Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | Jun 9, 2025 | Large Language ModelMultimodal Reasoning | —Unverified | 0 | 0 |
| Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA | Mar 13, 2025 | Dataset GenerationReasoning Segmentation | —Unverified | 0 | 0 |
| MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models | Jun 12, 2025 | Image SegmentationMedical Diagnosis | —Unverified | 0 | 0 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation | Mar 18, 2025 | Reasoning SegmentationVideo Editing | —Unverified | 0 | 0 |
| Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level | Nov 15, 2024 | Benchmarkingcounterfactual | —Unverified | 0 | 0 |
| MediSee: Reasoning-based Pixel-level Perception in Medical Images | Apr 15, 2025 | Logical ReasoningReasoning Segmentation | —Unverified | 0 | 0 |
| Multimodal 3D Reasoning Segmentation with Complex Scenes | Nov 21, 2024 | Reasoning SegmentationScene Understanding | —Unverified | 0 | 0 |