| 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning, 3D visual grounding | —Unverified | 0 |
| SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Dec 5, 2024 | 3D visual grounding, Object Localization | —Unverified | 0 |
| 3D Scene Graph Guided Vision-Language Pre-training | Nov 27, 2024 | 3D dense captioning, 3D visual grounding | —Unverified | 0 |
| LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Nov 7, 2024 | 3D visual grounding, Autonomous Driving | —Unverified | 0 |
| Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding | Nov 5, 2024 | 3D visual grounding, Visual Grounding | —Unverified | 0 |
| Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding | Oct 21, 2024 | 3D visual grounding, Object | —Unverified | 0 |
| Bayesian Self-Training for Semi-Supervised 3D Segmentation | Sep 12, 2024 | 3D Instance Segmentation, 3D Semantic Segmentation | —Unverified | 0 |
| Task-oriented Sequential Grounding in 3D Scenes | Aug 7, 2024 | 3D visual grounding, Visual Grounding | —Unverified | 0 |
| PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding | Jul 19, 2024 | 3D visual grounding, Attribute | —Unverified | 0 |
| ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities | Jul 1, 2024 | 3D visual grounding, Language Modeling | —Unverified | 0 |