| UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | Dec 1, 2022 | 3D dense captioning3D visual grounding | —Unverified | 0 | 0 |
| Viewpoint-Aware Visual Grounding in 3D Scenes | Jan 1, 2024 | 3D visual groundingReferring Expression | —Unverified | 0 | 0 |
| ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding | Jan 1, 2023 | 3D visual groundingVisual Grounding | —Unverified | 0 | 0 |
| ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition | Jul 15, 2025 | 3D visual groundingVisual Grounding | —Unverified | 0 | 0 |
| ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding | Jan 2, 2025 | 3D visual groundingDiagnostic | —Unverified | 0 | 0 |
| Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment | Dec 15, 2023 | 3D visual groundingNatural Language Queries | —Unverified | 0 | 0 |
| 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding | Jul 25, 2023 | 3D visual groundingObject | —Unverified | 0 | 0 |