| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual groundingLanguage Modeling | —Unverified | 0 |
| LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Nov 7, 2024 | 3D visual groundingAutonomous Driving | —Unverified | 0 |
| Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners | Apr 30, 2024 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving | Mar 28, 2025 | 3D visual groundingAutonomous Driving | —Unverified | 0 |
| PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding | Jul 19, 2024 | 3D visual groundingAttribute | —Unverified | 0 |
| ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding | Feb 26, 2025 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning | Mar 30, 2025 | 3D visual groundingFeature Splatting | —Unverified | 0 |
| SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | Jan 17, 2024 | 3D visual groundingScene Understanding | —Unverified | 0 |
| Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding | Sep 8, 2023 | 3D Instance Segmentation3D visual grounding | —Unverified | 0 |
| TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding | Aug 5, 2021 | 3D visual groundingRelation | —Unverified | 0 |
| Unified Representation Space for 3D Visual Grounding | Jun 17, 2025 | 3D visual groundingContrastive Learning | —Unverified | 0 |
| UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | Dec 1, 2022 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| Viewpoint-Aware Visual Grounding in 3D Scenes | Jan 1, 2024 | 3D visual groundingReferring Expression | —Unverified | 0 |
| ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding | Jan 1, 2023 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition | Jul 15, 2025 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding | Jan 2, 2025 | 3D visual groundingDiagnostic | —Unverified | 0 |
| Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment | Dec 15, 2023 | 3D visual groundingNatural Language Queries | —Unverified | 0 |
| 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding | Jul 25, 2023 | 3D visual groundingObject | —Unverified | 0 |
| SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Dec 5, 2024 | 3D visual groundingObject Localization | —Unverified | 0 |
| SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding | Jun 27, 2025 | 3D visual groundingNatural Language Queries | —Unverified | 0 |
| Talk to Parallel LiDARs: A Human-LiDAR Interaction Method Based on 3D Visual Grounding | May 24, 2024 | 3D visual groundingAutonomous Driving | —Unverified | 0 |
| Task-oriented Sequential Grounding in 3D Scenes | Aug 7, 2024 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention | Mar 13, 2024 | 3D visual groundingcross-modal alignment | CodeCode Available | 0 |
| Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Apr 13, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 |
| ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding | Mar 23, 2023 | 3D visual groundingVisual Grounding | CodeCode Available | 0 |