| Multi-branch Collaborative Learning Network for 3D Visual Grounding | Jul 7, 2024 | 3D visual groundingReferring Expression | CodeCode Available | 1 | 5 |
| Ges3ViG : Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Jan 1, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 | 5 |
| Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Apr 13, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 | 5 |
| AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding | May 7, 2025 | 3D visual groundingGraph Attention | CodeCode Available | 0 | 5 |
| Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency | Jan 1, 2024 | 3D visual groundingRelation | CodeCode Available | 0 | 5 |
| SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention | Mar 13, 2024 | 3D visual groundingcross-modal alignment | CodeCode Available | 0 | 5 |
| Beyond Human Perception: Understanding Multi-Object World from Monocular View | Jan 1, 2025 | 3D visual groundingDenoising | CodeCode Available | 0 | 5 |
| ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding | Mar 23, 2023 | 3D visual groundingVisual Grounding | CodeCode Available | 0 | 5 |
| Multi-Attribute Interactions Matter for 3D Visual Grounding | Jan 1, 2024 | 3D visual groundingAttribute | CodeCode Available | 0 | 5 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Apr 17, 2024 | 3D dense captioning3D visual grounding | CodeCode Available | 0 | 5 |