| MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations | Jun 13, 2024 | 3D visual grounding, Attribute | Code Available | 4 | 5 |
| BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence | Nov 22, 2024 | 3D visual grounding, Visual Grounding | Code Available | 3 | 5 |
| Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Feb 14, 2025 | 3D Object Detection, 3D visual grounding | Code Available | 3 | 5 |
| A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions | Jun 9, 2024 | 3D visual grounding, Survey | Code Available | 3 | 5 |
| RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Jul 25, 2024 | 3D visual grounding, Image Segmentation | Code Available | 2 | 5 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual grounding, Language Modeling | Code Available | 2 | 5 |
| VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Oct 17, 2024 | 3D geometry, 3D visual grounding | Code Available | 2 | 5 |
| Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D Scans | May 23, 2023 | 3D Reconstruction, 3D visual grounding | Code Available | 1 | 5 |
| EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding | Sep 29, 2022 | 3D visual grounding, Object | Code Available | 1 | 5 |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Jan 1, 2023 | 3D dense captioning, 3D visual grounding | Code Available | 1 | 5 |