| Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding | Jul 18, 2023 | 3D visual groundingObject | CodeCode Available | 1 | 5 |
| Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis | Mar 28, 2025 | 3D Question Answering (3D-QA)3D visual grounding | CodeCode Available | 1 | 5 |
| ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance | Mar 29, 2023 | 3D visual groundingVisual Grounding | CodeCode Available | 1 | 5 |
| Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding | Nov 26, 2023 | 3D visual groundingObject | CodeCode Available | 1 | 5 |
| Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding | Nov 25, 2022 | 3D visual groundingKnowledge Distillation | CodeCode Available | 1 | 5 |
| MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding | Mar 5, 2024 | 3D visual groundingDecision Making | CodeCode Available | 1 | 5 |
| AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding | May 7, 2025 | 3D visual groundingGraph Attention | CodeCode Available | 0 | 5 |
| Ges3ViG : Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Jan 1, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 | 5 |
| Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Apr 13, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 | 5 |
| SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention | Mar 13, 2024 | 3D visual groundingcross-modal alignment | CodeCode Available | 0 | 5 |
| Multi-Attribute Interactions Matter for 3D Visual Grounding | Jan 1, 2024 | 3D visual groundingAttribute | CodeCode Available | 0 | 5 |
| Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency | Jan 1, 2024 | 3D visual groundingRelation | CodeCode Available | 0 | 5 |
| Beyond Human Perception: Understanding Multi-Object World from Monocular View | Jan 1, 2025 | 3D visual groundingDenoising | CodeCode Available | 0 | 5 |
| ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding | Mar 23, 2023 | 3D visual groundingVisual Grounding | CodeCode Available | 0 | 5 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Apr 17, 2024 | 3D dense captioning3D visual grounding | CodeCode Available | 0 | 5 |
| WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language | Apr 12, 2023 | 3D visual groundingAutonomous Driving | CodeCode Available | 0 | 5 |
| Zero-Shot 3D Visual Grounding from Vision-Language Models | May 28, 2025 | 3D visual groundingVisual Grounding | —Unverified | 0 | 0 |
| 3D Scene Graph Guided Vision-Language Pre-training | Nov 27, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 | 0 |
| 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 | 0 |
| A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding | Jul 9, 2025 | 3D visual groundingAutonomous Navigation | —Unverified | 0 | 0 |
| AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring | Jan 16, 2025 | 3D visual groundingDecoder | —Unverified | 0 | 0 |
| Bayesian Self-Training for Semi-Supervised 3D Segmentation | Sep 12, 2024 | 3D Instance Segmentation3D Semantic Segmentation | —Unverified | 0 | 0 |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | Dec 2, 2021 | 3D dense captioning3D visual grounding | —Unverified | 0 | 0 |
| DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding | May 8, 2025 | 3D visual groundingcross-modal alignment | —Unverified | 0 | 0 |
| Data-Efficient 3D Visual Grounding via Order-Aware Referring | Mar 25, 2024 | 3D visual groundingObject | —Unverified | 0 | 0 |