| WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language | Apr 12, 2023 | 3D visual groundingAutonomous Driving | CodeCode Available | 0 |
| ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding | Mar 23, 2023 | 3D visual groundingVisual Grounding | CodeCode Available | 0 |
| ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding | Jan 1, 2023 | 3D visual groundingVisual Grounding | —Unverified | 0 |
| UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | Dec 1, 2022 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | Dec 2, 2021 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding | Aug 5, 2021 | 3D visual groundingRelation | —Unverified | 0 |
| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual groundingLanguage Modeling | —Unverified | 0 |