| ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding | Jan 2, 2025 | 3D visual groundingDiagnostic | —Unverified | 0 |
| Ges3ViG : Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding | Jan 1, 2025 | 3D visual groundingData Augmentation | CodeCode Available | 0 |
| Beyond Human Perception: Understanding Multi-Object World from Monocular View | Jan 1, 2025 | 3D visual groundingDenoising | CodeCode Available | 0 |
| 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding | Dec 5, 2024 | 3D visual groundingObject Localization | —Unverified | 0 |
| 3D Scene Graph Guided Vision-Language Pre-training | Nov 27, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence | Nov 22, 2024 | 3D visual groundingVisual Grounding | CodeCode Available | 3 |
| Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems | Nov 21, 2024 | 3D visual groundingNegation | CodeCode Available | 1 |
| LidaRefer: Outdoor 3D Visual Grounding for Autonomous Driving with Transformers | Nov 7, 2024 | 3D visual groundingAutonomous Driving | —Unverified | 0 |
| Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding | Nov 5, 2024 | 3D visual groundingVisual Grounding | —Unverified | 0 |