SOTAVerified

3D visual grounding

Papers

Showing 125 of 82 papers

TitleStatusHype
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language AnnotationsCode4
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
BIP3D: Bridging 2D Images and 3D Perception for Embodied IntelligenceCode3
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future DirectionsCode3
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual GroundingCode2
RefMask3D: Language-Guided Transformer for 3D Referring SegmentationCode2
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous DrivingCode1
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-AnalysisCode1
Evolving Symbolic 3D Visual Grounder with Weakly Supervised ReflectionCode1
Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction ProblemsCode1
Multi-branch Collaborative Learning Network for 3D Visual GroundingCode1
Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression ComprehensionCode1
MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual GroundingCode1
Mono3DVG: 3D Visual Grounding in Monocular ImagesCode1
Visual Programming for Zero-shot Open-Vocabulary 3D Visual GroundingCode1
CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud DataCode1
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual GroundingCode1
Multi3DRefer: Grounding Text Description to Multiple 3D ObjectsCode1
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual GroundingCode1
Cross3DVG: Cross-Dataset 3D Visual Grounding on Different RGB-D ScansCode1
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype GuidanceCode1
Context-Aware Alignment and Mutual Masking for 3D-Language Pre-TrainingCode1
Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual GroundingCode1
Learning Point-Language Hierarchical Alignment for 3D Visual GroundingCode1
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.