SOTAVerified

3D visual grounding

Papers

Showing 2650 of 82 papers

TitleStatusHype
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual GroundingCode1
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive SelectionCode1
Multi-View Transformer for 3D Visual GroundingCode1
SAT: 2D Semantics Assisted Training for 3D Visual GroundingCode1
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD ImagesCode1
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual ReferringCode1
ViewSRD: 3D Visual Grounding via Structured Multi-View Decomposition0
A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding0
SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding0
GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding0
Unified Representation Space for 3D Visual Grounding0
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Zero-Shot 3D Visual Grounding from Vision-Language Models0
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding0
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual GroundingCode0
Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference UnderstandingCode0
DSM: Building A Diverse Semantic Map for 3D Visual Grounding0
ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning0
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving0
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding0
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring0
ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding0
Ges3ViG : Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference UnderstandingCode0
Beyond Human Perception: Understanding Multi-Object World from Monocular ViewCode0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.