SOTAVerified

3D Object Captioning

3D object captioning involves generating a natural language description of an object, given its point cloud representation.

Papers

Showing 1-7 of 7 papers

| Title | Status | Hype |
| --- | --- | --- |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Code | 3 |
| View Selection for 3D Captioning via Diffusion Ranking | Code | 3 |
| 3D-LLM: Injecting the 3D World into Large Language Models | Code | 3 |
| MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors | Code | 2 |
| PointLLM: Empowering Large Language Models to Understand Point Clouds | Code | 2 |
| PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models | | 0 |
| MARVEL-40M+: Multi-Level Visual Elaboration for High-Fidelity Text-to-3D Content Creation | Code | 0 |

Benchmark Results

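| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MiniGPT-3D | GPT-4 | 57.06 | | Unverified |
| 2 | ShapeLLM-13B | GPT-4 | 48.94 | | Unverified |
| 3 | PointLLM-13B V1.2 | GPT-4 | 48.15 | | Unverified |
| 4 | ShapeLLM-7B | GPT-4 | 46.92 | | Unverified |
| 5 | PointLLM-7B V1.2 | GPT-4 | 44.85 | | Unverified |
| 6 | 3D-LLM | GPT-4 | 33.42 | | Unverified |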