| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Feb 27, 2024 | 3D geometry, 3D Object Captioning | Code Available | 3 |
| Uni3D: Exploring Unified 3D Representation at Scale | Oct 10, 2023 | 3D Object Classification, Retrieval | Code Available | 2 |
| OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding | May 18, 2023 | 3D Classification, 3D Shape Representation | Code Available | 2 |
| ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding | May 14, 2023 | 3D Classification, 3D Point Cloud Classification | Code Available | 2 |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Dec 10, 2022 | 3D Architecture, 3D Classification | Code Available | 2 |
| PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning | Nov 21, 2022 | 3D Classification, 3D Object Detection | Code Available | 2 |
| OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned Images | Apr 25, 2024 | Representation Learning, Transfer Learning | Code Available | 1 |
| Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training | Nov 3, 2023 | Contrastive Learning, Retrieval | Code Available | 1 |
| ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights | Aug 20, 2023 | 3D Classification, Question Answering | Code Available | 1 |
| Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation | Aug 6, 2023 | 3D Classification, 3D Part Segmentation | Code Available | 1 |
| CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training | Oct 3, 2022 | 3D Point Cloud Classification, Contrastive Learning | Code Available | 1 |
| PointCLIP: Point Cloud Understanding by CLIP | Dec 4, 2021 | 3D Open-Vocabulary Instance Segmentation, Few-Shot Learning | Code Available | 1 |
| Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding | Jan 16, 2025 | 3D Classification, Zero-shot 3D classification | Unverified | 0 |
| MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding | May 28, 2024 | 3D Classification, 3D Object Recognition | Unverified | 0 |
| TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding | Feb 28, 2024 | 3D Shape Representation, Representation Learning | Unverified | 0 |
| MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition | Nov 30, 2023 | 3D Classification, 3D Shape Recognition | Unverified | 0 |