| PointCLIP: Point Cloud Understanding by CLIP | Dec 4, 2021 | 3D Open-Vocabulary Instance Segmentation, Few-Shot Learning | Code Available | 1 | 5 |
| Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation | Aug 6, 2023 | 3D Classification, 3D Part Segmentation | Code Available | 1 | 5 |
| Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding | Jan 16, 2025 | 3D Classification, Zero-shot 3D classification | Unverified | 0 | 0 |
| TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding | Feb 28, 2024 | 3D Shape Representation, Representation Learning | Unverified | 0 | 0 |
| MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition | Nov 30, 2023 | 3D Classification, 3D Shape Recognition | Unverified | 0 | 0 |
| MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding | May 28, 2024 | 3D Classification, 3D Object Recognition | Unverified | 0 | 0 |