| Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention | May 23, 2025 | 3D Generation3D geometry | CodeCode Available | 5 |
| Wonder3D: Single Image to 3D using Cross-Domain Diffusion | Oct 23, 2023 | 3D geometryImage to 3D | CodeCode Available | 5 |
| You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale | Dec 9, 2024 | 3D Generation3D geometry | CodeCode Available | 4 |
| GeoCalib: Learning Single-image Calibration with Geometric Optimization | Sep 10, 2024 | 3D geometryVisual Localization | CodeCode Available | 4 |
| CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets | May 30, 2024 | 2k3D geometry | CodeCode Available | 4 |
| CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner | May 23, 2024 | 3D Generation3D geometry | CodeCode Available | 4 |
| GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Mar 18, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 4 |
| NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors | Dec 6, 2022 | 3D Generation3D geometry | CodeCode Available | 4 |
| LiftFeat: 3D Geometry-Aware Local Feature Matching | May 6, 2025 | 3D geometryDepth Estimation | CodeCode Available | 3 |
| TAPIP3D: Tracking Any Point in Persistent 3D Geometry | Apr 20, 2025 | 3D geometryDepth And Camera Motion | CodeCode Available | 3 |
| DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation | Apr 7, 2025 | 3D geometryRGBD Semantic Segmentation | CodeCode Available | 3 |
| TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane Networks | Mar 19, 2025 | 3D geometry | CodeCode Available | 3 |
| 3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation | Oct 24, 2024 | 3D Generation3D geometry | CodeCode Available | 3 |
| Towards Realistic Scene Generation with LiDAR Diffusion Models | Mar 31, 2024 | 3D geometryImage Generation | CodeCode Available | 3 |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Feb 27, 2024 | 3D geometry3D Object Captioning | CodeCode Available | 3 |
| MagicDrive: Street View Generation with Diverse 3D Geometry Control | Oct 4, 2023 | 3D geometry3D Object Detection | CodeCode Available | 3 |
| Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior | Mar 24, 2023 | 3D geometryText to 3D | CodeCode Available | 3 |
| Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models | Mar 21, 2023 | 3D geometryText to 3D | CodeCode Available | 3 |
| VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Feb 23, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 3 |
| AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars | May 17, 2022 | 3D geometryLanguage Modelling | CodeCode Available | 3 |
| Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion | Jul 8, 2025 | 3D geometryDomain Generalization | CodeCode Available | 2 |
| GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | May 28, 2025 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos | May 19, 2025 | 3D geometryCamera Pose Estimation | CodeCode Available | 2 |
| Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | May 16, 2025 | 3D geometryNavigate | CodeCode Available | 2 |
| GaussRender: Learning 3D Occupancy with Gaussian Rendering | Feb 7, 2025 | 3D geometryAutonomous Vehicles | CodeCode Available | 2 |
| VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Oct 17, 2024 | 3D geometry3D visual grounding | CodeCode Available | 2 |
| Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Oct 14, 2024 | 3D geometryDenoising | CodeCode Available | 2 |
| GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Sep 24, 2024 | 3D geometry3DGS | CodeCode Available | 2 |
| Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction | Sep 12, 2024 | 3D geometry | CodeCode Available | 2 |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Jul 15, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Jun 5, 2024 | 3D geometryPoint Cloud Registration | CodeCode Available | 2 |
| Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Apr 11, 2024 | 3D geometryparameter-efficient fine-tuning | CodeCode Available | 2 |
| Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction | Mar 28, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| Volumetric Environment Representation for Vision-Language Navigation | Mar 21, 2024 | 3D geometryMulti-Task Learning | CodeCode Available | 2 |
| ThermoNeRF: Joint RGB and Thermal Novel View Synthesis for Building Facades using Multimodal Neural Radiance Fields | Mar 18, 2024 | 3D geometryImage Generation | CodeCode Available | 2 |
| MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Mar 13, 2024 | 3D geometryAutonomous Vehicles | CodeCode Available | 2 |
| Retrieval-Augmented Score Distillation for Text-to-3D Generation | Feb 5, 2024 | 3D Generation3D geometry | CodeCode Available | 2 |
| Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Dec 29, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 |
| Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation | Jul 27, 2023 | 3D geometryFew-Shot Learning | CodeCode Available | 2 |
| NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection | Jul 27, 2023 | 3D geometry3D Object Detection | CodeCode Available | 2 |
| End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Jun 16, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | May 30, 2023 | 3D Generation3D geometry | CodeCode Available | 2 |
| gRNAde: Geometric Deep Learning for 3D RNA inverse design | May 24, 2023 | 3D geometryDeep Learning | CodeCode Available | 2 |
| DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | May 10, 2023 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving | Apr 27, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Apr 19, 2023 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion | Feb 27, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 2 |
| ConceptFusion: Open-set Multimodal 3D Mapping | Feb 14, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometryAction Generation | CodeCode Available | 2 |