| VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Oct 17, 2024 | 3D geometry3D visual grounding | CodeCode Available | 2 |
| Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Oct 14, 2024 | 3D geometryDenoising | CodeCode Available | 2 |
| GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Sep 24, 2024 | 3D geometry3DGS | CodeCode Available | 2 |
| Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction | Sep 12, 2024 | 3D geometry | CodeCode Available | 2 |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Jul 15, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Jun 5, 2024 | 3D geometryPoint Cloud Registration | CodeCode Available | 2 |
| Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding | Apr 11, 2024 | 3D geometryparameter-efficient fine-tuning | CodeCode Available | 2 |
| Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction | Mar 28, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| Volumetric Environment Representation for Vision-Language Navigation | Mar 21, 2024 | 3D geometryMulti-Task Learning | CodeCode Available | 2 |
| ThermoNeRF: Joint RGB and Thermal Novel View Synthesis for Building Facades using Multimodal Neural Radiance Fields | Mar 18, 2024 | 3D geometryImage Generation | CodeCode Available | 2 |
| MonoOcc: Digging into Monocular Semantic Occupancy Prediction | Mar 13, 2024 | 3D geometryAutonomous Vehicles | CodeCode Available | 2 |
| Retrieval-Augmented Score Distillation for Text-to-3D Generation | Feb 5, 2024 | 3D Generation3D geometry | CodeCode Available | 2 |
| Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Dec 29, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 |
| Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation | Jul 27, 2023 | 3D geometryFew-Shot Learning | CodeCode Available | 2 |
| NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection | Jul 27, 2023 | 3D geometry3D Object Detection | CodeCode Available | 2 |
| End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Jun 16, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | May 30, 2023 | 3D Generation3D geometry | CodeCode Available | 2 |
| gRNAde: Geometric Deep Learning for 3D RNA inverse design | May 24, 2023 | 3D geometryDeep Learning | CodeCode Available | 2 |
| DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | May 10, 2023 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving | Apr 27, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Apr 19, 2023 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion | Feb 27, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 2 |
| ConceptFusion: Open-set Multimodal 3D Mapping | Feb 14, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Learning Physically Realizable Skills for Online Packing of General 3D Shapes | Dec 5, 2022 | 3D geometryAction Generation | CodeCode Available | 2 |