| Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion | Jul 8, 2025 | 3D geometry, Domain Generalization | Code Available | 2 |
| GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | May 28, 2025 | 3D geometry, Autonomous Driving | Code Available | 2 |
| Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos | May 19, 2025 | 3D geometry, Camera Pose Estimation | Code Available | 2 |
| Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | May 16, 2025 | 3D geometry, Navigate | Code Available | 2 |
| GaussRender: Learning 3D Occupancy with Gaussian Rendering | Feb 7, 2025 | 3D geometry, Autonomous Vehicles | Code Available | 2 |
| VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Oct 17, 2024 | 3D geometry, 3D visual grounding | Code Available | 2 |
| Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Oct 14, 2024 | 3D geometry, Denoising | Code Available | 2 |
| GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Sep 24, 2024 | 3D geometry, 3DGS | Code Available | 2 |
| Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction | Sep 12, 2024 | 3D geometry | Code Available | 2 |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Jul 15, 2024 | 3D geometry, 3D Reconstruction | Code Available | 2 |