| MagicDrive: Street View Generation with Diverse 3D Geometry Control | Oct 4, 2023 | 3D geometry, 3D Object Detection | Code Available | 3 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection, 3D Object Detection | Code Available | 3 |
| Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks | Jul 18, 2024 | Autonomous Driving, BEV Segmentation | Code Available | 2 |
| SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Mar 29, 2024 | 3D Object Detection, 3D Object Detection From Monocular Images | Code Available | 2 |
| MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Mar 13, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Nov 5, 2024 | 3D Object Detection, Autonomous Vehicles | Code Available | 1 |
| Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance | Aug 8, 2024 | BEV Segmentation, Data Augmentation | Code Available | 1 |
| Learning Ego 3D Representation as Ray Tracing | Jun 8, 2022 | 3D Object Detection, BEV Segmentation | Code Available | 1 |
| DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | May 3, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 1 |