| BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers | Mar 31, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 4 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection, 3D Object Detection | Code Available | 3 |
| MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception | Nov 19, 2022 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 2 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometry, Autonomous Driving | Code Available | 2 |
| ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Jul 15, 2022 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 2 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Aug 13, 2020 | Autonomous Vehicles, Bird's-Eye View Semantic Segmentation | Code Available | 2 |
| SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Feb 4, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Semi-Supervised Learning for Visual Bird's Eye View Semantic Segmentation | Aug 28, 2023 | Autonomous Vehicles, Bird's-Eye View Semantic Segmentation | Code Available | 1 |
| LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation | Jun 27, 2022 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 1 |