| BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers | Mar 31, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 4 | 5 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection3D Object Detection | CodeCode Available | 3 | 5 |
| Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Aug 13, 2020 | Autonomous VehiclesBird's-Eye View Semantic Segmentation | CodeCode Available | 2 | 5 |
| ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Jul 15, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 2 | 5 |
| Model-Based Imitation Learning for Urban Driving | Oct 14, 2022 | 3D geometryAutonomous Driving | CodeCode Available | 2 | 5 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception | Nov 19, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 2 | 5 |
| LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation | Jun 27, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 1 | 5 |
| FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras | Apr 21, 2021 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 1 | 5 |
| Cross-view Transformers for real-time Map-view Semantic Segmentation | May 5, 2022 | Bird's-Eye View Semantic SegmentationSegmentation | CodeCode Available | 1 | 5 |