SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without a human driver.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 301-350 of 6092 papers

Title | Status | Hype
A Survey on Multimodal Large Language Models for Autonomous Driving | Code | 2
GPD-1: Generative Pre-training for Driving | Code | 2
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Code | 2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Code | 2
A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D camera | Code | 2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Code | 2
Fully Sparse 3D Object Detection | Code | 2
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Code | 2
GAIA-1: A Generative World Model for Autonomous Driving | Code | 2
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Code | 2
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 2
Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | Code | 2
CW-ERM: Improving Autonomous Driving Planning with Closed-loop Weighted Empirical Risk Minimization | Code | 2
HybridNets: End-to-End Perception Network | Code | 2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2
Controllable 3D Outdoor Scene Generation via Scene Graphs | Code | 2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters | Code | 2
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Code | 2
A Review of Safe Reinforcement Learning: Methods, Theory and Applications | Code | 2
Autonomous Driving with Spiking Neural Networks | Code | 2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving | Code | 2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Code | 2
Label Efficient Visual Abstractions for Autonomous Driving | Code | 2
LandMarkSystem Technical Report | Code | 2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Code | 2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis | Code | 2
A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Code | 2
3D Object Detection for Autonomous Driving: A Comprehensive Survey | Code | 2
Large Trajectory Models are Scalable Motion Predictors and Planners | Code | 2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation | Code | 2
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving | Code | 2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception | Code | 2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Code | 2
Advances in 4D Generation: A Survey | Code | 2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Code | 2
LightLoc: Learning Outdoor LiDAR Localization at Light Speed | Code | 2
Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Code | 2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation | Code | 2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Code | 2
Enhancing Vectorized Map Perception with Historical Rasterized Maps | Code | 2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors | Code | 2
End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Code | 2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Code | 2
Exploring the Causality of End-to-End Autonomous Driving | Code | 2
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Code | 2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Code | 2
Benchmarking the Robustness of LiDAR Semantic Segmentation Models | Code | 2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Code | 2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 2
Page 7 of 122

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ReasonNet | Driving Score | 79.95 | | Unverified
2 | InterFuser | Driving Score | 76.18 | | Unverified
3 | TCP | Driving Score | 75.14 | | Unverified
4 | TF++ WP | Driving Score | 66.32 | | Unverified
5 | Learning From All Vehicles (LAV) | Driving Score | 61.85 | | Unverified
6 | TransFuser | Driving Score | 61.18 | | Unverified
7 | TransFuser (Reproduced) | Driving Score | 55.04 | | Unverified
8 | TCP (Reproduced) | Driving Score | 47.91 | | Unverified
9 | Latent TransFuser | Driving Score | 45.2 | | Unverified
10 | GRIAD | Driving Score | 36.79 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Geometric Fusion | RC | 69.17 | | Unverified
2 | TransFuser | RC | 56.36 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Geometric Fusion | RC | 86.91 | | Unverified
2 | TransFuser | RC | 78.41 | | Unverified