SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human intervention.

Many state-of-the-art results can be found on more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 1-25 of 6092 papers

| Title | Status | Hype |
|---|---|---|
| NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Code | 7 |
| Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Code | 7 |
| GenAD: Generalized Predictive Model for Autonomous Driving | Code | 7 |
| AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Code | 6 |
| Getting SMARTER for Motion Planning in Autonomous Driving Systems | Code | 5 |
| The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey | Code | 5 |
| Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Code | 5 |
| DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving | Code | 5 |
| Neural Fields in Robotics: A Survey | Code | 5 |
| Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Code | 5 |
| PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Code | 5 |
| Awesome Multi-modal Object Tracking | Code | 5 |
| VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Code | 5 |
| Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting | Code | 5 |
| A Survey on Vision-Language-Action Models for Autonomous Driving | Code | 4 |
| Pseudo-Simulation for Autonomous Driving | Code | 4 |
| 3D Scene Generation: A Survey | Code | 4 |
| OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model | Code | 4 |
| Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Code | 4 |
| Diffusion-Based Planning for Autonomous Driving with Flexible Guidance | Code | 4 |
| OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Code | 4 |
| UniScene: Unified Occupancy-centric Driving Scene Generation | Code | 4 |
| GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction | Code | 4 |
| Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving | Code | 4 |
| UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Code | 4 |

Benchmark Results

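| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ReasonNet | Driving Score | 79.95 | | Unverified |
| 2 | InterFuser | Driving Score | 76.18 | | Unverified |
| 3 | TCP | Driving Score | 75.14 | | Unverified |
| 4 | TF++ WP | Driving Score | 66.32 | | Unverified |
| 5 | Learning From All Vehicles (LAV) | Driving Score | 61.85 | | Unverified |
| 6 | TransFuser | Driving Score | 61.18 | | Unverified |
| 7 | TransFuser (Reproduced) | Driving Score | 55.04 | | Unverified |
| 8 | TCP (Reproduced) | Driving Score | 47.91 | | Unverified |
| 9 | Latent TransFuser | Driving Score | 45.2 | | Unverified |
| 10 | GRIAD | Driving Score | 36.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Geometric Fusion | RC | 69.17 | | Unverified |
| 2 | TransFuser | RC | 56.36 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Geometric Fusion | RC | 86.91 | | Unverified |
| 2 | TransFuser | RC | 78.41 | | Unverified |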