SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 150 of 6092 papers

TitleStatusHype
GenAD: Generalized Predictive Model for Autonomous DrivingCode7
NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and BenchmarkingCode7
Vista: A Generalizable Driving World Model with High Fidelity and Versatile ControllabilityCode7
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
The Role of World Models in Shaping Autonomous Driving: A Comprehensive SurveyCode5
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingCode5
Awesome Multi-modal Object TrackingCode5
Neural Fields in Robotics: A SurveyCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric PerspectivesCode5
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth EstimationCode5
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic PlanningCode5
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal TransformersCode4
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous DrivingCode4
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense PredictionCode4
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy PredictionCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
UniScene: Unified Occupancy-centric Driving Scene GenerationCode4
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion PerspectiveCode4
SparseDrive: End-to-End Autonomous Driving via Sparse Scene RepresentationCode4
Pseudo-Simulation for Autonomous DrivingCode4
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous DrivingCode4
Segment and Track AnythingCode4
A Survey on Vision-Language-Action Models for Autonomous DrivingCode4
End-to-end Autonomous Driving: Challenges and FrontiersCode4
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy PredictionCode4
S^3Gaussian: Self-Supervised Street Gaussians for Autonomous DrivingCode4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View RepresentationCode4
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelCode4
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous DrivingCode4
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous DrivingCode4
Is Sora a World Simulator? A Comprehensive Survey on General World Models and BeyondCode4
Multimodal Chain-of-Thought Reasoning: A Comprehensive SurveyCode4
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual ReasoningCode4
End-to-End Autonomous Driving through V2X CooperationCode4
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and RecipeCode4
Diffusion-Based Planning for Autonomous Driving with Flexible GuidanceCode4
Planning-oriented Autonomous DrivingCode4
3D Scene Generation: A SurveyCode4
Geometric-aware Pretraining for Vision-centric 3D Object DetectionCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Generalized Trajectory Scoring for End-to-end Multimodal PlanningCode3
Generative AI for Autonomous Driving: Frontiers and OpportunitiesCode3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
GenAD: Generative End-to-End Autonomous DrivingCode3
Epona: Autoregressive Diffusion World Model for Autonomous DrivingCode3
End-to-End Driving with Online Trajectory Evaluation via BEV World ModelCode3
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous DrivingCode3
Show:102550
← PrevPage 1 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified