SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 150 of 6092 papers

TitleStatusHype
NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and BenchmarkingCode7
Vista: A Generalizable Driving World Model with High Fidelity and Versatile ControllabilityCode7
GenAD: Generalized Predictive Model for Autonomous DrivingCode7
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
The Role of World Models in Shaping Autonomous Driving: A Comprehensive SurveyCode5
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric PerspectivesCode5
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingCode5
Neural Fields in Robotics: A SurveyCode5
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth EstimationCode5
Awesome Multi-modal Object TrackingCode5
VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic PlanningCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
A Survey on Vision-Language-Action Models for Autonomous DrivingCode4
Pseudo-Simulation for Autonomous DrivingCode4
3D Scene Generation: A SurveyCode4
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelCode4
Multimodal Chain-of-Thought Reasoning: A Comprehensive SurveyCode4
Diffusion-Based Planning for Autonomous Driving with Flexible GuidanceCode4
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous DrivingCode4
UniScene: Unified Occupancy-centric Driving Scene GenerationCode4
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy PredictionCode4
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous DrivingCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous DrivingCode4
SparseDrive: End-to-End Autonomous Driving via Sparse Scene RepresentationCode4
S^3Gaussian: Self-Supervised Street Gaussians for Autonomous DrivingCode4
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy PredictionCode4
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion PerspectiveCode4
Is Sora a World Simulator? A Comprehensive Survey on General World Models and BeyondCode4
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual ReasoningCode4
End-to-End Autonomous Driving through V2X CooperationCode4
End-to-end Autonomous Driving: Challenges and FrontiersCode4
Segment and Track AnythingCode4
Planning-oriented Autonomous DrivingCode4
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and RecipeCode4
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense PredictionCode4
OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous DrivingCode4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View RepresentationCode4
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal TransformersCode4
Epona: Autoregressive Diffusion World Model for Autonomous DrivingCode3
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
Generalized Trajectory Scoring for End-to-end Multimodal PlanningCode3
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action ModelsCode3
Generative AI for Autonomous Driving: Frontiers and OpportunitiesCode3
Detect Anything 3D in the WildCode3
End-to-End Driving with Online Trajectory Evaluation via BEV World ModelCode3
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous DrivingCode3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and ReconstructionCode3
Show:102550
← PrevPage 1 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified