SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 125 of 6092 papers

TitleStatusHype
Vista: A Generalizable Driving World Model with High Fidelity and Versatile ControllabilityCode7
NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and BenchmarkingCode7
GenAD: Generalized Predictive Model for Autonomous DrivingCode7
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric PerspectivesCode5
Awesome Multi-modal Object TrackingCode5
VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic PlanningCode5
Neural Fields in Robotics: A SurveyCode5
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingCode5
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth EstimationCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
The Role of World Models in Shaping Autonomous Driving: A Comprehensive SurveyCode5
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and RecipeCode4
Planning-oriented Autonomous DrivingCode4
Is Sora a World Simulator? A Comprehensive Survey on General World Models and BeyondCode4
Multimodal Chain-of-Thought Reasoning: A Comprehensive SurveyCode4
3D Scene Generation: A SurveyCode4
End-to-End Autonomous Driving through V2X CooperationCode4
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous DrivingCode4
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal TransformersCode4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View RepresentationCode4
End-to-end Autonomous Driving: Challenges and FrontiersCode4
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy PredictionCode4
Show:102550
← PrevPage 1 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified