SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 51100 of 6092 papers

TitleStatusHype
USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways0
Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning0
TDACloud: Point Cloud Recognition Using Topological Data Analysis0
Coherent Track-Before-Detect0
Bayesian Multiobject Tracking With Neural-Enhanced Motion and Measurement Models0
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
3D Gaussian Splatting for Fine-Detailed Surface Reconstruction in Large-Scale Scene0
AI-based Multimodal Biometrics for Detecting Smartphone Distractions: Application to Online Learning0
DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy0
R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level VisionCode1
Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles0
Toward Safety-First Human-Like Decision Making for Autonomous Vehicles in Time-Varying Traffic Flow0
ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems0
Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
KDMOS:Knowledge Distillation for Motion SegmentationCode0
Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition0
STAGE: A Stream-Centric Generative World Model for Long-Horizon Driving-Scene Simulation0
X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability0
COME: Adding Scene-Centric Forecasting Control to Occupancy World ModelCode1
RelTopo: Enhancing Relational Modeling for Driving Scene Topology Reasoning0
FindMeIfYouCan: Bringing Open Set metrics to near , far and farther Out-of-Distribution Object Detection0
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
A Survey on World Models Grounded in Acoustic Physical InformationCode0
Bridging Data-Driven and Physics-Based Models: A Consensus Multi-Model Kalman Filter for Robust Vehicle State Estimation0
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction0
On the Natural Robustness of Vision-Language Models Against Visual Perception Attacks in Autonomous Driving0
Vision-based Lifting of 2D Object Detections for Automated Driving0
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds0
FocalAD: Local Motion Planning for End-to-End Autonomous Driving0
Poutine: Vision-Language-Trajectory Pre-Training and Reinforcement Learning Post-Training Enable Robust End-to-End Autonomous Driving0
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy PredictionCode2
LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System0
Using Language and Road Manuals to Inform Map Reconstruction for Autonomous Driving0
Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation0
ReSim: Reliable World Simulation for Autonomous Driving0
AD^2-Bench: A Hierarchical CoT Benchmark for MLLM in Autonomous Driving under Adverse Conditions0
ODG: Occupancy Prediction Using Dual Gaussians0
ECAM: A Contrastive Learning Approach to Avoid Environmental Collision in Trajectory ForecastingCode0
DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos0
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic SegmentationCode1
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
Technical Report for Argoverse2 Scenario Mining Challenges on Iterative Error Correction and Spatially-Aware Prompting0
Perception Characteristics Distance: Measuring Stability and Robustness of Perception System in Dynamic Conditions under a Certain Decision RuleCode0
Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL)0
Diffusion Models for Safety Validation of Autonomous Driving Systems0
TrajFlow: Multi-modal Motion Prediction via Flow Matching0
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation0
SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding0
Show:102550
← PrevPage 2 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified