SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 101150 of 6092 papers

TitleStatusHype
LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments0
ZeroVO: Visual Odometry with Minimal Assumptions0
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving0
Scaling Laws of Motion Forecasting and Planning -- A Technical Report0
Ego-centric Learning of Communicative World Models for Autonomous Driving0
DLNet: Direction-Aware Feature Integration for Robust Lane Detection in Complex EnvironmentsCode0
Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization0
Generalized Trajectory Scoring for End-to-end Multimodal PlanningCode3
Gaussian Mapping for Evolving Scenes0
DriveAction: A Benchmark for Exploring Human-like Driving Decisions in VLA Models0
STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous DrivingCode1
Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction0
Edge-Enabled Collaborative Object Detection for Real-Time Multi-Vehicle PerceptionCode0
Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving0
Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline0
VideoMolmo: Spatio-Temporal Grounding Meets PointingCode2
Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth0
Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian SplattingCode2
AD-EE: Early Exiting for Fast and Reliable Vision-Language Models in Autonomous Driving0
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous DrivingCode0
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration0
Pseudo-Simulation for Autonomous DrivingCode4
Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter EmbeddingCode2
GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation0
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations0
Channel-adaptive Cross-modal Generative Semantic Communication for Point Cloud Transmission0
HiLO: High-Level Object Fusion for Autonomous Driving using TransformersCode0
Embedded Acoustic Intelligence for Automotive Systems0
DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving0
LoRA as a Flexible Framework for Securing Large Vision Systems0
Using Diffusion Ensembles to Estimate Uncertainty for End-to-End Autonomous Driving0
NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic SegmentationCode0
Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic SegmentationCode0
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Modelwith Spatio-Temporal Visual Representation0
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic TasksCode1
ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving0
Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization0
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action ModelsCode3
Autoregressive Meta-Actions for Unified Controllable Trajectory Generation0
Diffusion-Based Generative Models for 3D Occupancy Prediction in Autonomous Driving0
HMAD: Advancing E2E Driving with Anchored Offset Proposals and Simulation-Supervised Multi-target Scoring0
SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail VoxelsCode0
The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector0
Learnable Burst-Encodable Time-of-Flight Imaging for High-Fidelity Long-Distance Depth Sensing0
A Human-Centric Approach to Explainable AI for Personalized EducationCode0
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action ControlCode2
From Failures to Fixes: LLM-Driven Scenario Repair for Self-Evolving Autonomous Driving0
DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving0
Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO0
GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous DrivingCode0
Show:102550
← PrevPage 3 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified