SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 150 of 6092 papers

TitleStatusHype
GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous DrivingCode0
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework0
LaViPlan : Language-Guided Visual Path Planning with RLVR0
Channel-wise Motion Features for Efficient Motion Segmentation0
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving0
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models0
Safeguarding Federated Learning-based Road Condition Classification0
Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers0
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation0
A Survey on Interpretability in Visual Recognition0
3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving0
Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance0
LifelongPR: Lifelong knowledge fusion for point cloud place recognition based on replay and prompt learningCode0
I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene ForecastingCode2
Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT0
Objectomaly: Objectness-Aware Refinement for OoD Segmentation with Structural Consistency and Boundary Precision0
3DGS_LSR:Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting0
Towards Solar Altitude Guided Scene Illumination0
TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems0
LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving0
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video UnderstandingCode1
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR RepresentationsCode1
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World ModelsCode0
FMOcc: TPV-Driven Flow Matching for 3D Occupancy Prediction with Selective State Space Model0
3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage GenerationCode0
Following the Clues: Experiments on Person Re-ID using Cross-Modal IntelligenceCode0
LLM-based Realistic Safety-Critical Driving Video Generation0
Out-of-distribution detection in 3D applications: a review0
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World ModelCode0
A Survey on Vision-Language-Action Models for Autonomous DrivingCode4
Epona: Autoregressive Diffusion World Model for Autonomous DrivingCode3
Where, What, Why: Towards Explainable Driver Attention PredictionCode1
Point Cloud Compression and Objective Quality Assessment: A Survey0
Integrating Multi-Modal Sensors: A Review of Fusion Techniques for Intelligent Vehicles0
MADrive: Memory-Augmented Driving Scene Modeling0
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic0
GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction0
SAM4D: Segment Anything in Camera and LiDAR Streams0
Out-of-Distribution Semantic Occupancy PredictionCode1
V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling0
Differential Transformer-driven 6G Physical Layer for Collaborative Perception Enhancement0
Brain2Model Transfer: Training sensory and decision models with human neural activity as a teacher0
Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos0
From 2D to 3D Cognition: A Brief Survey of General World Models0
Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios0
A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects0
PEVLM: Parallel Encoding for Vision-Language Models0
Unified Vision-Language-Action Model0
Self-Supervised Multimodal NeRF for Autonomous DrivingCode1
A Framework for Uncertainty Quantification Based on Nearest Neighbors Across Layers0
Show:102550
← PrevPage 1 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified