SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 601650 of 6092 papers

TitleStatusHype
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion0
Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking0
CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems0
VVRec: Reconstruction Attacks on DL-based Volumetric Video Upstreaming via Latent Diffusion Model with Gamma Distribution0
InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer0
CalibRefine: Deep Learning-Based Online Automatic Targetless LiDAR-Camera Calibration with Iterative and Attention-Driven Post-RefinementCode1
GaussianFlowOcc: Sparse and Weakly Supervised Occupancy Estimation using Gaussian Splatting and Temporal Flow0
Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances0
MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow EstimationCode1
An Expert Ensemble for Detecting Anomalous Scenes, Interactions, and Behaviors in Autonomous Driving0
Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous DrivingCode1
AUKT: Adaptive Uncertainty-Guided Knowledge Transfer with Conformal Prediction0
Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous DrivingCode0
A Brain-Inspired Perception-Decision Driving Model Based on Neural Pathway Anatomical Alignment0
Interaction-Aware Model Predictive Decision-Making for Socially-Compliant Autonomous Driving in Mixed Urban Traffic Scenarios0
PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured EnvironmentsCode0
Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection0
Enhancing Vehicle Make and Model Recognition with 3D Attention Modules0
Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection0
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner FrameworkCode2
Para-Lane: Multi-Lane Dataset Registering Parallel Scans for Benchmarking Novel View Synthesis0
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence0
CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models0
VaViM and VaVAM: Autonomous Driving through Video Generative ModelingCode2
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving ScenariosCode0
ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v110
OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images0
Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance0
AVD2: Accident Video Diffusion for Accident Video Description0
OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving0
RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation0
CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond0
Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving0
Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning0
SegRet: An Efficient Design for Semantic Segmentation with Retentive NetworkCode0
MEX: Memory-efficient Approach to Referring Multi-Object Tracking0
Activation-wise Propagation: A Universal Strategy to Break Timestep Constraints in Spiking Neural Networks for 3D Data Processing0
RadSplatter: Extending 3D Gaussian Splatting to Radio Frequencies for Wireless Radiomap Extrapolation0
Uncertain Multi-Objective Recommendation via Orthogonal Meta-Learning Enhanced Bayesian Optimization0
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning0
Fragility-aware Classification for Understanding Risk and Improving Generalization0
CoDiff: Conditional Diffusion Model for Collaborative 3D Object DetectionCode1
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems0
PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies0
NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing0
Adaptive Neural Networks for Intelligent Data-Driven Development0
The Role of World Models in Shaping Autonomous Driving: A Comprehensive SurveyCode5
V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models0
Show:102550
← PrevPage 13 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified