SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 851900 of 6092 papers

TitleStatusHype
Towards Selection and Transition Between Behavior-Based Neural Networks for Automated Driving0
Optimizing Low-Speed Autonomous Driving: A Reinforcement Learning Approach to Route Stability and Maximum Speed0
Autoware.Flex: Human-Instructed Dynamically Reconfigurable Autonomous Driving Systems0
Mapping the Mind of an Instruction-based Image Editing using SMILECode2
Camera-Based Localization and Enhanced Normalized Mutual Information0
Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving0
VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving0
Sparse Point Clouds Assisted Learned Image Compression0
LiHi-GS: LiDAR-Supervised Gaussian Splatting for Highway Driving Scene Reconstruction0
Drive-1-to-3: Enriching Diffusion Priors for Novel View Synthesis of Real Vehicles0
VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision0
DriveGPT: Scaling Autoregressive Behavior Models for Driving0
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous DrivingCode4
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language ModelsCode0
Object Style Diffusion for Generalized Object Detection in Urban Scene0
An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training0
Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose EstimationCode1
Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction0
Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration0
A Black-Box Evaluation Framework for Semantic Robustness in Bird's Eye View DetectionCode0
Joint Perception and Prediction for Autonomous Driving: A SurveyCode2
CLIP-RLDrive: Human-Aligned Autonomous Driving via CLIP-Based Reward Shaping in Reinforcement Learning0
C2F-TP: A Coarse-to-Fine Denoising Framework for Uncertainty-Aware Trajectory PredictionCode0
SafeDrive: Knowledge- and Data-Driven Risk-Sensitive Decision-Making for Autonomous Vehicles with Large Language Models0
Quantitative Predictive Monitoring and Control for Safe Human-Machine Interaction0
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models0
MapExpert: Online HD Map Construction with Simple and Efficient Sparse Map Element Expert0
DriveTester: A Unified Platform for Simulation-Based Autonomous Driving TestingCode1
Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks0
Open-World Panoptic Segmentation0
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs0
Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset0
AEPHORA: AI/ML-Based Energy-Efficient Proactive Handover and Resource Allocation0
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving0
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
DINO-Foresight: Looking into the Future with DINOCode2
Point Cloud-Assisted Neural Image Compression0
CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception0
Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents0
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation0
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy PredictionCode1
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models0
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving0
RowDetr: End-to-End Row Detection Using Polynomials0
EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models0
WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language ModelCode1
Show:102550
← PrevPage 18 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified