SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 24512500 of 6092 papers

TitleStatusHype
A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone0
FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving0
Enhancing Pedestrian Trajectory Prediction with Crowd Trip InformationCode0
VLMine: Long-Tail Data Mining with Vision Language Models0
SPformer: A Transformer Based DRL Decision Making Method for Connected Automated Vehicles0
Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving0
Goal-based Neural Physics Vehicle Trajectory Prediction Model0
Enhancing LLM-based Autonomous Driving Agents to Mitigate Perception Attacks0
Margin-bounded Confidence Scores for Out-of-Distribution DetectionCode0
First Field Trial of LLM-Powered AI Agent for Lifecycle Management of Autonomous Driving Optical Networks0
A Survey on Large Language Model-empowered Autonomous Driving0
LFP: Efficient and Accurate End-to-End Lane-Level Planning via Camera-LiDAR Fusion0
Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving0
METDrive: Multi-modal End-to-end Autonomous Driving with Temporal Guidance0
LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks0
LMT-Net: Lane Model Transformer Network for Automated HD Mapping from Sparse Vehicle Observations0
Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic0
RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View0
Unveiling the Black Box: Independent Functional Module Evaluation for Bird's-Eye-View Perception Model0
High-Order Evolving Graphs for Enhanced Representation of Traffic DynamicsCode0
RenderWorld: World Model with Self-Supervised 3D Label0
Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation0
TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection0
ExelMap: Explainable Element-based HD-Map Change Detection and Update0
DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving0
Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving0
Robust Bird's Eye View Segmentation by Adapting DINOv20
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion0
Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference0
GlobalMapNet: An Online Framework for Vectorized Global HD Map Construction0
XLM for Autonomous Driving Systems: A Comprehensive Review0
CoMamba: Real-time Cooperative Perception Unlocked with State Space ModelsCode0
Risk-Aware Autonomous Driving with Linear Temporal Logic Specifications0
A Data-Informed Analysis of Scalable Supervision for Safety in Autonomous Vehicle Fleets0
MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action PredictionCode0
The Design of Informative Take-Over Requests for Semi-Autonomous Cyber-Physical Systems: Combining Spoken Language and Visual Icons in a Drone-Controller Setting0
ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutationCode0
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine0
Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes0
Attack End-to-End Autonomous Driving through Module-Wise Noise0
GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution0
GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions0
Unsupervised Point Cloud Registration with Self-DistillationCode0
Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU0
Behavioral Cloning Models Reality Check for Autonomous Driving0
Module-wise Adaptive Adversarial Training for End-to-end Autonomous Driving0
UdeerLID+: Integrating LiDAR, Image, and Relative Depth with Semi-Supervised0
MyGo: Consistent and Controllable Multi-View Driving Video Generation with Camera Control0
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception0
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving0
Show:102550
← PrevPage 50 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified