SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 751800 of 6092 papers

TitleStatusHype
Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous DrivingCode2
BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement ModuleCode0
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process ThinkingCode2
Decoding Interpretable Logic Rules from Neural Networks0
A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition0
Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving0
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving0
GAC-Net_Geometric and attention-based Network for Depth Completion0
LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language ModelsCode1
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving0
Common Sense Is All You Need0
TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/VideosCode0
Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion0
Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding0
CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving0
LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models0
Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions0
The global consensus on the risk management of autonomous driving0
DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving0
CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection0
AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR DataCode1
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation DataCode0
FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection0
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving0
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions0
Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A SurveyCode1
Image Segmentation: Inducing graph-based learningCode0
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving0
SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving0
Hybrid Machine Learning Model with a Constrained Action Space for Trajectory Prediction0
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric PerspectivesCode5
A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation0
MObI: Multimodal Object Inpainting Using Diffusion Models0
4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic SegmentationCode0
LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating0
GCP: Guarded Collaborative Perception with Spatial-Temporal Aware Malicious Agent DetectionCode0
RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging RadarCode1
Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models0
Evaluating Scenario-based Decision-making for Interactive Autonomous Driving Using Rational Criteria: A Survey0
Enhancing Large Vision Model in Street Scene Semantic Understanding through Leveraging Posterior Optimization Trajectory0
MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception0
JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration0
Leveraging SD Map to Augment HD Map-based Trajectory Prediction0
DriveScape: High-Resolution Driving Video Generation by Multi-View Feature Fusion0
Pseudo Visible Feature Fine-Grained Fusion for Thermal Object DetectionCode1
D^3CTTA: Domain-Dependent Decorrelation for Continual Test-Time Adaption of 3D LiDAR Segmentation0
Multi-Modal Aerial-Ground Cross-View Place Recognition with Neural ODEs0
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving0
PIDLoc: Cross-View Pose Optimization Network Inspired by PID ControllersCode1
GLane3D: Detecting Lanes with Graph of 3D Keypoints0
Show:102550
← PrevPage 16 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified