SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 51100 of 6092 papers

TitleStatusHype
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action AlignmentCode3
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and ReasoningCode3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous DrivingCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
Safety at Scale: A Comprehensive Survey of Large Model SafetyCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPTCode3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous DrivingCode3
HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous DrivingCode3
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous DrivingCode3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAMCode3
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving ScenesCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
Does End-to-End Autonomous Driving Really Need Perception Tasks?Code3
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous DrivingCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingCode3
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge BasesCode3
CarLLaVA: Vision language models for camera-only closed-loop drivingCode3
Enhancing End-to-End Autonomous Driving with Latent World ModelCode3
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking ScenariosCode3
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous DrivingCode3
SMART: Scalable Multi-agent Real-time Motion Generation via Next-token PredictionCode3
CarDreamer: Open-Source Learning Platform for World Model based Autonomous DrivingCode3
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous DrivingCode3
Vision-based 3D occupancy prediction in autonomous driving: a review and outlookCode3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous DrivingCode3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction AttentionCode3
RoadBEV: Road Surface Reconstruction in Bird's Eye ViewCode3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR SynthesisCode3
Producing and Leveraging Online Map Uncertainty in Trajectory PredictionCode3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object DetectionCode3
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object DetectionCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
Embodied Understanding of Driving ScenariosCode3
Behavior Generation with Latent ActionsCode3
Leveraging Enhanced Queries of Point Sets for Vectorized Map ConstructionCode3
GenAD: Generative End-to-End Autonomous DrivingCode3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-AgentsCode3
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous DrivingCode3
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous DrivingCode3
DeFlow: Decoder of Scene Flow Network in Autonomous DrivingCode3
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and OpportunitiesCode3
DriveLM: Driving with Graph Visual Question AnsweringCode3
Mind the map! Accounting for existing map information when estimating online HDMaps from sensorCode3
LLM4Drive: A Survey of Large Language Models for Autonomous DrivingCode3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUsCode3
Show:102550
← PrevPage 2 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified