SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 5175 of 6092 papers

TitleStatusHype
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action AlignmentCode3
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and ReasoningCode3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous DrivingCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
Safety at Scale: A Comprehensive Survey of Large Model SafetyCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPTCode3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous DrivingCode3
HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous DrivingCode3
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous DrivingCode3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAMCode3
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving ScenesCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
Does End-to-End Autonomous Driving Really Need Perception Tasks?Code3
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous DrivingCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingCode3
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge BasesCode3
CarLLaVA: Vision language models for camera-only closed-loop drivingCode3
Enhancing End-to-End Autonomous Driving with Latent World ModelCode3
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking ScenariosCode3
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous DrivingCode3
Show:102550
← PrevPage 3 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified