SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without a human driver.

Many state-of-the-art results are listed on more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 901–925 of 6092 papers

| Title | Status | Hype |
| --- | --- | --- |
| WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model | Code | 1 |
| GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction | Code | 2 |
| GaussianAD: Gaussian-Centric End-to-End Autonomous Driving | Code | 2 |
| DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving | Code | 2 |
| MMD-OPT : Maximum Mean Discrepancy Based Sample Efficient Collision Risk Minimization for Autonomous Driving | | 0 |
| Doe-1: Closed-Loop Autonomous Driving with Large World Model | Code | 2 |
| Automatic Image Annotation for Mapped Features Detection | | 0 |
| Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | | 0 |
| Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Code | 0 |
| Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | | 0 |
| GPD-1: Generative Pre-training for Driving | Code | 2 |
| Physical Informed Driving World Model | | 0 |
| Neural Observation Field Guided Hybrid Optimization of Camera Placement | Code | 0 |
| DriveMM: All-in-One Large Multimodal Model for Autonomous Driving | Code | 2 |
| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Code | 0 |
| ITPNet: Towards Instantaneous Trajectory Prediction for Autonomous Driving | | 0 |
| Fast Occupancy Network | | 0 |
| Test-time Correction with Human Feedback: An Online 3D Detection System via Visual Prompting | | 0 |
| PPT: Pretraining with Pseudo-Labeled Trajectories for Motion Forecasting | | 0 |
| Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization | | 0 |
| Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Code | 2 |
| Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction | | 0 |
| Prediction of Occluded Pedestrians in Road Scenes using Human-like Reasoning: Insights from the OccluRoads Dataset | | 0 |
| World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | | 0 |
| AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | | 0 |
Page 37 of 244

Benchmark Results

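| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ReasonNet | Driving Score | 79.95 | | Unverified |
| 2 | InterFuser | Driving Score | 76.18 | | Unverified |
| 3 | TCP | Driving Score | 75.14 | | Unverified |
| 4 | TF++ WP | Driving Score | 66.32 | | Unverified |
| 5 | Learning From All Vehicles (LAV) | Driving Score | 61.85 | | Unverified |
| 6 | TransFuser | Driving Score | 61.18 | | Unverified |
| 7 | TransFuser (Reproduced) | Driving Score | 55.04 | | Unverified |
| 8 | TCP (Reproduced) | Driving Score | 47.91 | | Unverified |
| 9 | Latent TransFuser | Driving Score | 45.2 | | Unverified |
| 10 | GRIAD | Driving Score | 36.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Geometric Fusion | RC | 69.17 | | Unverified |
| 2 | TransFuser | RC | 56.36 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Geometric Fusion | RC | 86.91 | | Unverified |
| 2 | TransFuser | RC | 78.41 | | Unverified |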