SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 526550 of 6092 papers

TitleStatusHype
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera0
HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking0
Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense0
Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru0
RS2AD: End-to-End Autonomous Driving Data Generation from Roadside Sensor Observations0
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting0
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective PriorsCode0
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkCode2
Temporal Triplane Transformers as Occupancy World Models0
CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving0
Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation0
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation0
Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving0
Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On0
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving0
Future-Aware Interaction Network For Motion Forecasting0
StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place RecognitionCode0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection0
Segment Anything, Even Occluded0
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene CompletionCode1
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection0
Treble Counterfactual VLMs: A Causal Approach to HallucinationCode0
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning0
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation0
TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking0
Show:102550
← PrevPage 22 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified