SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 876900 of 6092 papers

TitleStatusHype
Quantitative Predictive Monitoring and Control for Safe Human-Machine Interaction0
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models0
MapExpert: Online HD Map Construction with Simple and Efficient Sparse Map Element Expert0
DriveTester: A Unified Platform for Simulation-Based Autonomous Driving TestingCode1
Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks0
Open-World Panoptic Segmentation0
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
Improving the Transferability of 3D Point Cloud Attack via Spectral-aware Admix and Optimization Designs0
Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset0
AEPHORA: AI/ML-Based Energy-Efficient Proactive Handover and Resource Allocation0
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
NEST: A Neuromodulated Small-world Hypergraph Trajectory Prediction Model for Autonomous Driving0
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
DINO-Foresight: Looking into the Future with DINOCode2
Point Cloud-Assisted Neural Image Compression0
CP-Guard: Malicious Agent Detection and Defense in Collaborative Bird's Eye View Perception0
Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents0
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation0
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy PredictionCode1
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models0
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving0
RowDetr: End-to-End Row Detection Using Polynomials0
EI-Drive: A Platform for Cooperative Perception with Realistic Communication Models0
WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language ModelCode1
Show:102550
← PrevPage 36 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified