SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 501550 of 6092 papers

TitleStatusHype
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and ReconstructionCode3
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario UnderstandingCode2
Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback0
Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving0
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection0
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation0
Post-interactive Multimodal Trajectory Prediction for Autonomous Driving0
Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latant Space0
Hybrid Rendering for Multimodal Autonomous Driving: Merging Neural and Physics-Based Simulation0
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action AlignmentCode3
LiSu: A Dataset and Method for LiDAR Surface Normal EstimationCode1
STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive ApplicationsCode1
JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data0
Simulating Automotive Radar with Lidar and Camera Inputs0
Task-Oriented Co-Design of Communication, Computing, and Control for Edge-Enabled Industrial Cyber-Physical Systems0
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous DrivingCode1
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single DecoderCode2
FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback0
V-Max: A Reinforcement Learning Framework for Autonomous DrivingCode2
Simulator Ensembles for Trustworthy Autonomous Driving Testing0
Controllable 3D Outdoor Scene Generation via Scene GraphsCode2
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and ReasoningCode3
LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction0
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts0
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera0
HisTrackMap: Global Vectorized High-Definition Map Construction via History Map Tracking0
Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense0
Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru0
RS2AD: End-to-End Autonomous Driving Data Generation from Roadside Sensor Observations0
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting0
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective PriorsCode0
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkCode2
Temporal Triplane Transformers as Occupancy World Models0
CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving0
Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation0
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation0
Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving0
Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On0
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving0
Future-Aware Interaction Network For Motion Forecasting0
StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place RecognitionCode0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection0
Segment Anything, Even Occluded0
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene CompletionCode1
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection0
Treble Counterfactual VLMs: A Causal Approach to HallucinationCode0
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning0
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation0
TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking0
Show:102550
← PrevPage 11 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified