SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 451500 of 6092 papers

TitleStatusHype
LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place RecognitionCode1
What is the Added Value of UDA in the VFM Era?Code1
SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision SystemsCode1
Collaborative Perception Datasets for Autonomous Driving: A ReviewCode1
UncAD: Towards Safe End-to-end Autonomous Driving via Online Map UncertaintyCode1
ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language ModelsCode1
SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR SynthesisCode1
Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map ConstructionCode1
CAFE-AD: Cross-Scenario Adaptive Feature Enhancement for Trajectory Planning in Autonomous DrivingCode1
From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous DrivingCode1
Can Test-Time Scaling Improve World Foundation Model?Code1
Cal or No Cal? -- Real-Time Miscalibration Detection of LiDAR and Camera SensorsCode1
Multi-modal Knowledge Distillation-based Human Trajectory ForecastingCode1
Mitigating Trade-off: Stream and Query-guided Aggregation for Efficient and Effective 3D Occupancy PredictionCode1
VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene FlowCode1
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous DrivingCode1
M3Net: Multimodal Multi-task Learning for 3D Detection, Segmentation, and Occupancy Prediction in Autonomous DrivingCode1
Unraveling the Effects of Synthetic Data on End-to-End Autonomous DrivingCode1
HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous DrivingCode1
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous DrivingCode1
EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point CloudsCode1
MamBEV: Enabling State Space Models to Learn Birds-Eye-View RepresentationsCode1
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World ModelCode1
Tracking Meets Large Multimodal Models for Driving Scenario UnderstandingCode1
TriLiteNet: Lightweight Model for Multi-Task Visual PerceptionCode1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene UnderstandingCode1
3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene ReconstructionCode1
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop TrainingCode1
Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic EnvironmentsCode1
BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion ModelCode1
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion GenerationCode1
OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy PredictionCode1
Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSMCode1
STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive ApplicationsCode1
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous DrivingCode1
LiSu: A Dataset and Method for LiDAR Surface Normal EstimationCode1
Secure On-Device Video OOD Detection Without BackpropagationCode1
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene CompletionCode1
FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction FrameworkCode1
CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late HybridizationCode1
Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous DrivingCode1
Enhancing Monocular 3D Scene Completion with Diffusion ModelCode1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation ModelsCode1
SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View SegmentationCode1
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event CamerasCode1
CalibRefine: Deep Learning-Based Online Automatic Targetless LiDAR-Camera Calibration with Iterative and Attention-Driven Post-RefinementCode1
MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow EstimationCode1
Co-MTP: A Cooperative Trajectory Prediction Framework with Multi-Temporal Fusion for Autonomous DrivingCode1
CoDiff: Conditional Diffusion Model for Collaborative 3D Object DetectionCode1
Event-aided Semantic Scene CompletionCode1
Show:102550
← PrevPage 10 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified