SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 401450 of 6092 papers

TitleStatusHype
MonoDETR: Depth-guided Transformer for Monocular 3D Object DetectionCode2
Real-time Object Detection for Streaming PerceptionCode2
Learning from All VehiclesCode2
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with TransformersCode2
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane BenchmarkCode2
FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionCode2
HybridNets: End-to-End Perception NetworkCode2
LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting NetworkCode2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A SurveyCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
Pedestrian Detection: Domain Generalization, CNNs, Transformers and BeyondCode2
HiVT: Hierarchical Vector Transformer for Multi-Agent Motion PredictionCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene ImagesCode2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object DetectionCode2
2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object DetectionCode2
Multi-Modal Fusion Transformer for End-to-End Autonomous DrivingCode2
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous DrivingCode2
Cityscapes 3D: Dataset and Benchmark for 9 DoF Vehicle DetectionCode2
Label Efficient Visual Abstractions for Autonomous DrivingCode2
LGSVL Simulator: A High Fidelity Simulator for Autonomous DrivingCode2
Scalability in Perception for Autonomous Driving: Waymo Open DatasetCode2
nuScenes: A multimodal dataset for autonomous drivingCode2
PointPillars: Fast Encoders for Object Detection from Point CloudsCode2
SECOND: Sparsely Embedded Convolutional DetectionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video UnderstandingCode1
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR RepresentationsCode1
Where, What, Why: Towards Explainable Driver Attention PredictionCode1
Out-of-Distribution Semantic Occupancy PredictionCode1
Self-Supervised Multimodal NeRF for Autonomous DrivingCode1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level VisionCode1
COME: Adding Scene-Centric Forecasting Control to Occupancy World ModelCode1
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic SegmentationCode1
STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous DrivingCode1
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic TasksCode1
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous DrivingCode1
DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous DrivingCode1
RealEngine: Simulating Autonomous Driving in Realistic ContextCode1
Chirp Delay-Doppler Domain Modulation: A New Paradigm of Integrated Sensing and Communication for Autonomous VehiclesCode1
Always Clear Depth: Robust Monocular Depth Estimation under Adverse WeatherCode1
Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G NetworksCode1
OpenLKA: An Open Dataset of Lane Keeping Assist from Recent Car Models under Real-world Driving ConditionsCode1
Empirical Performance Evaluation of Lane Keeping Assist on Modern Production VehiclesCode1
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous DrivingCode1
M3CAD: Towards Generic Cooperative Autonomous Driving BenchmarkCode1
DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at OnceCode1
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic FusionCode1
Show:102550
← PrevPage 9 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified