SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 201250 of 6092 papers

TitleStatusHype
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object DetectionCode2
A Simulation Benchmark for Autonomous Racing with Large-Scale Human DataCode2
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation MasksCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous DrivingCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature AttentionCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion ModelsCode2
Solving Motion Planning Tasks with a Scalable Generative ModelCode2
SeFlow: A Self-Supervised Scene Flow Method in Autonomous DrivingCode2
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point CloudCode2
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view RepresentationCode2
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving ApplicationsCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social ExperiencesCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous DrivingCode2
Autonomous Driving with Spiking Neural NetworksCode2
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple ModalitiesCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving ScenesCode2
RoGs: Large Scale Road Surface Reconstruction with Meshgrid GaussianCode2
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous VehiclesCode2
RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging RadarCode2
Multi-Space Alignments Towards Universal LiDAR SegmentationCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Deep Learning-Based Point Cloud Registration: A Comprehensive Survey and TaxonomyCode2
SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth EstimationCode2
Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End SystemCode2
PRAM: Place Recognition Anywhere Model for Efficient Visual LocalizationCode2
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios?Code2
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong ElicitingCode2
Is CLIP the main roadblock for fine-grained open-world perception?Code2
LidarDM: Generative LiDAR Simulation in a Generated WorldCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
GraphAD: Interaction Scene Graph for End-to-end Autonomous DrivingCode2
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous DrivingCode2
Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous DrivingCode2
TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane SegmentationCode2
RCooper: A Real-world Large-scale Dataset for Roadside Cooperative PerceptionCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous DrivingCode2
Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous DrivingCode2
Show:102550
← PrevPage 5 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified