SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 101150 of 6092 papers

TitleStatusHype
CarDreamer: Open-Source Learning Platform for World Model based Autonomous DrivingCode3
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving ResearchCode3
Generative AI for Autonomous Driving: Frontiers and OpportunitiesCode3
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingCode3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
Generalized Trajectory Scoring for End-to-end Multimodal PlanningCode3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and ReconstructionCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR SynthesisCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous DrivingCode3
Fully Sparse 3D Object DetectionCode2
BEVFusion: A Simple and Robust LiDAR-Camera Fusion FrameworkCode2
Fully Sparse 3D Occupancy PredictionCode2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
FRNet: Frustum-Range Networks for Scalable LiDAR SegmentationCode2
FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionCode2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop DrivingCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous DrivingCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
GAIA-1: A Generative World Model for Autonomous DrivingCode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Autonomous Driving with Spiking Neural NetworksCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view RepresentationCode2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object DetectionCode2
Show:102550
← PrevPage 3 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified