SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 101150 of 6092 papers

TitleStatusHype
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving ResearchCode3
CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous DrivingCode3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map ConstructionCode3
iDisc: Internal Discretization for Monocular Depth EstimationCode3
Geometric-aware Pretraining for Vision-centric 3D Object DetectionCode3
VAD: Vectorized Scene Representation for Efficient Autonomous DrivingCode3
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous DrivingCode3
Tri-Perspective View for Vision-Based 3D Semantic Occupancy PredictionCode3
MapTR: Structured Modeling and Learning for Online Vectorized HD Map ConstructionCode3
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous DrivingCode3
TorchSparse: Efficient Point Cloud Inference EngineCode3
Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception ChallengeCode3
I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene ForecastingCode2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy PredictionCode2
Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian SplattingCode2
VideoMolmo: Spatio-Temporal Grounding Meets PointingCode2
Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter EmbeddingCode2
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action ControlCode2
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future ProspectsCode2
WeatherEdit: Controllable Weather Editing with 4D Gaussian FieldCode2
TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous DrivingCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
iPad: Iterative Proposal-centric End-to-End Autonomous DrivingCode2
Neurosymbolic Diffusion ModelsCode2
LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous DrivingCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
LangCoop: Collaborative Driving with LanguageCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
LandMarkSystem Technical ReportCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Towards Generating Realistic 3D Semantic Training Data for Autonomous DrivingCode2
SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data PretrainingCode2
LightLoc: Learning Outdoor LiDAR Localization at Light SpeedCode2
Advances in 4D Generation: A SurveyCode2
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and PlanningCode2
GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised MatchingCode2
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario UnderstandingCode2
Unlocking Generalization Power in LiDAR Point Cloud RegistrationCode2
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single DecoderCode2
V-Max: A Reinforcement Learning Framework for Autonomous DrivingCode2
Controllable 3D Outdoor Scene Generation via Scene GraphsCode2
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkCode2
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous DrivingCode2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop DrivingCode2
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-ResolutionCode2
VaViM and VaVAM: Autonomous Driving through Video Generative ModelingCode2
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner FrameworkCode2
Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous DrivingCode2
Show:102550
← PrevPage 3 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified