SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 351400 of 6092 papers

TitleStatusHype
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
GAIA-1: A Generative World Model for Autonomous DrivingCode2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple ModalitiesCode2
Fully Sparse 3D Occupancy PredictionCode2
NeuRAD: Neural Rendering for Autonomous DrivingCode2
Fully Sparse 3D Object DetectionCode2
Neurosymbolic Diffusion ModelsCode2
Doe-1: Closed-Loop Autonomous Driving with Large World ModelCode2
FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous DrivingCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and PlanningCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous DrivingCode2
Driv3R: Learning Dense 4D Reconstruction for Autonomous DrivingCode2
Online Video Understanding: OVBench and VideoChat-OnlineCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy PerceptionCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditionsCode2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous DrivingCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane BenchmarkCode2
PillarNet: Real-Time and High-Performance Pillar-based 3D Object DetectionCode2
Pillar R-CNN for Point Cloud 3D Object DetectionCode2
2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object DetectionCode2
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario UnderstandingCode2
ADMap: Anti-disturbance framework for reconstructing online vectorized HD mapCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future ProspectsCode2
Drive Like a Human: Rethinking Autonomous Driving with Large Language ModelsCode2
Query-Centric Trajectory PredictionCode2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object DetectionCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
Show:102550
← PrevPage 8 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified