SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 351375 of 6092 papers

TitleStatusHype
GAIA-1: A Generative World Model for Autonomous DrivingCode2
Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous DrivingCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous DrivingCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D QueriesCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
DetGPT: Detect What You Need via ReasoningCode2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving ScenarioCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free EnvironmentsCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
ADMap: Anti-disturbance framework for reconstructing online vectorized HD mapCode2
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and PlanningCode2
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction FollowingCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
Show:102550
← PrevPage 15 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified