SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human control.

Many of the state-of-the-art results can be found on more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 301-350 of 6092 papers

Title | Status | Hype
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model | Code | 2
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Code | 2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Code | 2
GPT-Driver: Learning to Drive with GPT | Code | 2
You Only Look at Once for Real-time and Generic Multi-Task | Code | 2
GAIA-1: A Generative World Model for Autonomous Driving | Code | 2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Code | 2
Rethinking Imitation-based Planner for Autonomous Driving | Code | 2
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision | Code | 2
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving | Code | 2
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation | Code | 2
PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction | Code | 2
StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction | Code | 2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation | Code | 2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2
LATR: 3D Lane Detection from Monocular Images with Transformer | Code | 2
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving | Code | 2
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Code | 2
COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts | Code | 2
A Simple and Model-Free Path Filtering Algorithm for Smoothing and Accuracy | Code | 2
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Code | 2
LimSim: A Long-term Interactive Multi-scenario Traffic Simulator | Code | 2
Recent Advancements in End-to-End Autonomous Driving using Deep Learning: A Survey | Code | 2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation | Code | 2
MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying | Code | 2
RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation | Code | 2
QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction | Code | 2
MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction | Code | 2
End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Code | 2
The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge | Code | 2
Datasets and Benchmarks for Offline Safe Reinforcement Learning | Code | 2
Hidden Biases of End-to-End Driving Models | Code | 2
StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views | Code | 2
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving | Code | 2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario | Code | 2
DetGPT: Detect What You Need via Reasoning | Code | 2
VDT: General-purpose Video Diffusion Transformers via Mask Modeling | Code | 2
Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes | Code | 2
CLRerNet: Improving Confidence of Lane Detection with LaneIoU | Code | 2
Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | Code | 2
Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving | Code | 2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Code | 2
Transformer-Based Visual Segmentation: A Survey | Code | 2
VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene | Code | 2
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction | Code | 2
Graph-based Topology Reasoning for Driving Scenes | Code | 2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation | Code | 2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Code | 2
A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Code | 2
Page 7 of 122

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ReasonNet | Driving Score | 79.95 | - | Unverified
2 | InterFuser | Driving Score | 76.18 | - | Unverified
3 | TCP | Driving Score | 75.14 | - | Unverified
4 | TF++ WP | Driving Score | 66.32 | - | Unverified
5 | Learning From All Vehicles (LAV) | Driving Score | 61.85 | - | Unverified
6 | TransFuser | Driving Score | 61.18 | - | Unverified
7 | TransFuser (Reproduced) | Driving Score | 55.04 | - | Unverified
8 | TCP (Reproduced) | Driving Score | 47.91 | - | Unverified
9 | Latent TransFuser | Driving Score | 45.2 | - | Unverified
10 | GRIAD | Driving Score | 36.79 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Geometric Fusion | RC | 69.17 | - | Unverified
2 | TransFuser | RC | 56.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Geometric Fusion | RC | 86.91 | - | Unverified
2 | TransFuser | RC | 78.41 | - | Unverified