SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 251300 of 6092 papers

TitleStatusHype
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process ThinkingCode2
A Simulation Benchmark for Autonomous Racing with Large-Scale Human DataCode2
A Survey on Multimodal Large Language Models for Autonomous DrivingCode2
Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous DrivingCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
A Simple and Model-Free Path Filtering Algorithm for Smoothing and AccuracyCode2
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised MatchingCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
GAIA-1: A Generative World Model for Autonomous DrivingCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
Fully Sparse 3D Occupancy PredictionCode2
ConceptFusion: Open-set Multimodal 3D MappingCode2
FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D cameraCode2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy PredictionCode2
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action ControlCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
CLRerNet: Improving Confidence of Lane Detection with LaneIoUCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view RepresentationCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Advances in 4D Generation: A SurveyCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Drone-assisted Road Gaussian Splatting with Cross-view UncertaintyCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving ApplicationsCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature AttentionCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
A Review of Safe Reinforcement Learning: Methods, Theory and ApplicationsCode2
FRNet: Frustum-Range Networks for Scalable LiDAR SegmentationCode2
Fully Sparse 3D Object DetectionCode2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous DrivingCode2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
Show:102550
← PrevPage 6 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified