SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 201250 of 6092 papers

TitleStatusHype
GPD-1: Generative Pre-training for DrivingCode2
GPT-Driver: Learning to Drive with GPTCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous DrivingCode2
Autonomous Driving with Spiking Neural NetworksCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
Human-AI Shared Control via Policy DissectionCode2
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios?Code2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmarkCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?Code2
COCO-O: A Benchmark for Object Detectors under Natural Distribution ShiftsCode2
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous VehiclesCode2
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous DrivingCode2
Cityscapes 3D: Dataset and Benchmark for 9 DoF Vehicle DetectionCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
CLRerNet: Improving Confidence of Lane Detection with LaneIoUCode2
LangCoop: Collaborative Driving with LanguageCode2
LangProp: A code optimization framework using Large Language Models applied to drivingCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous DrivingCode2
LGSVL Simulator: A High Fidelity Simulator for Autonomous DrivingCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
ConceptFusion: Open-set Multimodal 3D MappingCode2
LiDAR Snowfall Simulation for Robust 3D Object DetectionCode2
LightLoc: Learning Outdoor LiDAR Localization at Light SpeedCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
Make Your ViT-based Multi-view 3D Detectors Faster via Token CompressionCode2
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene UnderstandingCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory PredictionCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
MF-MOS: A Motion-Focused Model for Moving Object SegmentationCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
Drone-assisted Road Gaussian Splatting with Cross-view UncertaintyCode2
Show:102550
← PrevPage 5 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified