SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 301325 of 6092 papers

TitleStatusHype
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
A Simple and Model-Free Path Filtering Algorithm for Smoothing and AccuracyCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
Advances in 4D Generation: A SurveyCode2
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Hidden Biases of End-to-End Driving ModelsCode2
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single DecoderCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Human-AI Shared Control via Policy DissectionCode2
HumanBench: Towards General Human-centric Perception with Projector Assisted PretrainingCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
Is CLIP the main roadblock for fine-grained open-world perception?Code2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth EstimationCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
Show:102550
← PrevPage 13 of 244Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified