SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 251300 of 6092 papers

TitleStatusHype
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionCode2
PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global FeaturesCode2
OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy PredictionCode2
On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous DrivingCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
Deep learning for 3D human pose estimation and mesh recovery: A surveyCode2
ICP-Flow: LiDAR Scene Flow Estimation with ICPCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action ChainCode2
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language ModelCode2
PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving EnvironmentsCode2
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction FollowingCode2
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity EnhancementCode2
MF-MOS: A Motion-Focused Model for Moving Object SegmentationCode2
ADMap: Anti-disturbance framework for reconstructing online vectorized HD mapCode2
Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop TechnologiesCode2
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural CalibrationCode2
LangProp: A code optimization framework using Large Language Models applied to drivingCode2
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAMCode2
WidthFormer: Toward Efficient Transformer-based BEV View TransformationCode2
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large ModelsCode2
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene UnderstandingCode2
Visual Point Cloud Forecasting enables Scalable Autonomous DrivingCode2
Fully Sparse 3D Occupancy PredictionCode2
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous DrivingCode2
LingoQA: Visual Question Answering for Autonomous DrivingCode2
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free EnvironmentsCode2
DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving ScenesCode2
LMDrive: Closed-Loop End-to-End Driving with Large Language ModelsCode2
Exploring Radar Data Representations in Autonomous Driving: A Comprehensive ReviewCode2
Towards Knowledge-driven Autonomous DrivingCode2
FRNet: Frustum-Range Networks for Scalable LiDAR SegmentationCode2
Open-sourced Data Ecosystem in Autonomous Driving: the Present and FutureCode2
Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?Code2
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty AwarenessCode2
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous DrivingCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
Panacea: Panoramic and Controllable Video Generation for Autonomous DrivingCode2
OccWorld: Learning a 3D Occupancy World Model for Autonomous DrivingCode2
NeuRAD: Neural Rendering for Autonomous DrivingCode2
SelfOcc: Self-Supervised Vision-Based 3D Occupancy PredictionCode2
A Survey on Multimodal Large Language Models for Autonomous DrivingCode2
Sparse4D v3: Advancing End-to-End 3D Detection and TrackingCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
Tactics2D: A Highly Modular and Extensible Simulator for Driving Decision-makingCode2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
Large Trajectory Models are Scalable Motion Predictors and PlannersCode2
RoboDepth: Robust Out-of-Distribution Depth Estimation under CorruptionsCode2
Vision Language Models in Autonomous Driving: A Survey and OutlookCode2
UniPAD: A Universal Pre-training Paradigm for Autonomous DrivingCode2
Show:102550
← PrevPage 6 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified