SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 25012550 of 6092 papers

TitleStatusHype
Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving0
Enhancing the Spatial Awareness Capability of Multi-Modal Large Language Model0
FLODCAST: Flow and Depth Forecasting via Multimodal Recurrent Architectures0
Large Trajectory Models are Scalable Motion Predictors and PlannersCode2
ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object DetectionCode0
Siamese-DETR for Generic Multi-Object TrackingCode0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
Fine-Tuning Language Models Using Formal Methods Feedback0
EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving0
Three Pillars improving Vision Foundation Model Distillation for LidarCode1
Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models0
A Hybrid Graph Network for Complex Activity Detection in Video0
Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Federated Object DetectionCode1
YOLO-BEV: Generating Bird's-Eye View in the Same Way as 2D Object Detection0
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUsCode3
Using Knowledge Awareness to improve Safety of Autonomous Driving0
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated DrivingCode0
ParisLuco3D: A high-quality target dataset for domain generalization of LiDAR perception0
MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection0
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation0
Pixel-Level Clustering Network for Unsupervised Image Segmentation0
Data-driven Traffic Simulation: A Comprehensive Review0
RoboDepth: Robust Out-of-Distribution Depth Estimation under CorruptionsCode2
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic SegmentationCode0
BM2CP: Efficient Collaborative Perception with LiDAR-Camera ModalitiesCode1
DICE: Diverse Diffusion Model with Scoring for Trajectory Prediction0
Vision Language Models in Autonomous Driving: A Survey and OutlookCode2
Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction0
Exploring Driving Behavior for Autonomous Vehicles Based on Gramian Angular Field Vision Transformer0
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D DataCode1
Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose EncodingCode1
Multi‑camera trajectory matching based on hierarchical clustering and constraintsCode1
LeTFuser: Light-weight End-to-end Transformer-Based Sensor Fusion for Autonomous Driving with Multi-Task LearningCode1
One-Bit Byzantine-Tolerant Distributed Learning via Over-the-Air Computation0
Using Experience Classification for Training Non-Markovian Tasks0
Reinforcement learning with non-ergodic reward increments: robustness via ergodicity transformationsCode0
LiDAR-based 4D Occupancy Completion and ForecastingCode1
DORec: Decomposed Object Reconstruction and Segmentation Utilizing 2D Self-Supervised Features0
Path Following Control of Automated Vehicle Considering Uncertainties and Disturbances with Parametric Varying0
SoTTA: Robust Test-Time Adaptation on Noisy Data StreamsCode1
Multimodal Object Query Initialization for 3D Object Detection0
JM3D & JM3D-LLM: Elevating 3D Understanding with Joint Multi-modal CuesCode1
Real-Time Traffic Sign Detection: A Case Study in a Santa Clara Suburban Neighborhood0
Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving0
Data-driven Invariance for Reference Governors0
PU-Ray: Domain-Independent Point Cloud Upsampling via Ray Marching on Neural Implicit SurfaceCode0
Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learningCode0
DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative PerceptionCode1
NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding0
HeightFormer: A Multilevel Interaction and Image-adaptive Classification-regression Network for Monocular Height Estimation with Aerial Images0
Show:102550
← PrevPage 51 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified