SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 601650 of 6092 papers

TitleStatusHype
ABNet: Attention BarrierNet for Safe and Scalable Robot LearningCode1
A Two-Stage Masked Autoencoder Based Network for Indoor Depth CompletionCode1
SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface ConditionsCode1
3D-AVS: LiDAR-based 3D Auto-Vocabulary SegmentationCode1
Instruct Large Language Models to Drive like HumansCode1
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object MotionCode1
Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World AttacksCode1
Frequency-based Matcher for Long-tailed Semantic SegmentationCode1
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement LearningCode1
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic SegmentationCode1
Real-Time Environment Condition Classification for Autonomous VehiclesCode1
Learning Shared RGB-D Fields: Unified Self-supervised Pre-training for Label-efficient LiDAR-Camera 3D PerceptionCode1
DiffuBox: Refining 3D Object Detection with Point DiffusionCode1
3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous DrivingCode1
Label-efficient Semantic Scene Completion with Scribble AnnotationsCode1
NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud InterpolationCode1
CLRKDNet: Speeding up Lane Detection with Knowledge DistillationCode1
FADet: A Multi-sensor 3D Object Detection Network based on Local Featured AttentionCode1
NeRO: Neural Road Surface ReconstructionCode1
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous DrivingCode1
Multi-Object Tracking in the DarkCode1
Probing Multimodal LLMs as World Models for DrivingCode1
SemanticFormer: Holistic and Semantic Traffic Scene Representation for Trajectory Prediction using Knowledge GraphsCode1
SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor ConfigurationsCode1
MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object DetectionCode1
CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous DrivingCode1
FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous DrivingCode1
VG4D: Vision-Language Model Goes 4D Video RecognitionCode1
KI-GAN: Knowledge-Informed Generative Adversarial Networks for Enhanced Multi-Vehicle Trajectory Forecasting at Signalized IntersectionsCode1
N-Agent Ad Hoc TeamworkCode1
VRS-NeRF: Visual Relocalization with Sparse Neural Radiance FieldCode1
SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic PerceptionCode1
WROOM: An Autonomous Driving Approach for Off-Road NavigationCode1
PillarTrack:Boosting Pillar Representation for Transformer-based 3D Single Object Tracking on Point CloudsCode1
Homography Guided Temporal Fusion for Road Line and Marking SegmentationCode1
Better Monocular 3D Detectors with LiDAR from the PastCode1
Detecting Every Object from EventsCode1
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object DetectionCode1
WcDT: World-centric Diffusion Transformer for Traffic Scene GenerationCode1
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning DenoisingCode1
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression TasksCode1
Human-compatible driving partners through data-regularized self-play reinforcement learningCode1
UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain GapsCode1
Scenario-Based Curriculum Generation for Multi-Agent Autonomous DrivingCode1
PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture SearchCode1
CR3DT: Camera-RADAR Fusion for 3D Detection and TrackingCode1
Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and BaselineCode1
Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency RegularizationsCode1
Mask-based Invisible Backdoor Attacks on Object DetectionCode1
Large Language Models Powered Context-aware Motion Prediction in Autonomous DrivingCode1
Show:102550
← PrevPage 13 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified