SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 151200 of 6092 papers

TitleStatusHype
OccLE: Label-Efficient 3D Semantic Occupancy Prediction0
CogAD: Cognitive-Hierarchy Guided End-to-End Autonomous Driving0
Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach0
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting0
WeatherEdit: Controllable Weather Editing with 4D Gaussian FieldCode2
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future ProspectsCode2
Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation0
DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous DrivingCode1
DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving0
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous DrivingCode1
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving0
Echo Planning for Autonomous Driving: From Current Observations to Future Trajectories and Back0
ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos0
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving0
Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour0
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation0
TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous DrivingCode2
RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection0
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning0
CrashAgent: Crash Scenario Generation via Multi-modal Reasoning0
Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling0
Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras0
LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving ScenariosCode0
CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving0
BadDepth: Backdoor Attacks Against Monocular Depth Estimation in the Physical World0
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving0
DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving0
RealEngine: Simulating Autonomous Driving in Realistic ContextCode1
Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)0
VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving0
Human-like Semantic Navigation for Autonomous Driving using Knowledge Representation and Large Language Models0
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Chirp Delay-Doppler Domain Modulation: A New Paradigm of Integrated Sensing and Communication for Autonomous VehiclesCode1
RadarRGBD A Multi-Sensor Fusion Dataset for Perception with RGB-D and mmWave RadarCode0
Challenger: Affordable Adversarial Driving Video Generation0
Generative AI for Autonomous Driving: A Review0
VERDI: VLM-Embedded Reasoning for Autonomous Driving0
RIS Beam Calibration for ISAC Systems: Modeling and Performance Analysis0
ALN-P3: Unified Language Alignment for Perception, Prediction, and Planning in Autonomous Driving0
TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving0
HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning0
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving0
LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval0
AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving0
Learning-based Autonomous Oversteer Control and Collision Avoidance0
seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic SegmentationCode0
DC-Scene: Data-Centric Learning for 3D Scene UnderstandingCode0
iPad: Iterative Proposal-centric End-to-End Autonomous DrivingCode2
Spectral-Aware Global Fusion for RGB-Thermal Semantic Segmentation0
Rate-Accuracy Bounds in Visual Coding for Machines0
Show:102550
← PrevPage 4 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified