SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 151200 of 6092 papers

TitleStatusHype
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
STAMP: Scalable Task And Model-agnostic Collaborative PerceptionCode2
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian SplattingCode2
Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous DrivingCode2
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process ThinkingCode2
Online Video Understanding: OVBench and VideoChat-OnlineCode2
Mapping the Mind of an Instruction-based Image Editing using SMILECode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
Joint Perception and Prediction for Autonomous Driving: A SurveyCode2
DINO-Foresight: Looking into the Future with DINOCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlCode2
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy PredictionCode2
GaussianAD: Gaussian-Centric End-to-End Autonomous DrivingCode2
Doe-1: Closed-Loop Autonomous Driving with Large World ModelCode2
DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous DrivingCode2
GPD-1: Generative Pre-training for DrivingCode2
DriveMM: All-in-One Large Multimodal Model for Autonomous DrivingCode2
Driv3R: Learning Dense 4D Reconstruction for Autonomous DrivingCode2
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation ModelCode2
SADG: Segment Any Dynamic Gaussian Without Object TrackersCode2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
Monocular Lane Detection Based on Deep Learning: A SurveyCode2
Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel MethodCode2
DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous DrivingCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation GraphCode2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop SimulationCode2
On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDARCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
UniDrive: Towards Universal Driving Perception Across Camera ConfigurationsCode2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic SegmentationCode2
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View SynthesisCode2
DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic StatesCode2
Motion Forecasting in Continuous DrivingCode2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous DrivingCode2
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy PredictionCode2
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous DrivingCode2
OPUS: Occupancy Prediction Using a Sparse SetCode2
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous DrivingCode2
A Comprehensive Survey on Evidential Deep Learning and Its ApplicationsCode2
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditionsCode2
Make Your ViT-based Multi-view 3D Detectors Faster via Token CompressionCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
UTrack: Multi-Object Tracking with Uncertain DetectionsCode2
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured EnvironmentsCode2
Drone-assisted Road Gaussian Splatting with Cross-view UncertaintyCode2
TripleMixer: A 3D Point Cloud Denoising Model for Adverse WeatherCode2
MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory PredictionCode2
Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement NetworkCode2
Show:102550
← PrevPage 4 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified