SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 301350 of 6092 papers

TitleStatusHype
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian SplattingCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and PlanningCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent SpaceCode2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
Fully Sparse 3D Occupancy PredictionCode2
A Survey on Multimodal Large Language Models for Autonomous DrivingCode2
Human-AI Shared Control via Policy DissectionCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable FiltersCode2
Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving ApplicationsCode2
iPad: Iterative Proposal-centric End-to-End Autonomous DrivingCode2
Advances in 4D Generation: A SurveyCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
A Simulation Benchmark for Autonomous Racing with Large-Scale Human DataCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
Label Efficient Visual Abstractions for Autonomous DrivingCode2
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous DrivingCode2
LangCoop: Collaborative Driving with LanguageCode2
Large Trajectory Models are Scalable Motion Predictors and PlannersCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
A Simple and Model-Free Path Filtering Algorithm for Smoothing and AccuracyCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Chameleon: Fast-slow Neuro-symbolic Lane Topology ExtractionCode2
Cityscapes 3D: Dataset and Benchmark for 9 DoF Vehicle DetectionCode2
CLRerNet: Improving Confidence of Lane Detection with LaneIoUCode2
LidarDM: Generative LiDAR Simulation in a Generated WorldCode2
LiDAR Snowfall Simulation for Robust 3D Object DetectionCode2
FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionCode2
LimSim: A Long-term Interactive Multi-scenario Traffic SimulatorCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
LMDrive: Closed-Loop End-to-End Driving with Large Language ModelsCode2
A Review of Safe Reinforcement Learning: Methods, Theory and ApplicationsCode2
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene UnderstandingCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object SegmentationCode2
Show:102550
← PrevPage 7 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified