SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 201250 of 6092 papers

TitleStatusHype
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large ModelsCode2
A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D cameraCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
Autonomous Driving on Curvy Roads Without Reliance on Frenet Frame: A Cartesian-Based Trajectory Planning MethodCode2
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario AnalysisCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View TransformationCode2
ADAPT: Action-aware Driving Caption TransformerCode2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable FiltersCode2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous DrivingCode2
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model ConversionCode2
Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and SegmentationCode2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and RoadmapCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather ConditionsCode2
LandMarkSystem Technical ReportCode2
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity EnhancementCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
FRNet: Frustum-Range Networks for Scalable LiDAR SegmentationCode2
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous DrivingCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
CaRL: Learning Scalable Planning Policies with Simple RewardsCode2
Enhancing Vectorized Map Perception with Historical Rasterized MapsCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting NetworkCode2
A Cognitive-Based Trajectory Prediction Approach for Autonomous DrivingCode2
LightLoc: Learning Outdoor LiDAR Localization at Light SpeedCode2
LimSim: A Long-term Interactive Multi-scenario Traffic SimulatorCode2
LingoQA: Visual Question Answering for Autonomous DrivingCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
End-to-End Vectorized HD-map Construction with Piecewise Bezier CurveCode2
Benchmarking the Robustness of LiDAR Semantic Segmentation ModelsCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point CloudsCode2
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous DrivingCode2
Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane PriorsCode2
Exploring the Causality of End-to-End Autonomous DrivingCode2
Fully Sparse 3D Object DetectionCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised MatchingCode2
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Show:102550
← PrevPage 5 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified