SOTAVerified

Autonomous Driving

Autonomous driving is the task of driving a vehicle without human conduction.

Many of the state-of-the-art results can be found at more general task pages such as 3D Object Detection and Semantic Segmentation.

(Image credit: Exploring the Limitations of Behavior Cloning for Autonomous Driving)

Papers

Showing 651700 of 6092 papers

TitleStatusHype
Injecting Planning-Awareness into Prediction and Detection EvaluationCode1
Interpretable Self-Aware Neural Networks for Robust Trajectory PredictionCode1
Deep Metric Learning for Open World Semantic SegmentationCode1
Deep Learning for Vision-based Prediction: A SurveyCode1
IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous DrivingCode1
Deep Learning for Omnidirectional Vision: A Survey and New PerspectivesCode1
Deep Learning for 3D Point Cloud Understanding: A SurveyCode1
Deep learning for radar data exploitation of autonomous vehicleCode1
iCurb: Imitation Learning-based Detection of Road Curbs using Aerial Images for Autonomous DrivingCode1
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop TrainingCode1
HVDetFusion: A Simple and Robust Camera-Radar Fusion FrameworkCode1
HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic FusionCode1
DeepReach: A Deep Learning Approach to High-Dimensional ReachabilityCode1
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic SegmentationCode1
DeepIPC: Deeply Integrated Perception and Control for an Autonomous Vehicle in Real EnvironmentsCode1
Human Performance Capture from Monocular Video in the WildCode1
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative PerceptionCode1
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous DrivingCode1
DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based LocalizationCode1
DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic SegmentationCode1
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic TasksCode1
HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDARCode1
Human-Centric Autonomous Systems With LLMs for User Command ReasoningCode1
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object MotionCode1
CSFlow: Learning Optical Flow via Cross Strip Correlation for Autonomous DrivingCode1
Curricular Subgoals for Inverse Reinforcement LearningCode1
DatasetEquity: Are All Samples Created Equal? In The Quest For Equity Within DatasetsCode1
Crowdsourced 3D Mapping: A Combined Multi-View Geometry and Self-Supervised Learning ApproachCode1
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary InvestigationCode1
AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object DetectionCode1
Cross-modal Learning for Domain Adaptation in 3D Semantic SegmentationCode1
HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving ScenariosCode1
MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian DetectionCode1
A Unified Probabilistic Approach to Traffic Conflict DetectionCode1
CrossDTR: Cross-view and Depth-guided Transformers for 3D Object DetectionCode1
DARTH: Holistic Test-time Adaptation for Multiple Object TrackingCode1
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point BlendingCode1
Human-compatible driving partners through data-regularized self-play reinforcement learningCode1
A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future DirectionsCode1
DDD17: End-To-End DAVIS Driving DatasetCode1
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous DrivingCode1
A Unified Query-based Paradigm for Point Cloud UnderstandingCode1
PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous DrivingCode1
Deep Federated Learning for Autonomous DrivingCode1
CR3DT: Camera-RADAR Fusion for 3D Detection and TrackingCode1
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent ForecastingCode1
HLA-Face: Joint High-Low Adaptation for Low Light Face DetectionCode1
Deep Learning for 3D Point Clouds: A SurveyCode1
COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy PredictionCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
Show:102550
← PrevPage 14 of 122Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ReasonNetDriving Score79.95Unverified
2InterFuserDriving Score76.18Unverified
3TCPDriving Score75.14Unverified
4TF++ WPDriving Score66.32Unverified
5Learning From All Vehicles (LAV)Driving Score61.85Unverified
6TransFuserDriving Score61.18Unverified
7TransFuser (Reproduced)Driving Score55.04Unverified
8TCP (Reproduced)Driving Score47.91Unverified
9Latent TransFuserDriving Score45.2Unverified
10GRIADDriving Score36.79Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC69.17Unverified
2TransFuserRC56.36Unverified
#ModelMetricClaimedVerifiedStatus
1Geometric FusionRC86.91Unverified
2TransFuserRC78.41Unverified