| ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents | Mar 19, 2025 | Autonomous DrivingImage Stitching | —Unverified | 0 |
| GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving | Mar 19, 2025 | Autonomous DrivingTrajectory Prediction | —Unverified | 0 |
| DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling | Mar 19, 2025 | Autonomous DrivingPosition | —Unverified | 0 |
| MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Mar 19, 2025 | Adversarial RobustnessAutonomous Driving | —Unverified | 0 |
| SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments | Mar 19, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 |
| CP-NCBF: A Conformal Prediction-based Approach to Synthesize Verified Neural Control Barrier Functions | Mar 18, 2025 | Autonomous DrivingConformal Prediction | —Unverified | 0 |
| SuperPC: A Single Diffusion Model for Point Cloud Completion, Upsampling, Denoising, and Colorization | Mar 18, 2025 | 3D ReconstructionAutonomous Driving | —Unverified | 0 |
| Driving behavior recognition via self-discovery learning | Mar 18, 2025 | Autonomous Driving | —Unverified | 0 |
| ChatBEV: A Visual Language Model that Understands BEV Maps | Mar 18, 2025 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Mar 18, 2025 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 0 |
| SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Mar 18, 2025 | Autonomous DrivingImage Generation | CodeCode Available | 1 |
| RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | Mar 18, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Robust3D-CIL: Robust Class-Incremental Learning for 3D Perception | Mar 18, 2025 | Autonomous Drivingclass-incremental learning | —Unverified | 0 |
| Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning | Mar 18, 2025 | Autonomous DrivingMotion Planning | CodeCode Available | 2 |
| Advances in 4D Generation: A Survey | Mar 18, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations | Mar 18, 2025 | Autonomous DrivingMamba | CodeCode Available | 1 |
| Tracking Meets Large Multimodal Models for Driving Scenario Understanding | Mar 18, 2025 | Autonomous Driving | CodeCode Available | 1 |
| TriLiteNet: Lightweight Model for Multi-Task Visual Perception | Mar 17, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 1 |
| SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Mar 17, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey | Mar 17, 2025 | 3D ReconstructionAutonomous Driving | —Unverified | 0 |
| SAM2 for Image and Video Segmentation: A Comprehensive Survey | Mar 17, 2025 | Autonomous DrivingImage Segmentation | —Unverified | 0 |
| OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering | Mar 17, 2025 | 3D Multi-Object TrackingAutonomous Driving | —Unverified | 0 |
| AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction | Mar 17, 2025 | Autonomous DrivingNavigate | —Unverified | 0 |
| GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching | Mar 17, 2025 | Autonomous DrivingImage Generation | CodeCode Available | 2 |
| A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Mar 17, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model | Mar 16, 2025 | Autonomous Driving | —Unverified | 0 |
| Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding | Mar 16, 2025 | Autonomous DrivingRAG | CodeCode Available | 1 |
| Point Cloud Based Scene Segmentation: A Survey | Mar 16, 2025 | 3D Object Detection3D Semantic Segmentation | —Unverified | 0 |
| Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey | Mar 16, 2025 | Autonomous Drivingmultimodal generation | CodeCode Available | 4 |
| Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training | Mar 15, 2025 | Autonomous DrivingBench2Drive | CodeCode Available | 1 |
| Bench2FreeAD: A Benchmark for Vision-based End-to-end Navigation in Unstructured Robotic Environments | Mar 15, 2025 | Autonomous DrivingRobot Navigation | CodeCode Available | 1 |
| DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving | Mar 15, 2025 | Autonomous DrivingBench2Drive | —Unverified | 0 |
| 3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction | Mar 15, 2025 | 3D ReconstructionAutonomous Driving | CodeCode Available | 1 |
| Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation | Mar 14, 2025 | Autonomous DrivingData Augmentation | —Unverified | 0 |
| DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation | Mar 14, 2025 | 3D geometryAutonomous Driving | CodeCode Available | 1 |
| Active Learning from Scene Embeddings for End-to-End Autonomous Driving | Mar 14, 2025 | Active LearningAutonomous Driving | —Unverified | 0 |
| DynRsl-VLM: Enhancing Autonomous Driving Perception with Dynamic Resolution Vision-Language Models | Mar 14, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 |
| BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion Model | Mar 14, 2025 | Autonomous DrivingData Augmentation | CodeCode Available | 1 |
| Centaur: Robust End-to-End Autonomous Driving with Test-Time Training | Mar 14, 2025 | Autonomous DrivingNavSim | —Unverified | 0 |
| A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving | Mar 14, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Learning-Based MPC for Fuel Efficient Control of Autonomous Vehicles with Discrete Gear Selection | Mar 14, 2025 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 0 |
| Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM | Mar 13, 2025 | Autonomous DrivingDecoder | CodeCode Available | 1 |
| TAIJI: Textual Anchoring for Immunizing Jailbreak Images in Vision Language Models | Mar 13, 2025 | Autonomous Driving | —Unverified | 0 |
| OCCUQ: Exploring Efficient Uncertainty Quantification for 3D Occupancy Prediction | Mar 13, 2025 | Autonomous DrivingNavigate | CodeCode Available | 1 |
| Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space | Mar 13, 2025 | Autonomous DrivingEmotion Recognition | CodeCode Available | 0 |
| Unlock the Power of Unlabeled Data in Language Driving Model | Mar 13, 2025 | Autonomous DrivingQuestion Answering | —Unverified | 0 |
| MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction | Mar 13, 2025 | 3DGS3D Scene Reconstruction | —Unverified | 0 |
| TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness | Mar 13, 2025 | Autonomous DrivingPrediction | —Unverified | 0 |
| Unlocking Generalization Power in LiDAR Point Cloud Registration | Mar 13, 2025 | Autonomous DrivingPoint Cloud Registration | CodeCode Available | 2 |
| TARS: Traffic-Aware Radar Scene Flow Estimation | Mar 13, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |