DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models Feb 19, 2024 Autonomous Driving Scene Understanding
DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder Nov 5, 2021 Autonomous Vehicles Image Segmentation
Boundary Seeking GANs Jan 1, 2018 Scene Understanding Text Generation
Joint Optical Flow and Temporally Consistent Semantic Segmentation Jul 26, 2016 Motion Estimation Optical Flow Estimation
DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Aug 29, 2024 Autonomous Driving Denoising
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation Jun 25, 2025 Novel View Synthesis Object
Bottom-up Instance Segmentation using Deep Higher-Order CRFs Sep 8, 2016 Instance Segmentation Object
3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map Dec 10, 2021 Autonomous Vehicles Navigate
Joint prototype and coefficient prediction for 3D instance segmentation Jul 9, 2024 3D Instance Segmentation Instance Segmentation
DORSal: Diffusion for Object-centric Representations of Scenes et al Jun 13, 2023 Neural Rendering Object
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation May 28, 2025 Autonomous Navigation RAG
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding Dec 1, 2021 Disentanglement Domain Adaptation
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs Jun 5, 2025 cross-modal alignment Dense Captioning
Does CLIP perceive art the same way we do? May 8, 2025 Image Generation Scene Understanding
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation Mar 25, 2023 Domain Adaptation ERP
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions Sep 11, 2018 Question Answering Scene Understanding
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions? Mar 31, 2019 Human-Object Interaction Detection Object
Answerability Fields: Answerable Location Estimation via Diffusion Models Jul 26, 2024 Question Answering Scene Understanding
Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World Jan 1, 2013 Language Acquisition Question Answering
Joint Modeling of Visual Objects and Relations for Scene Graph Generation Dec 1, 2021 Graph Embedding Graph Generation
Joint Semantic and Motion Segmentation for dynamic scenes using Deep Convolutional Networks Apr 18, 2017 Motion Segmentation Optical Flow Estimation
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos Mar 11, 2025 Scene Understanding
Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation May 11, 2025 Autonomous Driving Domain Adaptation
Distraction-Aware Shadow Detection Jun 1, 2019 Scene Understanding Shadow Detection
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Jun 17, 2024 3D geometry 3D Semantic Occupancy Prediction
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles Dec 10, 2018 Autonomous Driving Autonomous Vehicles
Distillation of Human-Object Interaction Contexts for Action Recognition Dec 17, 2021 Action Recognition Graph Attention
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare Jun 1, 2018 3D Object Reconstruction Autonomous Driving
BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight Jul 11, 2024 Autonomous Driving BEV Segmentation
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning Feb 13, 2025 Code Generation Scene Understanding
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows Mar 20, 2022 Human-Object Interaction Detection Object
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition Jun 1, 2016 Image Segmentation Object Recognition
Discovery of Shared Semantic Spaces for Multi-Scene Video Query and Summarization Jul 27, 2015 Scene Understanding Semantic Similarity
An Exemplar-based CRF for Multi-instance Object Segmentation Jun 1, 2014 Instance Segmentation Object
Disaster Anomaly Detector via Deeper FCDDs for Explainable Initial Responses Jun 5, 2023 Anomaly Detection Disaster Response
BlindSpotNet: Seeing Where We Cannot See Jul 8, 2022 Depth Estimation Monocular Depth Estimation
Adapting to Length Shift: FlexiLength Network for Trajectory Prediction Mar 31, 2024 Autonomous Driving Prediction
iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability Jun 25, 2021 Bias Detection Question Answering
DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation Apr 22, 2019 3D Object Detection Autonomous Driving
Direction-Aware Semi-Dense SLAM Sep 18, 2017 Scene Understanding Segmentation
Blending Learning and Inference in Structured Prediction Oct 8, 2012 Prediction Scene Understanding
DINeMo: Learning Neural Mesh Models with no 3D Annotations Mar 26, 2025 3D Pose Estimation 6D Pose Estimation
A New Ratio Image Based CNN Algorithm For SAR Despeckling Jun 10, 2019 General Classification Scene Understanding
J-MOD^2: Joint Monocular Obstacle Detection and Depth Estimation Sep 25, 2017 Depth Estimation Scene Understanding
Audio-Visual Collaborative Representation Learning for Dynamic Saliency Prediction Sep 17, 2021 Representation Learning Saliency Prediction
Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems Jan 23, 2024 Scene Classification Scene Recognition
Active Scene Understanding via Online Semantic Reconstruction Jun 18, 2019 Scene Parsing Scene Understanding
3D Question Answering for City Scene Understanding Jul 24, 2024 Autonomous Driving Question Answering
DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data Mar 22, 2024 Denoising Scene Understanding
Diffusion Models in 3D Vision: A Survey Oct 7, 2024 Autonomous Driving Computational Efficiency