CASPNet++: Joint Multi-Agent Motion Prediction Aug 15, 2023 Autonomous Driving motion prediction
— Unverified 0ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail Mar 21, 2025 Object Scene Understanding
— Unverified 0Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks May 8, 2016 Depth Estimation General Classification
— Unverified 0Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception Aug 31, 2023 Activity Recognition Human Activity Recognition
— Unverified 0Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios Jun 25, 2025 Autonomous Driving Decision Making
— Unverified 0ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding Jun 30, 2024 Graph Generation Graph Neural Network
— Unverified 0Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection Feb 13, 2023 3D Object Detection Graph Generation
— Unverified 0Cascaded Classification Models: Combining Models for Holistic Scene Understanding Dec 1, 2008 3D Reconstruction 3D Scene Reconstruction
— Unverified 0Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection Jul 17, 2025 Scene Understanding
— Unverified 0Exploiting Temporal Coherence for Multi-modal Video Categorization Feb 7, 2020 object-detection Object Detection
— Unverified 0Transavs: End-To-End Audio-Visual Segmentation With Transformer May 12, 2023 Scene Understanding Segmentation
— Unverified 0ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation Sep 7, 2020 Autonomous Driving Domain Adaptation
— Unverified 0Car Segmentation and Pose Estimation using 3D Object Models Dec 21, 2015 3D Pose Estimation Image Segmentation
— Unverified 0Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors Oct 12, 2024 3D Generation 3D geometry
— Unverified 0A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors Dec 3, 2023 Active Learning Instance Segmentation
— Unverified 0A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators Aug 1, 2018 Attribute Natural Questions
— Unverified 0Enhancing image captioning with depth information using a Transformer-based framework Jul 24, 2023 Image Captioning Image Paragraph Captioning
— Unverified 0Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning Mar 15, 2024 Autonomous Driving Human-Object Interaction Detection
— Unverified 0Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving Sep 11, 2023 Autonomous Driving Descriptive
— Unverified 0Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding Jun 17, 2024 3D Object Detection 3D Semantic Segmentation
— Unverified 0Multilateral Cascading Network for Semantic Segmentation of Large-Scale Outdoor Point Clouds Sep 21, 2024 Scene Understanding Semantic Segmentation
— Unverified 0Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps May 24, 2025 Scene Understanding Spatial Reasoning
— Unverified 03D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing Aug 25, 2024 Data Augmentation Diversity
— Unverified 0HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors Aug 12, 2024 Scene Understanding Semantic Segmentation
— Unverified 0HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes Mar 29, 2024 3DGS Autonomous Vehicles
— Unverified 0End-to-End Race Driving with Deep Reinforcement Learning Jul 6, 2018 Deep Reinforcement Learning Domain Adaptation
— Unverified 0End-to-end Autonomous Driving using Deep Learning: A Systematic Review Aug 27, 2023 Autonomous Driving object-detection
— Unverified 0Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving Sep 4, 2024 Autonomous Driving Decision Making
— Unverified 0Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision Mar 28, 2025 Optical Flow Estimation Point Tracking
— Unverified 0Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind May 18, 2025 Benchmarking Scene Understanding
— Unverified 0A Reinforcement Learning Approach to Target Tracking in a Camera Network Jul 26, 2018 Q-Learning reinforcement-learning
— Unverified 0Empowering Large Language Models with 3D Situation Awareness Mar 29, 2025 Scene Understanding
— Unverified 0Empowering cyberphysical systems of systems with intelligence Jul 5, 2021 Decision Making Management
— Unverified 0Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation? Apr 23, 2022 Robot Manipulation Scene Understanding
— Unverified 0EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction May 2, 2018 Saliency Prediction Scene Understanding
— Unverified 0Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery Mar 29, 2025 Action Understanding Instrument Recognition
— Unverified 0A Reflectance Based Method For Shadow Detection and Removal Jul 11, 2018 Detecting Shadows Scene Understanding
— Unverified 0A diffusion and clustering-based approach for finding coherent motions and understanding crowd scenes Feb 16, 2016 Clustering Optical Flow Estimation
— Unverified 0Embracing Diffraction: A Paradigm Shift in Wireless Sensing and Communication May 2, 2025 Scene Understanding
— Unverified 0EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Jul 14, 2025 Scene Understanding Spatial Reasoning
— Unverified 0Embodied Visual Active Learning for Semantic Segmentation Dec 17, 2020 Active Learning Deep Reinforcement Learning
— Unverified 0Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Dec 31, 2024 Robot Manipulation Scene Understanding
— Unverified 0Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics Mar 8, 2023 Autonomous Vehicles Scene Understanding
— Unverified 0Are Cars Just 3D Boxes? - Jointly Estimating the 3D Shape of Multiple Objects Jun 1, 2014 3D geometry 3D Shape Modeling
— Unverified 0Embodied Scene Understanding for Vision Language Models via MetaVQA Jan 15, 2025 Decision Making Question Answering
— Unverified 0Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles May 9, 2025 Autonomous Navigation Autonomous Vehicles
— Unverified 0Camera Control at the Edge with Language Models for Scene Understanding May 9, 2025 Language Modeling Language Modelling
— Unverified 0Addressing the Sim2Real Gap in Robotic 3D Object Classification Oct 28, 2019 3D Object Classification Classification
— Unverified 03D Shape Augmentation with Content-Aware Shape Resizing May 15, 2024 3D Generation Scene Understanding
— Unverified 0Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception Oct 2, 2023 Autonomous Driving Image Segmentation
— Unverified 0