AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis Feb 3, 2025 Object Counting Scene Understanding
— Unverified 00 Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring Jul 1, 2021 Food Recognition Image Captioning
— Unverified 00 Egocentric Activity Recognition and Localization on a 3D Map May 20, 2021 Action Localization Action Recognition
— Unverified 00 Efficient Point Transformer for Large-scale 3D Scene Understanding Sep 29, 2021 3D Semantic Segmentation Quantization
— Unverified 00 A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance May 16, 2024 LIDAR Semantic Segmentation Scene Understanding
— Unverified 00 Efficient Label Collection for Unlabeled Image Datasets Jun 1, 2015 Active Learning Autonomous Navigation
— Unverified 00 Efficient Interactive 3D Multi-Object Removal Jan 29, 2025 Object Scene Understanding
— Unverified 00 CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting Apr 16, 2025 3DGS 3D Instance Segmentation
— Unverified 00 CaDIS: Cataract Dataset for Image Segmentation Jun 27, 2019 2D Semantic Segmentation task 1 (8 classes) 2D Semantic Segmentation task 2 (17 classes)
— Unverified 00 Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving Jan 12, 2025 Autonomous Driving Decision Making
— Unverified 00 Efficient 3D Instance Mapping and Localization with Neural Fields Mar 28, 2024 3D Instance Segmentation Image Segmentation
— Unverified 00 Real-time Semantic Segmentation with Context Aggregation Network Nov 2, 2020 Real-Time Semantic Segmentation Scene Understanding
— Unverified 00 EarthNets: Empowering AI in Earth Observation Oct 10, 2022 Deep Learning Earth Observation
— Unverified 00 EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Jun 3, 2024 Domain Adaptation Open Vocabulary Semantic Segmentation
— Unverified 00 BYE: Build Your Encoder with One Sequence of Exploration Data for Long-Term Dynamic Scene Understanding Dec 3, 2024 Motion Estimation Object
— Unverified 00 Application of Multimodal Large Language Models in Autonomous Driving Dec 21, 2024 Autonomous Driving Decision Making
— Unverified 00 AdaToken-3D: Dynamic Spatial Gating for Efficient 3D Large Multimodal-Models Reasoning May 19, 2025 Multimodal Reasoning Scene Understanding
— Unverified 00 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer Jan 2, 2025 Scene Understanding
— Unverified 00 360^ High-Resolution Depth Estimation via Uncertainty-aware Structural Knowledge Transfer Apr 17, 2023 Depth Estimation Monocular Depth Estimation
— Unverified 00 DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM Oct 15, 2020 Autonomous Driving Decision Making
— Unverified 00 Trajectory-based Scene Understanding using Dirichlet Process Mixture Model Mar 18, 2018 Clustering Decision Making
— Unverified 00 BUTLER: Building Understanding in TextWorld via Language for Embodied Reasoning Jan 1, 2021 Scene Understanding
— Unverified 00 Dynamic Scene Understanding from Vision-Language Representations Jan 20, 2025 Grounded Situation Recognition Human-Human Interaction Recognition
— Unverified 00 Building an Affordances Map with Interactive Perception Mar 11, 2019 General Classification Scene Understanding
— Unverified 00 Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving Sep 30, 2019 Autonomous Driving Decision Making
— Unverified 00 Dynamic Clustering Transformer Network for Point Cloud Segmentation May 30, 2023 Clustering Decoder
— Unverified 00 A pooling based scene text proposal technique for scene text reading in the wild Nov 25, 2018 Scene Understanding Text Spotting
— Unverified 00 DublinCity: Annotated LiDAR Point Cloud and its Applications Sep 6, 2019 3D Reconstruction Scene Understanding
— Unverified 00 Bridging Scene Understanding and Task Execution with Flexible Simulation Environments Nov 20, 2020 Graph Generation reinforcement-learning
— Unverified 00 BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection Jan 18, 2024 Diversity Scene Text Detection
— Unverified 00 DSNet: An Efficient CNN for Road Scene Segmentation Apr 10, 2019 Autonomous Driving GPU
— Unverified 00 DSM: Building A Diverse Semantic Map for 3D Visual Grounding Apr 11, 2025 3D visual grounding Scene Understanding
— Unverified 00 Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Feb 23, 2025 3DGS 3D Semantic Segmentation
— Unverified 00 BOX3D: Lightweight Camera-LiDAR Fusion for 3D Object Detection and Localization Aug 27, 2024 3D Object Detection Benchmarking
— Unverified 00 A Dataset for Semantic Segmentation in the Presence of Unknowns Mar 28, 2025 Anomaly Detection Anomaly Segmentation
— Unverified 00 DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving May 7, 2024 3D Object Detection Autonomous Driving
— Unverified 00 DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models Feb 19, 2024 Autonomous Driving Scene Understanding
— Unverified 00 DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder Nov 5, 2021 Autonomous Vehicles Image Segmentation
— Unverified 00 Boundary Seeking GANs Jan 1, 2018 Scene Understanding Text Generation
— Unverified 00 APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation Mar 2, 2023 Autonomous Driving Autonomous Navigation
— Unverified 00 DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving Aug 29, 2024 Autonomous Driving Denoising
— Unverified 00 DreamAnywhere: Object-Centric Panoramic 3D Scene Generation Jun 25, 2025 Novel View Synthesis Object
— Unverified 00 M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving Mar 19, 2024 Autonomous Driving Autonomous Vehicles
— Unverified 00 Bottom-up Instance Segmentation using Deep Higher-Order CRFs Sep 8, 2016 Instance Segmentation Object
— Unverified 00 Anticipating Object State Changes in Long Procedural Videos May 21, 2024 Object Object State Change Classification
— Unverified 00 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Jul 9, 2024 Instruction Following Representation Learning
— Unverified 00 DORSal: Diffusion for Object-centric Representations of Scenes et al Jun 13, 2023 Neural Rendering Object
— Unverified 00 Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding Aug 1, 2023 3D geometry 3D Open-Vocabulary Instance Segmentation
— Unverified 00 DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation May 28, 2025 Autonomous Navigation RAG
— Unverified 00 Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding Dec 1, 2021 Disentanglement Domain Adaptation
— Unverified 00