Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization May 8, 2025 Scene Understanding Sound Source Localization
Code Code Available 1Generating Visual Spatial Description via Holistic 3D Scene Understanding May 19, 2023 Scene Understanding Text Generation
Code Code Available 1Holistic 3D Scene Understanding from a Single Image with Implicit Representation Mar 11, 2021 3D Object Detection 3D Shape Reconstruction
Code Code Available 1Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding Nov 30, 2023 GPU Inductive Bias
Code Code Available 1From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Code Code Available 1IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Jun 27, 2022 Autonomous Vehicles Scene Segmentation
Code Code Available 1A Two-Stage Masked Autoencoder Based Network for Indoor Depth Completion Jun 14, 2024 3D Reconstruction Autonomous Driving
Code Code Available 1Context Prior for Scene Segmentation Apr 3, 2020 Scene Segmentation Scene Understanding
Code Code Available 1A Survey on Deep Learning Technique for Video Segmentation Jul 2, 2021 Autonomous Driving Deep Learning
Code Code Available 1Image Masking for Robust Self-Supervised Monocular Depth Estimation Oct 5, 2022 Autonomous Driving Depth Estimation
Code Code Available 14D Panoptic LiDAR Segmentation Feb 24, 2021 4D Panoptic Segmentation Benchmarking
Code Code Available 1Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks Feb 17, 2023 Deblurring Deep Learning
Code Code Available 1F-ViTA: Foundation Model Guided Visible to Thermal Translation Apr 3, 2025 Scene Understanding Style Transfer
Code Code Available 1CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery Jul 11, 2023 Question Answering Scene Understanding
Code Code Available 1CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments May 23, 2024 Pose Estimation Scene Understanding
Code Code Available 1IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation Dec 20, 2019 Disparity Estimation Scene Understanding
Code Code Available 1A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence Jun 22, 2020 Deep Learning Scene Understanding
Code Code Available 1KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D Sep 28, 2021 Multiple Object Tracking Novel View Synthesis
Code Code Available 1Deep learning for radar data exploitation of autonomous vehicle Mar 15, 2022 Autonomous Driving Deep Learning
Code Code Available 1A Survey of World Models for Autonomous Driving Jan 20, 2025 Anomaly Detection Autonomous Driving
Code Code Available 1AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D Scans Mar 24, 2024 3D Instance Segmentation Instance Segmentation
Code Code Available 1Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding Mar 20, 2025 Scene Understanding
Code Code Available 1Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality Mar 11, 2021 Scene Understanding Time Series
Code Code Available 1Learning How To Robustly Estimate Camera Pose in Endoscopic Videos Apr 17, 2023 3D Reconstruction Camera Pose Estimation
Code Code Available 1Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups Jan 12, 2021 Scene Understanding
Code Code Available 1From General to Specific: Informative Scene Graph Generation via Balance Adjustment Aug 30, 2021 Blocking Graph Generation
Code Code Available 1CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes Jul 1, 2024 Autonomous Vehicles Image Segmentation
Code Code Available 1Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding Jan 5, 2019 Domain Adaptation Scene Understanding
Code Code Available 1Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation Oct 30, 2020 Instance Segmentation Panoptic Segmentation
Code Code Available 1LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data Sep 19, 2023 Anomaly Detection Autonomous Driving
Code Code Available 1All-Day Multi-Camera Multi-Target Tracking Jan 1, 2025 All Mamba
Code Code Available 1Learning Triadic Belief Dynamics in Nonverbal Communication from Videos Apr 7, 2021 Scene Understanding
Code Code Available 1FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 1DAF-Net: A Dual-Branch Feature Decomposition Fusion Network with Domain Adaptive for Infrared and Visible Image Fusion Sep 18, 2024 Infrared And Visible Image Fusion Scene Understanding
Code Code Available 1Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering Jun 4, 2021 Meta-Learning Scene Understanding
Code Code Available 1LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation Jun 14, 2017 GPU Scene Understanding
Code Code Available 1LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations Dec 9, 2024 Language Modeling Language Modelling
Code Code Available 1LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Apr 30, 2025 In-Context Learning Object
Code Code Available 1Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild Jul 23, 2020 Few-Shot Object Detection Meta-Learning
Code Code Available 1FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 1DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames Nov 1, 2019 Autonomous Navigation GPU
Code Code Available 1LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving Dec 7, 2022 Autonomous Driving Instance Segmentation
Code Code Available 1Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding Apr 9, 2025 Scene Understanding Self-Supervised Learning
Code Code Available 1MassMIND: Massachusetts Maritime INfrared Dataset Sep 9, 2022 Instance Segmentation Scene Understanding
Code Code Available 1AVSegFormer: Audio-Visual Segmentation with Transformer Jul 3, 2023 Decoder Scene Understanding
Code Code Available 1MGNet: Monocular Geometric Scene Understanding for Autonomous Driving Jun 27, 2022 Autonomous Driving Depth Estimation
Code Code Available 1A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 1AeroRIT: A New Scene for Hyperspectral Image Analysis Dec 17, 2019 Hyperspectral image analysis Image Super-Resolution
Code Code Available 1A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving Oct 22, 2021 Autonomous Driving reinforcement-learning
Code Code Available 1FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation Mar 1, 2021 3D Semantic Segmentation Decoder
Code Code Available 1