DA-RNN: Semantic Mapping with Data Associated Recurrent Neural Networks Mar 9, 2017 Scene Understanding
Improving Social Awareness Through DANTE: A Deep Affinity Network for Clustering Conversational Interactants Jul 24, 2019 Clustering Graph Clustering
Auxiliary Tasks in Multi-task Learning May 16, 2018 Depth Estimation Multi-Task Learning
On the iterative refinement of densely connected representation levels for semantic segmentation Apr 30, 2018 Image Segmentation Scene Understanding
DADA: Driver Attention Prediction in Driving Accident Scenarios Dec 18, 2019 Driver Attention Monitoring Prediction
One model to use them all: Training a segmentation model with complementary datasets Feb 29, 2024 All Anatomy
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs May 15, 2023 Relation Scene Graph Generation
CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation Jan 16, 2025 Novel View Synthesis Scene Understanding
Hierarchical Superpixel Segmentation via Structural Information Theory Jan 13, 2025 graph construction graph partitioning
Object Attribute Matters in Visual Question Answering Dec 20, 2023 Attribute Graph Neural Network
Hierarchical Context Transformer for Multi-level Semantic Scene Understanding Feb 21, 2025 Contrastive Learning Representation Learning
Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis Mar 27, 2019 Generative Adversarial Network Image Generation
Object-aware Sound Source Localization via Audio-Visual Scene Understanding Jan 1, 2025 Scene Understanding Sound Source Localization
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields Mar 17, 2024 3D Reconstruction NeRF
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption Sep 2, 2020 Scene Understanding Segmentation
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Oct 23, 2023 Autonomous Driving Decoder
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting Jul 6, 2021 3D Object Detection Autonomous Driving
Neural Radiance Field Codebooks Jan 10, 2023 Object Representation Learning
Multi-task Planar Reconstruction with Feature Warping Guidance Nov 25, 2023 3D Reconstruction Instance Segmentation
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images Nov 4, 2024 Multi-Task Learning Scene Understanding
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera Jan 9, 2019 3D Reconstruction 3D Scene Reconstruction
ShelfNet for Fast Semantic Segmentation Nov 27, 2018 Autonomous Driving Decoder
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation Oct 31, 2018 3D Object Detection Camera Pose Estimation
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty May 2, 2018 Scene Understanding Sensor Fusion
Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans Dec 1, 2021 4D Panoptic Segmentation Autonomous Navigation
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation Mar 3, 2021 Autonomous Driving Depth Estimation
Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents Nov 27, 2024 Autonomous Navigation Object Recognition
Continual Learning of Unsupervised Monocular Depth from Videos Nov 4, 2023 Autonomous Driving Continual Learning
MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification Jul 25, 2019 Autonomous Vehicles Classification
MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization Nov 26, 2018 2D Object Detection 3D Object Detection
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data Jan 31, 2024 Benchmarking Change Detection
Constructing a Visual Relationship Authenticity Dataset Oct 11, 2020 Relationship Detection Scene Understanding
Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud Mar 23, 2019 3D Object Detection Depth Estimation
MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking Apr 9, 2025 Autonomous Driving Language Modeling
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data Jan 8, 2025 Autonomous Driving Instance Segmentation
Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding Dec 22, 2022 Multi-Label Classification
MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios Dec 27, 2024 Autonomous Driving Language Modeling
MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities Aug 14, 2020 Representation Learning Scene Understanding
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation Nov 16, 2024 Depth Estimation Monocular Depth Estimation
MGNiceNet: Unified Monocular Geometric Scene Understanding Nov 18, 2024 Autonomous Driving Autonomous Vehicles
Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange Apr 11, 2024 Object Scene Understanding
Computational Imaging for Machine Perception: Transferring Semantic Segmentation beyond Aberrations Nov 21, 2022 Domain Adaptation Scene Understanding
MC-PanDA: Mask Confidence for Panoptic Domain Adaptation Jul 19, 2024 Domain Adaptation Panoptic Segmentation
General-Purpose Deep Point Cloud Feature Extractor Mar 12, 2018 3D Object Classification 3D Point Cloud Classification
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models Mar 28, 2016 Scene Understanding
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation Mar 18, 2024 Common Sense Reasoning Efficient Exploration
Matterport3D: Learning from RGB-D Data in Indoor Environments Sep 18, 2017 General Classification Scene Understanding
Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis Jun 28, 2023 Scene Understanding
METEOR Guided Divergence for Video Captioning Dec 20, 2022 Hierarchical Reinforcement Learning Scene Understanding
Model-based inexact graph matching on top of CNNs for semantic scene understanding Jan 18, 2023 Brain Segmentation Deep Learning