HOC-Search: Efficient CAD Model and Pose Retrieval from RGB-D Scans Sep 12, 2023 3D Object Retrieval 3D Scene Reconstruction
Code Code Available 1Human-centric Scene Understanding for 3D Large-scale Scenarios Jul 26, 2023 Action Recognition Scene Understanding
Code Code Available 1Image Segmentation Using Deep Learning: A Survey Jan 15, 2020 Decoder Deep Learning
Code Code Available 1Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization May 8, 2025 Scene Understanding Sound Source Localization
Code Code Available 1GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding Mar 14, 2024 Contrastive Learning Representation Learning
Code Code Available 1Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images Aug 6, 2021 Depth Estimation Panoptic Segmentation
Code Code Available 1Group Contextual Encoding for 3D Point Clouds Dec 1, 2020 Scene Understanding
Code Code Available 1HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation Mar 28, 2023 Panoptic Scene Graph Generation Scene Graph Generation
Code Code Available 1GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields Apr 1, 2024 Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation
Code Code Available 1Grounded Situation Recognition with Transformers Nov 19, 2021 Decoder Grounded Situation Recognition
Code Code Available 1GFF: Gated Fully Fusion for Semantic Segmentation Apr 3, 2019 Scene Parsing Scene Understanding
Code Code Available 1Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Code Code Available 1Generating Visual Spatial Description via Holistic 3D Scene Understanding May 19, 2023 Scene Understanding Text Generation
Code Code Available 1Global Aggregation then Local Distribution in Fully Convolutional Networks Sep 16, 2019 Instance Segmentation object-detection
Code Code Available 1Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection Jan 1, 2021 Common Sense Reasoning Graph Generation
Code Code Available 1Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping Apr 1, 2024 image-classification Image Classification
Code Code Available 1Learning Triadic Belief Dynamics in Nonverbal Communication from Videos Apr 7, 2021 Scene Understanding
Code Code Available 1F-ViTA: Foundation Model Guided Visible to Thermal Translation Apr 3, 2025 Scene Understanding Style Transfer
Code Code Available 1Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation Dec 16, 2021 Feature Importance Scene Understanding
Code Code Available 1From General to Specific: Informative Scene Graph Generation via Balance Adjustment Aug 30, 2021 Blocking Graph Generation
Code Code Available 1Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation Mar 2, 2022 Domain Adaptation Scene Understanding
Code Code Available 1FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation Mar 1, 2021 3D Semantic Segmentation Decoder
Code Code Available 1From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Code Code Available 1Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild Jul 23, 2020 Few-Shot Object Detection Meta-Learning
Code Code Available 1Behind the Curtain: Learning Occluded Shapes for 3D Object Detection Dec 4, 2021 3D Object Detection Object
Code Code Available 1Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 1Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 1FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 1Explainable Object-induced Action Decision for Autonomous Vehicles Mar 20, 2020 Autonomous Driving Autonomous Vehicles
Code Code Available 1Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding Nov 9, 2015 Decision Making Decoder
Code Code Available 1A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 1Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis Mar 9, 2021 3d scene graph generation graph construction
Code Code Available 1Boundary-induced and scene-aggregated network for monocular depth prediction Feb 26, 2021 Depth Estimation Depth Prediction
Code Code Available 1BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection Nov 17, 2022 3D Object Detection Depth Estimation
Code Code Available 1Beyond Appearances: Material Segmentation with Embedded Spectral Information from RGB-D imagery May 17, 2024 Material Classification Material Recognition
Code Code Available 1FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions Oct 4, 2022 Depth Estimation Monocular Depth Estimation
Code Code Available 13DP3: 3D Scene Perception via Probabilistic Programming Oct 30, 2021 Object Pose Estimation
Code Code Available 1Event-based Motion Segmentation with Spatio-Temporal Graph Cuts Dec 16, 2020 Motion Segmentation Scene Understanding
Code Code Available 1Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts Dec 16, 2020 3D Semantic Segmentation Instance Segmentation
Code Code Available 1FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 1Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge Nov 21, 2023 Large Language Model Multimodal Deep Learning
Code Code Available 1Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model Mar 30, 2025 Depth Estimation Monocular Depth Estimation
Code Code Available 1Bidirectional Projection Network for Cross Dimension Scene Understanding Mar 26, 2021 2D Semantic Segmentation 3D Semantic Segmentation
Code Code Available 1AVSegFormer: Audio-Visual Segmentation with Transformer Jul 3, 2023 Decoder Scene Understanding
Code Code Available 1Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond May 11, 2023 Scene Understanding
Code Code Available 1Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding Jan 28, 2022 Graph Attention Knowledge Distillation
Code Code Available 1Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Nov 29, 2024 3D geometry 3DGS
Code Code Available 1EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery Jan 20, 2025 Language Modeling Language Modelling
Code Code Available 1Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation Sep 20, 2021 Decoder Prediction
Code Code Available 1Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments Jul 10, 2022 Instance Segmentation Panoptic Segmentation
Code Code Available 1