Relation-aware Instance Refinement for Weakly Supervised Visual Grounding Mar 24, 2021 Object Relation
Code Code Available 1Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation Dec 22, 2021 Common Sense Reasoning Question Answering
Code Code Available 1GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields Apr 1, 2024 Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation
Code Code Available 1ReorientBot: Learning Object Reorientation for Specific-Posed Placement Feb 22, 2022 Motion Generation Motion Planning
Code Code Available 1RescueNet: A High Resolution UAV Semantic Segmentation Benchmark Dataset for Natural Disaster Damage Assessment Feb 24, 2022 Scene Understanding Segmentation
Code Code Available 1RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction Nov 30, 2020 3D geometry Object
Code Code Available 1Grounded Situation Recognition with Transformers Nov 19, 2021 Decoder Grounded Situation Recognition
Code Code Available 1Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation Jul 23, 2021 Domain Adaptation Few-Shot Learning
Code Code Available 1ROOT: VLM based System for Indoor Scene Understanding and Beyond Nov 24, 2024 Scene Generation Scene Understanding
Code Code Available 1Distilled Semantics for Comprehensive Scene Understanding from Videos Mar 31, 2020 Depth Estimation Knowledge Distillation
Code Code Available 1Generating Visual Spatial Description via Holistic 3D Scene Understanding May 19, 2023 Scene Understanding Text Generation
Code Code Available 1RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models Aug 27, 2024 Descriptive Language Modeling
Code Code Available 1DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection Dec 25, 2023 3D Object Detection object-detection
Code Code Available 1SaccadeNet: A Fast and Accurate Object Detector Mar 26, 2020 Object object-detection
Code Code Available 1General Geometry-aware Weakly Supervised 3D Object Detection Jul 18, 2024 3D Object Detection Object
Code Code Available 1Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model Mar 30, 2025 Depth Estimation Monocular Depth Estimation
Code Code Available 1Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Nov 29, 2024 3D geometry 3DGS
Code Code Available 1Explainable Object-induced Action Decision for Autonomous Vehicles Mar 20, 2020 Autonomous Driving Autonomous Vehicles
Code Code Available 1GFF: Gated Fully Fusion for Semantic Segmentation Apr 3, 2019 Scene Parsing Scene Understanding
Code Code Available 1SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences Mar 27, 2021 3D Object Classification 3d scene graph generation
Code Code Available 1SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments Nov 9, 2020 Autonomous Driving Depth Estimation
Code Code Available 1SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation Nov 2, 2015 Crowd Counting Decoder
Code Code Available 1F-ViTA: Foundation Model Guided Visible to Thermal Translation Apr 3, 2025 Scene Understanding Style Transfer
Code Code Available 1DPF: Learning Dense Prediction Fields with Weak Supervision Mar 29, 2023 Intrinsic Image Decomposition Prediction
Code Code Available 1Boundary-induced and scene-aggregated network for monocular depth prediction Feb 26, 2021 Depth Estimation Depth Prediction
Code Code Available 1Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models Jul 23, 2022 Scene Understanding
Code Code Available 1From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection Jul 30, 2021 3D Object Detection object-detection
Code Code Available 1Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation Aug 9, 2022 3D Instance Segmentation 3D Part Segmentation
Code Code Available 1From General to Specific: Informative Scene Graph Generation via Balance Adjustment Aug 30, 2021 Blocking Graph Generation
Code Code Available 1SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion Sep 1, 2022 Depth Completion Scene Understanding
Code Code Available 1Global Aggregation then Local Distribution in Fully Convolutional Networks Sep 16, 2019 Instance Segmentation object-detection
Code Code Available 1Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving May 13, 2025 3D visual grounding Autonomous Driving
Code Code Available 1FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 1DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction May 9, 2024 Contrastive Learning Scene Understanding
Code Code Available 1Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding Apr 16, 2020 Human Part Segmentation Panoptic Segmentation
Code Code Available 1Dual-Hybrid Attention Network for Specular Highlight Removal Jul 17, 2024 highlight removal Object Recognition
Code Code Available 1FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving Aug 14, 2023 Autonomous Driving Optical Flow Estimation
Code Code Available 1Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds Sep 1, 2021 3D Object Detection 3D Point Cloud Classification
Code Code Available 1Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning May 31, 2022 Common Sense Reasoning Graph Generation
Code Code Available 1Dynamic Graph Message Passing Networks Aug 19, 2019 Image Classification object-detection
Code Code Available 1Dynamic Graph Message Passing Networks for Visual Recognition Sep 20, 2022 image-classification Image Classification
Code Code Available 1Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models May 15, 2023 3D Object Detection Image Captioning
Code Code Available 1A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images Feb 16, 2021 Decision Making Scene Understanding
Code Code Available 1Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild Jul 23, 2020 Few-Shot Object Detection Meta-Learning
Code Code Available 1Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation Mar 8, 2024 Depth Estimation Monocular Depth Estimation
Code Code Available 1FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation Mar 1, 2021 3D Semantic Segmentation Decoder
Code Code Available 1ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data Nov 17, 2021 3D Object Detection object-detection
Code Code Available 13UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding Jan 14, 2025 Language Modeling Language Modelling
Code Code Available 1Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation Dec 24, 2021 Depth Estimation Depth Prediction
Code Code Available 1FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions Oct 4, 2022 Depth Estimation Monocular Depth Estimation
Code Code Available 1